Configuring Matching Algorithms

A matching algorithm allows a user to define:

  • The match criteria is what qualifies objects as duplicates.
  • The match action is what the system should do with such duplicates.

Matching algorithms are used in Match and Merge (here) and Match and Link (here) solutions.

Tool available for tuning and monitoring the results of the matching algorithm are defined in the Match Tuning topic here.

Configuration

Use the following steps to create a matching algorithm:

  1. In System Setup, right-click the node configured to house matching algorithms and select New Matching Algorithm.

  1. In the Create Matching Algorithm dialog, define an ID and Name for the matching algorithm.
  • Check the Embed Match Codes checkbox so the match codes are embedded in the algorithms.

  • If the Embed Match Codes checkbox is not checked (legacy functionality), you must manually create a match code and link it to the matching algorithm. For more information, see the Match Codes topic here.

  • Click Create to display the Matching Algorithm object.

    Initially, the Configuration Validation Status flipper shows a red X. That indicator changes as the required elements are provided and configured correctly. See the last step in this topic for more information.

  1. In the Definition flipper, for the Matching Context parameter, specify the context to run the matching algorithm. By default, the current context is set.

  2. For the Matching Workspace parameter, specify the workspace to run the matching algorithm. By default, the Main workspace is selected.
  3. For the Duplicate Type parameter, click the ellipsis button (). In the 'Select a Duplicate Reference Type' dialog, select the appropriate reference type as defined in the component model. For more information, see the Configuring Matching Component Model topic here.

Note: The Confirmed Duplicate reference type must be multivalued, inheritance None, no dimension dependencies, and not externally maintained.

  1. For the Non-Duplicate Type parameter, click the ellipsis button (). In the 'Select a Duplicate Reference Type' dialog, select the appropriate reference type as defined in the component model. For more information, see the Configuring Matching Component Model topic here.

Note: The Confirmed Non-Duplicate reference type must be multivalued, inheritance None, no dimension dependencies, and not externally maintained. In Match and Link solutions, a valid "Confirmed Justification" attribute can be made valid on the reference type.

  1. In the Global Binds flipper, potentially improve the performance by creating global binds to obtain all attribute values used in the decision table comparison.

The matching process can strain performance. When processing large sets of data, there is potentially a significant performance gain if the matching functionality can fetch the values for matching before the matching process begins. This fetching of data is possible via global binds configured on the matching algorithm, where the matching algorithm logic uses attributes that are bound to specific variable names. The system fetches the values for the attributes before the match criteria logic is applied and can be referenced from both JavaScript and STEP functions.

Important: Global binds are not optimized for use with In-Memory.

  • Click the Edit Global Binds link to open the 'Edit Binds' dialog shown below.

  • Click the Add Bind button to create a new bind.

  • For Variable name, specify a variable name for the bind.

  • For Binds to, select a bind from the dropdown (some binds are displayed within a group).

  • For Parameters, when available, click the ellipsis button () to specify an object to bind.

  • Click OK to close the dialog and return to the Matching Algorithm object.

  1. Open the Evaluator flipper, select two objects to test the selected criteria on a data set.

  1. Set up the match action as needed. For more information, see the Match Actions topic here.

  1. Set up the survivorship rules as needed. For more information, see the Configuring Survivorship Rules topic here.

  2. Verify your matching algorithm configuration status and take any necessary action.

    • A red 'X' displays when the configuration is invalid. Open the flipper to view the errors that must be addressed. Correct any errors shown before running the matching algorithm.

    • A yellow checkmark indicates warnings that should be addressed.

    • A green checkmark indicates the matching algorithm has a valid configuration.