Configuring Matching Algorithms
A matching algorithm allows a user to define:
- The match criteria: what qualifies objects as duplicates.
- The match action: what the system should do with such duplicates.
Matching algorithms are used in Match and Merge and Match and Link solutions.
Tools for tuning and monitoring the results of a matching algorithm are described in the Tuning and Monitoring a Matching Algorithm topic.
Configuration
Use the following steps to create a matching algorithm:
- In System Setup, right-click the node configured to house matching algorithms and select New Matching Algorithm.
- In the Create Matching Algorithm dialog, define an ID and Name for the matching algorithm.
- Check the Embed Match Codes checkbox so the match codes are embedded in the algorithm.
- If the Embed Match Codes checkbox is not checked (legacy functionality), you must manually create a match code and link it to the matching algorithm. For more information, see the Match Codes topic.
- Click Create to display the Matching Algorithm object.
Initially, the Configuration Validation Status flipper shows a red X. That indicator changes as the required elements are provided and configured correctly. See the last step in this topic for more information.
- In the Definition flipper, for the Matching Context parameter, specify the context in which to run the matching algorithm. By default, the current context is set.
- For the Matching Workspace parameter, specify the workspace in which to run the matching algorithm. By default, the Main workspace is selected.
- For the Duplicate Type parameter, click the ellipsis button. In the 'Select a Duplicate Reference Type' dialog, select the appropriate reference type as defined in the component model. For more information, see the Configuring Match Criteria topic.
- For the Non-Duplicate Type parameter, click the ellipsis button. In the 'Select a Duplicate Reference Type' dialog, select the appropriate reference type as defined in the component model. For more information, see the Configuring Match Criteria topic.
- In the Global Binds flipper, optionally create global binds to obtain all attribute values used in the decision table comparison. This can improve performance.
The matching process can strain performance. When processing large sets of data, there is potentially a significant performance gain if the matching functionality can fetch the values for matching before the matching process begins. This is possible via global binds configured on the matching algorithm, where the matching algorithm logic uses attributes that are bound to specific variable names. The system fetches the attribute values before the match criteria logic is applied, and the bound values can then be referenced from both JavaScript and STEP functions.
Important: Global binds are not optimized for use with In-Memory.
- Click the Edit Global Binds link to open the 'Edit Binds' dialog.
- Click the Add Bind button to create a new bind.
- For Variable name, specify a variable name for the bind.
- For Binds to, select a bind from the dropdown (some binds are displayed within a group).
- For Parameters, when available, click the ellipsis button to specify an object to bind.
- Click OK to close the dialog and return to the Matching Algorithm object.
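The performance argument for global binds can be illustrated with a minimal sketch. This is not the product's API: `CountingStore`, `prefetch`, `compare_all`, and the attribute and variable names are all hypothetical, chosen only to show why fetching bound values once per object is cheaper than reading them again for every pair that is compared.

```python
from itertools import combinations


class CountingStore:
    """Hypothetical stand-in for a slow attribute source; counts reads."""

    def __init__(self, data):
        self._data = data
        self.reads = 0

    def read(self, object_id, attribute):
        self.reads += 1
        return self._data[object_id][attribute]

    def ids(self):
        return sorted(self._data)


def prefetch(store, binds):
    """Read each bound attribute once per object, keyed by variable name."""
    return {
        object_id: {
            variable: store.read(object_id, attribute)
            for variable, attribute in binds.items()
        }
        for object_id in store.ids()
    }


def compare_all(bound_values, criterion):
    """Apply the criterion to every pair using only the prefetched values."""
    return {
        (a, b): criterion(bound_values[a], bound_values[b])
        for a, b in combinations(sorted(bound_values), 2)
    }


store = CountingStore({
    "c1": {"FirstName": "Ann", "Email": "ann@example.com"},
    "c2": {"FirstName": "Anne", "Email": "ann@example.com"},
    "c3": {"FirstName": "Bob", "Email": "bob@example.com"},
})

# Hypothetical binds: variable name -> attribute ID (assumed names).
binds = {"firstName": "FirstName", "email": "Email"}

bound = prefetch(store, binds)
same_email = compare_all(bound, lambda x, y: x["email"] == y["email"])
```

In this sketch the store is read once per object and bind, regardless of how many pairs are later compared. Reading inside the comparison instead would repeat the same lookups for every pair.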
- If the Match Criteria flipper is displayed (only for legacy algorithms where match codes are not embedded and must be created manually), configure the following. For more information, see the Match Criteria topic.
- Click the Add Criterion link to display the 'Select Match Criterion' dialog.
- Specify a Name.
- Choose a match criterion from the Select Match Criterion dropdown.
- Click the Add button.
- Click into the Criterion field and then click the ellipsis button to open the editor.
- Create the matching criterion and click OK.
- Click into the Weight field and specify a weight for the criterion.
- Open the Evaluator flipper and select two objects to test the selected criteria on a data set.
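As a rough mental model of how named criteria and weights might interact when two objects are evaluated, consider the sketch below. It assumes the overall score is a weighted average of per-criterion scores on a 0 to 100 scale. That formula is an assumption made for illustration and is not stated in this topic, and the comparator functions and attribute names are hypothetical.

```python
def exact(a, b):
    """Hypothetical comparator: 100 for identical values, otherwise 0."""
    return 100.0 if a == b else 0.0


def case_insensitive(a, b):
    """Hypothetical comparator: ignores letter case."""
    return 100.0 if a.lower() == b.lower() else 0.0


def weighted_score(obj_a, obj_b, criteria):
    """Combine criterion scores as a weighted average (assumed formula).

    criteria is a list of (attribute, comparator, weight) tuples.
    """
    total_weight = sum(weight for _, _, weight in criteria)
    if total_weight == 0:
        raise ValueError("at least one criterion needs a non-zero weight")
    weighted = sum(
        comparator(obj_a[attribute], obj_b[attribute]) * weight
        for attribute, comparator, weight in criteria
    )
    return weighted / total_weight


criteria = [
    ("email", case_insensitive, 3),  # weighted more heavily
    ("name", exact, 1),
]

a = {"name": "Ann Lee", "email": "Ann@Example.com"}
b = {"name": "Anne Lee", "email": "ann@example.com"}

score = weighted_score(a, b, criteria)  # email matches, name does not
```

Under these assumptions the pair scores 75.0, because the matching email carries three of the four weight units. Raising the weight of a criterion increases its influence on the overall result.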
- Set up the match action as needed. For more information, see the Match Actions topic.
- Set up the survivorship rules as needed. For more information, see the Configuring Survivorship Rules topic.
- Verify your matching algorithm configuration status and take any necessary action.
- A red 'X' displays when the configuration is invalid. Open the flipper to view the errors that must be addressed. Correct any errors shown before running the matching algorithm.
- A yellow checkmark indicates warnings that should be addressed.
- A green checkmark indicates the matching algorithm has a valid configuration.
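The three status indicators can be summarized as a small decision function. This is a sketch only: the function name and the list-based inputs are hypothetical, and it assumes that errors take precedence over warnings when both are present.

```python
def configuration_status(errors, warnings):
    """Map validation findings to the indicator described above.

    Assumption: any error makes the configuration invalid,
    even if warnings are also present.
    """
    if errors:
        return "red X"
    if warnings:
        return "yellow checkmark"
    return "green checkmark"


# Hypothetical findings, for illustration only.
status = configuration_status(
    errors=[],
    warnings=["example warning"],
)
```

Here `status` is "yellow checkmark": the algorithm has no errors, but the warning should still be reviewed.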