Match Criteria
The match criteria are responsible for matching records against each other to find those that match. When users are only interested in exact matches, the match criteria are reasonably straightforward.
If the SSN (Social Security Number) for two customer objects or the EAN (European Article Number) for two product objects are identical, the records are likely duplicates and the matching criteria should return 100 percent. If the SSN or EAN does not match, the match criteria should probably return 0 percent.
In many cases you cannot work with exact matches; instead, you will deal with approximate matches or a combination of exact and approximate matches. For example, for a customer you do not have a SSN available so you will identify duplicates based on names, mailing addresses, phone numbers, and street addresses. For a product, you will identify duplicates based on the manufacturer and manufacturer part number.
This data can have variations, even in objects that represent the same real-world entity. Names and addresses can be spelled differently, middle names could be omitted, abbreviations can be used in names and addresses, the customers could be registered with different phone numbers or mailing addresses, and other options that introduce ambiguity to the records.
This complexity can be handled via a decision table in the match criteria logic, which further divides the functionality into normalizers, matchers, and rules.
Match Criteria Tab
The Match Criteria tab defines how to compare two objects and evaluate to what degree they are similar. It is separated into the following flippers:
- Data Elements are the evaluation tools for the matchers and match code generators and allow data to be presented in a format that is easy to compare, and often includes normalization.
- Matchers are the field-level comparison logic that specifies how to match a type of data.
- Rules determine how field-level matchers resolve if two objects are a match or are not a match.
- Match Code Generators identify the records that should be compared for the applicable objects. Only records with at least one equal match code are then evaluated via rules. This allows efficient matching on a dataset of thousands or millions of objects instead of comparing every object with every other object.
- Match Code Filter allows users to filter values that meet criteria set on a Transformation Lookup Table.
- Evaluator allows a user to compare of two objects using the other settings.
Other Match Criteria
For match algorithms without embedded match codes, several legacy options exist for match criteria.
Important: Create new match algorithms with embedded match codes as defined in the Configuring Matching Algorithms topic here.
Match code generators are only available when match codes are not embedded into the matching algorithm. For more information, see the Match Codes topic here.