The following table lists the options for the Filter stage.
Option Name | Description / Valid Values |
---|---|
Group by |
Specifies the field to use to create groups of records to filter. The Filter stage will retain one or more records from each group, depending on how you configure the stage. In cases where you have used a matching stage earlier in the dataflow, such as Interflow Match, Intraflow Match, or Transactional Match, you should select the CollectionNumber field to use the collections created by the matching stage as the groups. However, if you want to group records by some other field, choose the field here. For example, if you want to filter out all but one record from records that have the same value in the AccountNumber field, you would select AccountNumber. |
Sort |
If you specify a field in the Group by field, check this box to sort the records by the value in the field you chose. This option is enabled by default. |
Advanced |
Click this button to specify sort performance options. By default, the sort performance options specified in Management Console, which are the default performance options for your system, are in effect. If you want to override your system's default performance options, check the Override sort performance options box, then specify the values you want in these fields:
Note: The optimal sort performance settings depend on your server's hardware configuration. You can use this equation as a general guideline to produce good sort performance:
|
Limit number of returned duplicate records |
Specifies the maximum number of records that are returned from each group. If you set this option to 1, you can define filter rules to determine which record in each group is returned. If no rules are defined, the first record in each group is returned and the rest are discarded. For example, if you define a rule that retains the record with the highest match score in a group, and you set this option to 1, then the record with the highest match score in each group survives and the other records in the group are discarded. If you set this option to a value higher than 1, you cannot specify filter rules.
Note: If no records in the group meet the defined rule criteria, no records from the group are returned.
|
Remove duplicates from collection |
Specifies that filter rules determine which records are removed from the collection. The remaining records in the collection are retained. When this option is selected, you must define a rule.
Note: If a group contains only one record, the filter rules are ignored and the record is retained.
|
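The two filtering modes described in the table can be sketched in Python as follows. This is a minimal illustration of the documented behavior, not the stage's actual implementation: the function names, the `Name` field, and the sample records are hypothetical, and the sort-then-group approach is an assumption made for the sketch.

```python
from itertools import groupby
from operator import itemgetter


def group_records(records, group_field):
    """Sort by the group field, then yield one list of records per group."""
    ordered = sorted(records, key=itemgetter(group_field))  # stable sort
    for _, members in groupby(ordered, key=itemgetter(group_field)):
        yield list(members)


def limit_returned(records, group_field, limit=1):
    """Sketch of 'Limit number of returned duplicate records' with no rules:
    keep the first `limit` records of each group."""
    kept = []
    for members in group_records(records, group_field):
        kept.extend(members[:limit])
    return kept


def remove_duplicates(records, group_field, rule):
    """Sketch of 'Remove duplicates from collection': drop records that match
    `rule`, except that single-record groups are always retained."""
    kept = []
    for members in group_records(records, group_field):
        if len(members) == 1:
            kept.extend(members)  # rules are ignored for one-record groups
        else:
            kept.extend(r for r in members if not rule(r))
    return kept


# Hypothetical sample data: two records in collection 1, one in collection 2.
records = [
    {"CollectionNumber": 1, "Name": "A", "MatchScore": 90},
    {"CollectionNumber": 1, "Name": "B", "MatchScore": 75},
    {"CollectionNumber": 2, "Name": "C", "MatchScore": 60},
]
first_only = limit_returned(records, "CollectionNumber", limit=1)
low_removed = remove_duplicates(
    records, "CollectionNumber", lambda r: r["MatchScore"] < 80
)
```

In this sketch both calls keep records A and C. Record C survives `remove_duplicates` even though it matches the rule, because it is the only record in its group.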
Rule Options
Filter rules determine which records in a group to retain or remove. If you select the option Limit number of returned duplicate records, the rules determine which records survive the filter. If you select the option Remove duplicates from collection, the rules determine which records are removed from the dataflow.
To add a rule, select Rules in the rule hierarchy and click Add Rule.
If you specify multiple rules, you must select a logical operator to use between rules. Choose And if you want both the new rule and the previous rule to pass in order for the condition to be met. Choose Or if you want either the previous rule or the new rule to pass in order for the condition to be met.
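As a rough illustration of how And and Or join rules, the sketch below combines predicates in Python. It assumes rules are evaluated left to right, which the text above does not specify. The `combine_rules` helper, the `State` field, and the threshold of 80 are all hypothetical.

```python
def combine_rules(rules):
    """Build one predicate from [(connector, predicate), ...].

    The connector on the first entry is ignored. Each later entry uses
    "And" or "Or" to join with the result of the rules before it
    (assumed left-to-right evaluation).
    """
    def combined(record):
        result = bool(rules[0][1](record))
        for connector, predicate in rules[1:]:
            passed = bool(predicate(record))
            if connector == "And":
                result = result and passed
            elif connector == "Or":
                result = result or passed
            else:
                raise ValueError(f"unknown connector: {connector!r}")
        return result
    return combined


# Two hypothetical rules.
def high_score(record):
    return record["MatchScore"] >= 80


def in_ny(record):
    return record["State"] == "NY"


both = combine_rules([(None, high_score), ("And", in_ny)])
either = combine_rules([(None, high_score), ("Or", in_ny)])

record = {"MatchScore": 85, "State": "CA"}
print(both(record), either(record))  # prints: False True
```

The sample record passes only the first rule, so the And combination fails while the Or combination passes.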
Option | Description |
---|---|
Field name |
Specifies the name of the dataflow field whose value you want to evaluate to determine whether to filter the record. |
Field Type |
Specifies the type of data in the field. One of the following:
|
Operator |
Specifies the type of comparison you want to use to evaluate the field. One of the following:
|
Value type |
Specifies the type of value you want to compare to the field's value. One of the following: Note: This option is not available if you select the operator Highest, Lowest, or Longest.
|
Value |
Specifies the value to compare to the field's value. If you selected Field in the Value type field, select a dataflow field. If you selected String in the Value type field, type the value you want to use in the comparison. Note: This option is not available if you select the operator Highest, Lowest, or Longest.
|
Example of a Filter Rule
This rule retains the record in each group with the highest value in the MatchScore field. Note that the Value and Value Type options do not apply when the Operator is Highest, Lowest, or Longest.
Field Name = MatchScore
Field Type = Numeric
Operator = Highest
This rule retains the record where the value in the AccountNumber field is "12345".
Field Name = AccountNumber
Field Type = Numeric
Operator = Equals
Value Type = String
Value = 12345
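The two example rules can be approximated in Python as shown below, applied to a single group with the record limit set to 1. This is a sketch under assumptions, not the product's logic. The helper names and sample records are hypothetical, ties for the highest value are resolved by taking the first record, and the Equals comparison converts both sides to numbers because the Field Type is Numeric.

```python
def retain_highest(members, field):
    """Operator = Highest: keep the record with the largest value in `field`.
    On a tie, max() returns the first such record (an assumption here)."""
    if not members:
        return []
    return [max(members, key=lambda r: r[field])]


def retain_equal(members, field, value):
    """Operator = Equals with a limit of 1: keep the first record whose
    numeric field equals `value`, or nothing if no record matches."""
    matches = [r for r in members if float(r[field]) == float(value)]
    return matches[:1]


# One hypothetical group of two records.
group = [
    {"AccountNumber": 12345, "MatchScore": 82},
    {"AccountNumber": 67890, "MatchScore": 97},
]

best = retain_highest(group, "MatchScore")
account = retain_equal(group, "AccountNumber", "12345")
nothing = retain_equal(group, "AccountNumber", "99999")
```

Here `best` holds the record with MatchScore 97 and `account` holds the record for account 12345. `nothing` is empty, consistent with the note that no records are returned from a group when none meet the rule criteria.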