Duplicate Search Workflow

Use duplicate search processes to identify and merge constituent records that represent the same individual or organization. A regular search and merge routine helps maintain accurate constituent data and supports reliable reporting and analysis.

Choose a duplicate search process

Decide which type of duplicate search process to use. Both processes identify potential duplicates based on scores for parameters you configure.
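As a rough illustration of score-based matching, the sketch below compares two records field by field and flags them when the weighted score reaches a threshold. It is not the product's matching algorithm. The field names, weights, threshold, and the function names match_score and is_possible_duplicate are all hypothetical, chosen only to show how configured parameters could feed a single score.

```python
from difflib import SequenceMatcher

# Hypothetical field weights and threshold (assumptions for illustration only;
# these are not the product's actual settings).
WEIGHTS = {"last_name": 0.4, "first_name": 0.2, "postal_code": 0.4}
THRESHOLD = 0.85


def similarity(a: str, b: str) -> float:
    """Return a 0..1 similarity ratio, ignoring case and outer whitespace."""
    return SequenceMatcher(None, a.strip().lower(), b.strip().lower()).ratio()


def match_score(rec_a: dict, rec_b: dict) -> float:
    """Weighted sum of per-field similarities between two records."""
    return sum(
        weight * similarity(rec_a.get(field, ""), rec_b.get(field, ""))
        for field, weight in WEIGHTS.items()
    )


def is_possible_duplicate(rec_a: dict, rec_b: dict) -> bool:
    """Flag the pair when its score reaches the configured threshold."""
    return match_score(rec_a, rec_b) >= THRESHOLD


if __name__ == "__main__":
    a = {"last_name": "Smith", "first_name": "Jon", "postal_code": "29492"}
    b = {"last_name": "Smith", "first_name": "John", "postal_code": "29492"}
    print(round(match_score(a, b), 3), is_possible_duplicate(a, b))
```

In a sketch like this, raising the threshold returns fewer pairs to review but can miss true duplicates, while lowering it does the reverse.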

Run a full duplicate search

Run the full duplicate search process to evaluate the entire database.

For details, see Run the Full Duplicate Constituent Search Process.

Warning: The duration of this process varies based on the number of records and the configuration options you select. It can take an extended amount of time.

Review possible duplicates

Run the Duplicate Constituents Report to view constituent records identified as possible duplicates.

When you run the report, select whether to view results from the most recent SSIS process or from the most recent full or incremental search. For details, see Run the Duplicate Constituents Report.

Verify duplicate records

Review each pair of records in the report and determine whether the records represent the same constituent.

Confirm which records are true duplicates that should be merged.

For records that aren't duplicates, correct or clarify the information on their constituent records.

If you make updates, run the duplicate search or SSIS process again, and then run the Duplicate Constituents Report to verify the updated results. See Run the Duplicate Constituents Report.

Merge confirmed duplicates

Run the merge process to combine confirmed duplicate records.

After the merge completes, run the full search process again. Then run the Duplicate Constituents Report to verify that the records merged correctly. For details about merges, see Merge Duplicate Constituents.

Maintain ongoing data integrity

Continue to run search and merge processes until the searches no longer identify duplicates that need to be merged.

After you complete a full search, you can reduce processing time by running incremental searches or partial SSIS searches. These processes are faster because they compare only records added or updated since the most recent search.
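To see why limiting the comparison to recent changes saves time, consider the following sketch. It is not the product's implementation. It assumes each record carries a hypothetical date_changed value, and that a changed record is compared against every other record in the database. The function names are illustrative.

```python
from datetime import datetime
from itertools import combinations


def full_search_pairs(records):
    """Every unordered pair of record IDs: n * (n - 1) / 2 comparisons."""
    return list(combinations(sorted(r["id"] for r in records), 2))


def incremental_search_pairs(records, last_search):
    """Only pairs that involve at least one record changed after last_search."""
    changed = {r["id"] for r in records if r["date_changed"] > last_search}
    pairs = set()
    for changed_id in changed:
        for other in records:
            if other["id"] != changed_id:
                # Store each pair in sorted order so (1, 4) and (4, 1) collapse.
                pairs.add(tuple(sorted((changed_id, other["id"]))))
    return sorted(pairs)


if __name__ == "__main__":
    last_search = datetime(2024, 1, 1)
    records = [
        {"id": 1, "date_changed": datetime(2023, 6, 1)},
        {"id": 2, "date_changed": datetime(2023, 7, 1)},
        {"id": 3, "date_changed": datetime(2023, 8, 1)},
        {"id": 4, "date_changed": datetime(2024, 2, 1)},  # updated since last search
    ]
    print(len(full_search_pairs(records)))
    print(incremental_search_pairs(records, last_search))
```

Under these assumptions, the number of comparisons grows with the number of changed records rather than with the square of the database size.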

Schedule either full and incremental duplicate search processes or full and partial SSIS duplicate search processes to run at regular intervals.

Combined with the automatic duplicate searches that run during data entry, these strategies help maintain an accurate database.

For more information about how to schedule full and incremental search processes, see Configure Duplicate Search Process Job Schedules. For more information about how to schedule SSIS duplicate search processes, see SSIS Duplicate Search Processes.

Note: The duplicate search and merge processes work in tandem. Groups of duplicates identified by searches provide the data sources for merge processes. For this reason, we recommend that you schedule the search and merge processes to run at similar intervals.
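The relationship between search results and merge input can be pictured with a small sketch. It assumes, purely for illustration, that a search yields pairs of matching record IDs and that a merge works on groups of connected records. The function group_duplicates is hypothetical and does not represent how the product stores or groups its results.

```python
def group_duplicates(pairs):
    """Combine matched ID pairs into groups using a simple union-find."""
    parent = {}

    def find(x):
        # Register unseen IDs as their own root, then walk up to the root,
        # shortening the path along the way.
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for a, b in pairs:
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            # Attach the larger root under the smaller one.
            parent[max(root_a, root_b)] = min(root_a, root_b)

    groups = {}
    for record_id in list(parent):
        groups.setdefault(find(record_id), []).append(record_id)
    return sorted(sorted(group) for group in groups.values())


if __name__ == "__main__":
    # Records 1, 2, and 3 are linked through shared matches; 7 and 8 form a second group.
    print(group_duplicates([(1, 2), (2, 3), (7, 8)]))
```

In this picture, a merge that runs long after the search would work from groups that no longer reflect recent changes, which is one way to read the recommendation to keep the two schedules aligned.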