SSIS Duplicate Search Processes

As an alternative to the full and incremental search processes included in the program, you can run SQL Server Integration Services (SSIS) search packages to identify duplicates. You can run full or partial SSIS search processes at scheduled times. You can also customize the searches. For example, you can adjust the similarity settings for fuzzy matching or add match fields to the search, such as birth date. To use SSIS duplicate search processes, you must first load the package.

Note: For more information about how to load, customize, or schedule the SSIS duplicate search packages, see the SDK Development Guides.

  • Full search

    The full duplicate search evaluates all constituent records in the database. This search returns thorough results, but large databases may require several hours to run it. For that reason, the full search should be run on an infrequent basis.

  • Partial search

    The partial duplicate search evaluates only constituents records added since the specified date.

To view the list of potential duplicates found in the last full or partial SSIS search process, run the Duplicate Constituents Report. For more information, see Run the Duplicate Constituents Report.