Package com.bakdata.dedupe.duplicate_detection
Provides base interfaces and implementations for finding duplicate clusters.
-
Interface Summary Interface Description DuplicateDetection<C extends java.lang.Comparable<C>,T> A duplicate detection algorithm processes a dataset of records and returns the distinctClusters.PossibleDuplicateHandler<T> A callback that is invoked when aClassification.POSSIBLE_DUPLICATEhas been founds duringDuplicateDetection.