Package com.bakdata.dedupe.duplicate_detection
Provides base interfaces and implementations for finding duplicate clusters.
Interface Summary

Interface                                                  Description
DuplicateDetection<C extends java.lang.Comparable<C>,T>    A duplicate detection algorithm processes a dataset of records and returns the distinct Clusters.
PossibleDuplicateHandler<T>                                A callback that is invoked when a Classification.POSSIBLE_DUPLICATE has been found during DuplicateDetection.
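
The relationship between the two interfaces can be illustrated with a minimal, self-contained sketch. All of the types below (SketchClassification, PossibleDuplicateHandlerSketch, NaiveDuplicateDetectionSketch and their method names) are hypothetical stand-ins invented for illustration; they are not the signatures of com.bakdata.dedupe, which this page does not show. The sketch assumes a pairwise classifier with three outcomes and a naive strategy that compares each record with the first member of every existing cluster.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical three-way outcome; only POSSIBLE_DUPLICATE is named in the package summary.
enum SketchClassification { DUPLICATE, POSSIBLE_DUPLICATE, NON_DUPLICATE }

// Hypothetical callback: invoked for every pair classified as POSSIBLE_DUPLICATE.
interface PossibleDuplicateHandlerSketch<T> {
    void onPossibleDuplicate(T left, T right);
}

// Hypothetical detection algorithm: consumes records and returns distinct clusters.
final class NaiveDuplicateDetectionSketch<T> {

    // Hypothetical pairwise classifier supplied by the caller.
    interface Classifier<T> {
        SketchClassification classify(T left, T right);
    }

    private final Classifier<T> classifier;
    private final PossibleDuplicateHandlerSketch<T> handler;

    NaiveDuplicateDetectionSketch(Classifier<T> classifier,
                                  PossibleDuplicateHandlerSketch<T> handler) {
        this.classifier = classifier;
        this.handler = handler;
    }

    // Compares each record with the first member of every existing cluster.
    // A DUPLICATE joins that cluster; a POSSIBLE_DUPLICATE is reported to the
    // handler and otherwise treated like a non-duplicate.
    List<List<T>> detect(List<T> records) {
        List<List<T>> clusters = new ArrayList<>();
        for (T record : records) {
            List<T> target = null;
            for (List<T> cluster : clusters) {
                T representative = cluster.get(0);
                SketchClassification result = classifier.classify(representative, record);
                if (result == SketchClassification.DUPLICATE) {
                    target = cluster;
                    break;
                }
                if (result == SketchClassification.POSSIBLE_DUPLICATE) {
                    handler.onPossibleDuplicate(representative, record);
                }
            }
            if (target == null) {
                // No certain match: the record starts a new cluster.
                target = new ArrayList<>();
                clusters.add(target);
            }
            target.add(record);
        }
        return clusters;
    }
}
```

With a classifier that treats equal strings as duplicates and strings equal up to case as possible duplicates, the records "ann", "bob", "ann", "Ann" would yield three clusters in this sketch, and the handler would be called once for the pair ("ann", "Ann"). Keeping the handler separate from the classifier lets the caller decide what to do with uncertain pairs (log them, queue them for manual review) without changing the detection logic.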