Class FingerprintKeyer
- java.lang.Object
-
- org.openrefine.clustering.binning.Keyer
-
- org.openrefine.clustering.binning.FingerprintKeyer
-
- Direct Known Subclasses:
NGramFingerprintKeyer
public class FingerprintKeyer extends Keyer
Fingerprint keyer where fingerprint is sorted list of unique words after case and diacritic folding and removing all punctuation. Word boundary is any whitespace character, while output key has words joined with a single ASCII space character.
-
-
Field Summary
Fields Modifier and Type Field Description static Pattern
DIACRITICS_AND_FRIENDS
-
Constructor Summary
Constructors Constructor Description FingerprintKeyer()
-
Method Summary
All Methods Static Methods Instance Methods Concrete Methods Deprecated Methods Modifier and Type Method Description protected String
asciify(String s)
Deprecated.by tfmorris 2020-07-07 Usenormalize(String)
or {normalize(String, boolean)
String
key(String s, Object... o)
protected String
normalize(String s)
protected String
normalize(String s, boolean strong)
protected static String
stripDiacritics(String str)
-
-
-
Field Detail
-
DIACRITICS_AND_FRIENDS
public static final Pattern DIACRITICS_AND_FRIENDS
-
-
Method Detail
-
asciify
@Deprecated protected String asciify(String s)
Deprecated.by tfmorris 2020-07-07 Usenormalize(String)
or {normalize(String, boolean)
-
-