org.candelbio.multitool.nlp
levenshtein
(levenshtein str1 str2)
overexpressed
(overexpressed freq base-freq)
remove-numbers
(remove-numbers tokenized)
remove-ruthlessly
(remove-ruthlessly tokenized)
remove-shorts
(remove-shorts tokenized)
remove-stops
(remove-stops tokenized stops)