David Graus

Computing string similarity with TF-IDF and Python
October 3, 2011
“The tf–idf weight (term frequency–inverse document frequency) is a weight often used in information retrieval and text mining. This weight is a statistical measure used to evaluate how important a word is to a document in a collection or corpus.”[wikipedia] It is also the weight I use to measure similarity between texts, for these two […]
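As a rough illustration of the idea in the excerpt above, here is a minimal sketch of scoring the similarity of short texts with tf-idf weights. It is not the code from the post: the whitespace tokenizer, the plain `log(N / df)` idf variant, the use of cosine similarity, the helper names (`tokenize`, `tfidf_vectors`, `cosine_similarity`), and the three sample sentences are all assumptions made for the example.

```python
import math
from collections import Counter


def tokenize(text):
    # Assumption: lowercase and split on whitespace; real text needs better tokenization.
    return text.lower().split()


def tfidf_vectors(docs):
    """Return one sparse {term: weight} dict per document."""
    tokenized = [tokenize(doc) for doc in docs]
    n_docs = len(tokenized)

    # Document frequency: in how many documents does each term occur?
    df = Counter()
    for tokens in tokenized:
        df.update(set(tokens))

    vectors = []
    for tokens in tokenized:
        counts = Counter(tokens)
        total = len(tokens)
        # tf = relative frequency in the document; idf = log(N / df), one common variant.
        vectors.append(
            {term: (count / total) * math.log(n_docs / df[term])
             for term, count in counts.items()}
        )
    return vectors


def cosine_similarity(a, b):
    """Cosine of the angle between two sparse term-weight vectors."""
    dot = sum(weight * b.get(term, 0.0) for term, weight in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


# Hypothetical sample texts, used only to exercise the functions.
docs = [
    "the cat sat on the mat",
    "the cat lay on the rug",
    "stock prices fell sharply today",
]
vectors = tfidf_vectors(docs)
sim_related = cosine_similarity(vectors[0], vectors[1])
sim_unrelated = cosine_similarity(vectors[0], vectors[2])
print(sim_related, sim_unrelated)
```

With this idf variant, a term that occurs in every document gets weight zero, so only terms that distinguish documents contribute to the score. The first two sentences share some terms and score above zero, while the third shares none with the first and scores exactly zero.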
