November 2017
Beginner to intermediate
366 pages
7h 59m
English
The term frequency (tf) for a given word w is the number of times the word occurs in document d. Conveniently, we can write it as tf(w, d). In our case, this is how many times it appears in a tweet. Term frequencies are called local weights. They indicate the importance of a word in a document. Many times, they are normalized by dividing it by the number of words in a document, that is the length of the document. The higher the value of tf of a word in a document, the higher the importance.
The document frequency or df for a given word w is the number of documents in which the word has occurred. Document frequency is considered the global weight.
Inverse document frequency or idf is calculated ...
Read now
Unlock full access