April 2019
Intermediate to advanced
544 pages
17h 29m
English
Chapter 2. Build your vocabulary (word tokenization)
Listing 2.1 Example Monticello sentence split into tokens
Listing 2.2 One-hot vector sequence for the Monticello sentence
Listing 2.3 Prettier one-hot vectors
Listing 2.4 Construct a DataFrame of bag-of-words vectors
Listing 2.5 Example dot product calculation
Listing 2.6 Overlap of word counts for two bag-of-words vectors
Listing 2.7 Tokenize the Monticello sentence with a regular expression
Chapter 3. Math with words (TF-IDF vectors)
Chapter 4. Finding meaning in word counts (semantic analysis)