Table of Contents
Preface
Part 1: NLP Basics
1
Introduction to NLP
Introduction to natural language processing
NLU + NLG = NLP
NLU
NLG
Gensim and its NLP modeling techniques
BoW and TF-IDF
LSA/LSI
Word2Vec
Doc2Vec
LDA
Ensemble LDA
Topic modeling with BERTopic
Common NLP Python modules included in this book
spaCy
NLTK
Summary
Questions
References
2
Text Representation
Technical requirements
What word embedding is
Simple encoding methods
One-hot encoding
BoW
Bag-of-N-grams
What TF-IDF is
Shining applications of BoW and TF-IDF
Coding – BoW
Gensim for BoW
scikit-learn for BoW (CountVectorizer)
Coding – Bag-of-N-grams
Gensim for N-grams
scikit-learn for N-grams
NLTK for N-grams
Coding – TF-IDF
Gensim for TF-IDF
scikit-learn for TF-IDF
Summary
Questions ...
Get The Handbook of NLP with Gensim now with the O’Reilly learning platform.
O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.