12.2 TextBlob¹
TextBlob is an object-oriented NLP text-processing library that is built on the NLTK and pattern NLP libraries and simplifies many of their capabilities. Some of the NLP tasks TextBlob can perform include:
Tokenization—splitting text into pieces called tokens, which are meaningful units, such as words and numbers.
Parts-of-speech (POS) tagging—identifying each word’s part of speech, such as noun, verb, adjective, etc.
Noun phrase extraction—locating groups of words that represent nouns, such as “red brick factory.”²