NLP Basics

This section scratches the surface of the complex field of NLP. The previous chapters have mentioned some of the basics that are necessary for dealing with textual data (for example, tokenization) without going into much detail. Here, we'll go one step further into the basic understanding of this discipline. Because the field is complex and has many aspects, we take a pragmatic approach and favor practical examples over theoretical foundations.

Text preprocessing

An essential part of any NLP system is the preprocessing pipeline. Before we can perform any interesting task on a piece of text, we must first convert it into a useful representation.
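To make the idea of a preprocessing pipeline concrete, here is a minimal sketch using only the Python standard library. It is not the pipeline used in the book: the function name preprocess, the regular expression, and the tiny stop-word list are all assumptions chosen for illustration. It turns a raw string into a list of lowercase tokens with common function words removed, which is one possible "useful representation".

```python
import re

# Tiny illustrative stop-word list (an assumption for this sketch,
# far smaller than the lists shipped with real NLP libraries).
STOP_WORDS = {"a", "an", "the", "is", "of", "and", "to", "in"}

# A token is a run of letters or digits, optionally followed by an
# apostrophe and more letters, so that "don't" stays in one piece.
TOKEN_PATTERN = re.compile(r"[a-z0-9]+(?:'[a-z]+)?")


def preprocess(text):
    """Lowercase the text, split it into tokens, and drop stop words."""
    tokens = TOKEN_PATTERN.findall(text.lower())
    return [token for token in tokens if token not in STOP_WORDS]


if __name__ == "__main__":
    sentence = "The preprocessing pipeline is an essential part of any NLP system."
    print(preprocess(sentence))
    # ['preprocessing', 'pipeline', 'essential', 'part', 'any', 'nlp', 'system']
```

A regular expression keeps the sketch free of external dependencies. A dedicated tokenizer from an NLP library would handle punctuation, contractions, and social media conventions such as hashtags and mentions more carefully.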

In the previous chapters, we already performed ...

Mastering Social Media Mining with Python by Marco Bonzanini
