Summary

In this chapter, you learned how to clean unstructured text data and then plotted a wordcloud out of this data. You learned how to tokenize words and sentences using NLTK. You learned how to perform parts of speech tagging and also the concepts of stemming and lemmatization. You were introduced to Named Entity Recognition and learned how to apply it using Stanford NER. Finally, you learned how to fetch tweets using the Twitter API and then perform sentiment analysis on it.

In the next chapter, you'll learn how to use Python in the world of big data.

Get Mastering Python for Data Science now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.