Skip to Content
Practical Data Analysis Cookbook
book

Practical Data Analysis Cookbook

by Tomasz Drabas
April 2016
Beginner to intermediate content levelBeginner to intermediate
384 pages
8h 36m
English
Packt Publishing
Content preview from Practical Data Analysis Cookbook

Chapter 9. Natural Language Processing

In this chapter, you will learn the following recipes:

  • Reading raw text from the Web
  • Tokenizing and normalizing text
  • Identifying parts of speech, handling n-grams, and recognizing named entities
  • Identifying the topic of an article
  • Identifying the sentence structure
  • Classifying movies based on their reviews

Introduction

Modeling based on structured data gathered via a controlled experiment (as we were doing in previous chapters) is relatively straightforward. However, in the real world, we rarely deal with structured data. This is especially true when it comes to understanding human-generated feedback or analyzing an article in a newspaper.

Natural Language Processing (NLP) is a discipline of computer science, statistics, ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Start your free trial

You might also like

Python Data Analysis Cookbook

Python Data Analysis Cookbook

Ivan Idris
Practical Simulations for Machine Learning

Practical Simulations for Machine Learning

Paris Buttfield-Addison, Mars Buttfield-Addison, Tim Nugent, Jon Manning

Publisher Resources

ISBN: 9781783551668Supplemental Content