7 Natural Language Processing for Analyzing Unstructured Data

DOI: 10.1201/9781003278177-7

Before discussing Natural Language Processing (NLP) tasks and techniques, it is important to understand why NLP is needed for analytical purposes. The data available for any research or analysis endeavor can be in either a structured or an unstructured format. Earlier chapters focused on analyzing structured (i.e., tabular) data, and in the process we learned how to uncover hidden information in the data using different analytical techniques.

Similarly, unstructured data (e.g., text, images, and audio) also contain hidden information that can be mined. For example, the social media site Twitter contains a corpus of tweets, each about ...
