Chapter 7. Entity Recognition

Extracting information out of unstructured text data is a tedious process, because of the complex nature of natural language. Even after advancements in the field of Natural language processing (NLP), we are far from the point where any unrestricted text can be analyzed and the meaning can be extracted for general purposes. However, if we just focus on a specific set of questions, we can extract a significant amount of information from the text data. Named entity recognition helps identify the important entities in a text, to be able to derive the meaning from the unstructured data. It is a vital component of NLP applications, for example, question-answering systems, product discovery on e-commerce websites, and so ...

Get Mastering Text Mining with R now with O’Reilly online learning.

O’Reilly members experience live online training, plus books, videos, and digital content from 200+ publishers.