O'Reilly logo

Python 3 Text Processing with NLTK 3 Cookbook by Jacob Perkins

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

Chapter 5. Extracting Chunks

In this chapter, we will cover the following recipes:

  • Chunking and chinking with regular expressions
  • Merging and splitting chunks with regular expressions
  • Expanding and removing chunks with regular expressions
  • Partial parsing with regular expressions
  • Training a tagger-based chunker
  • Classification-based chunking
  • Extracting named entities
  • Extracting proper noun chunks
  • Extracting location chunks
  • Training a named entity chunker
  • Training a chunker with NLTK-Trainer

Introduction

Chunk extraction, or partial parsing, is the process of extracting short phrases from a part-of-speech tagged sentence. This is different from full parsing in that we're interested in standalone chunks, or phrases, instead of full parse trees (for more on parse ...

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, interactive tutorials, and more.

Start Free Trial

No credit card required