Using Stanford MaxentTagger

The MaxentTagger class uses a model to perform the tagging task. There are a number of models that come bundled with the API, all with the file extension .tagger. They include English, Chinese, Arabic, French, and German models. The English models are listed here. The prefix, wsj, refers to models based on the Wall Street Journal. The other terms refer to techniques used to train the model. These concepts are not covered here:

  • wsj-0-18-bidirectional-distsim.tagger
  • wsj-0-18-bidirectional-nodistsim.tagger
  • wsj-0-18-caseless-left3words-distsim.tagger
  • wsj-0-18-left3words-distsim.tagger
  • wsj-0-18-left3words-nodistsim.tagger
  • english-bidirectional-distsim.tagger
  • english-caseless-left3words-distsim.tagger
  • english-left3words-distsim.tagger ...

Get Natural Language Processing with Java - Second Edition now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.