Building a new dataset with the NER annotation tool

There are many annotation tools available in different forms. Some are standalone and can be configured or installed on a local machine, some are cloud-based, some are free, and some are paid. In this section, we will focus on free annotation tools, get an idea of how to use them, and see what we can achieve with annotation.

To see how we can use annotations to create a dataset, we will look at these tools:

  • brat
  • Stanford Annotator

brat stands for brat rapid annotation tool and can be found at http://brat.nlplab.org/index.html. It can be used online or offline. Installing it on your local machine is simple: follow the steps listed at http://brat.nlplab.org/installation.html. Once installed ...

Get Natural Language Processing with Java - Second Edition now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.