Hands-on NLP with Spark and Stanford CoreNLP

The first example covered in this chapter involves a Scala Spark wrapper of the Stanford CoreNLP (https://github.com/stanfordnlp/CoreNLP) library, which is open source and released under the GNU General Public License v3 (https://www.gnu.org/licenses/gpl-3.0.en.html). Stanford CoreNLP is a Java library that provides a set of natural language analysis tools. Its basic distribution provides model files for the analysis of English, but the engine is compatible with models for other languages as well. It is stable, ready for production, and widely used in both academia and industry. Spark CoreNLP (https://github.com/databricks/spark-corenlp) is a wrapper of the Stanford CoreNLP Java library ...
