Hands-on NLP with Spark and Stanford CoreNLP

The first example covered in this chapter involves a Scala Spark wrapper of the Stanford CoreNLP (https://github.com/stanfordnlp/CoreNLP) library, which is open source and released under the GNU General Public License v3 (https://www.gnu.org/licenses/gpl-3.0.en.html). It is a Java library that provides a set of natural language analysis tools. Its basic distribution provides model files for the analysis of English, but the engine is compatible with models for other languages as well. It is stable, production-ready, and widely used across academia and industry. Spark CoreNLP (https://github.com/databricks/spark-corenlp) is a wrapper of the Stanford CoreNLP Java library ...

