Text Mining & Natural Language Understanding at Scale

by David Talby, Claudiu Branzan

Released July 2016

Publisher(s): O'Reilly Media, Inc.

ISBN: 9781491964293

Start your free trial

Video description

A text mining system must go way beyond indexing and search to appear truly intelligent. First, it should understand language beyond keyword matching. For example, it should be able to distinguish the critical difference between “Jane has the flu” and “Jane had the flu when she was 9.” Second, it should be capable of making likely inferences even if they’re not explicitly written. For example, inferring that Jane may have the flu if she has had a fever, headache, fatigue, and runny nose for three days. And third, it should do its work as part of a robust, scalable, efficient, and easy to extend system. This course teaches software engineers and data scientists how to build intelligent natural language understanding (NLU) based text mining systems at scale using Java, Scala, and Spark for distributed processing.

Learn the meaning of natural language understanding (NLU) and its use in text mining
Discover how to build a natural language processing (NLP) pipeline within a big data framework
Recognize the differences between NLP pipelines and other approaches to semantic text mining
Learn about standard UIMA annotators, custom annotators, and machine learned annotators
Discover how different types of annotators are composed into a text processing pipeline
Use machine learning to generate annotators and apply them within a data pipeline
See pipeline architectures that incorporate Kafka, Spark, SparkSQL, Cassandra, and ElasticSearch

David Talby (PhD , Computer Science, Hebrew University) and Claudio Branzan (Masters, Industrial Intelligent Systems, Polytechnic University of Timișoara) work for big data analytics firm Atigeo. David is CTO and Claudio runs the Modeling and Predictive Analytics team. David and Claudio co-presented on text mining and natural language understanding at O'Reilly's Strata+Hadoop World London 2016 conference.

Product information

Title: Text Mining & Natural Language Understanding at Scale
Author(s): David Talby, Claudiu Branzan
Release date: July 2016
Publisher(s): O'Reilly Media, Inc.
ISBN: 9781491964293

book

Mastering Text Mining with R

by Ashish Kumar, Avinash Paul

Master text-taming techniques and build effective text-processing applications with R About This Book Develop all the …

video

Deep Learning for Natural Language Processing: Applications of Deep Neural Networks to Machine Learning Tasks

by Jon Krohn

5+ Hours of Video Instruction An intuitive introduction to processing natural language data with Deep Learning …

book

Text Analytics with Python: A Practitioner's Guide to Natural Language Processing

by Dipanjan Sarkar

Leverage Natural Language Processing (NLP) in Python and learn how to set up your own robust …

book

Applied Natural Language Processing in the Enterprise

by Ankur A. Patel, Ajay Uppili Arasanipalai

NLP has exploded in popularity over the last few years. But while Google, Facebook, OpenAI, and …

Text Mining & Natural Language Understanding at Scale

Video description

Table of contents

Product information

You might also like

Mastering Text Mining with R

Deep Learning for Natural Language Processing: Applications of Deep Neural Networks to Machine Learning Tasks

Text Analytics with Python: A Practitioner's Guide to Natural Language Processing

Applied Natural Language Processing in the Enterprise

Don’t leave empty-handed

It’s yours, free.

Check it out now on O’Reilly