The Natural Language Processing Workshop

by Rohan Chopra, Aniruddha M. Godbole, Nipun Sadvilkar, Muzaffar Bashir Shah, Sohom Ghosh, Dwight Gunning, Ankit Bhatia, Nagendra Nagaraj, John Bura, Sumit Kumar Raj, Tom Taulli, Ankit Verma
August 2020
Content level: Beginner to intermediate
452 pages
7h 42m
English
Packt Publishing
Content preview from The Natural Language Processing Workshop

6. Vector Representation

Overview

This chapter introduces you to the various ways in which text can be represented in the form of vectors. You will start by learning why this is important and what the different types of vector representation are. You will then perform one-hot encoding on words, using the preprocessing package provided by scikit-learn, and character-level encoding, both manually and using the Keras library. After covering learned word embeddings and pre-trained embeddings, you will use Word2Vec and Doc2Vec to build vector representations for Natural Language Processing (NLP) tasks, such as finding the level of similarity between multiple texts.
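As a rough illustration of the manual character-level encoding mentioned above, here is a minimal sketch in plain Python. The function name `one_hot_encode_chars` and the sample string are hypothetical and chosen only for this example; they are not taken from the book, which may approach the exercise differently.

```python
def one_hot_encode_chars(text, alphabet=None):
    """Return (alphabet, matrix), where matrix[i] is the one-hot vector for text[i].

    Hypothetical helper for illustration only. If no alphabet is given,
    it is built from the sorted distinct characters of the text itself.
    """
    if alphabet is None:
        alphabet = sorted(set(text))

    # Map each character to its column position in the vector.
    index = {ch: i for i, ch in enumerate(alphabet)}

    matrix = []
    for ch in text:
        row = [0] * len(alphabet)
        # Raises KeyError if ch is missing from a caller-supplied alphabet.
        row[index[ch]] = 1
        matrix.append(row)
    return alphabet, matrix


alphabet, vectors = one_hot_encode_chars("data")
print(alphabet)
for ch, vec in zip("data", vectors):
    print(ch, vec)
```

Each character becomes a vector with a single 1 at the position of that character in the alphabet, so the vector length equals the alphabet size. This is simple, but it grows with the vocabulary and carries no notion of similarity, which is part of the motivation for learned embeddings.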

Introduction

The previous chapters laid a firm foundation for NLP. But now we will ...


Publisher Resources

ISBN: 9781800208421