3 Text embeddings

This chapter covers

  • Preparing texts for deep learning using word and document embeddings
  • Using self-developed vs. pretrained embeddings
  • Implementing word similarity with Word2Vec
  • Retrieving documents using Doc2Vec

After reading this chapter, you will have a practical command of basic and popular text embedding algorithms, and you will have developed insight into how to use embeddings for NLP. We will go through a number of concrete scenarios to reach that goal. But first, let’s review the basics of embeddings.

3.1 Embeddings

Embeddings are procedures for converting input data into vector representations. As mentioned in chapter 1, a vector is like a container (such as an array) containing numbers. Every vector lives in a multidimensional ...

Get Deep Learning for Natural Language Processing now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.