July 2018
Beginner
162 pages
3h 25m
English
After training, we have a new document and we want to find its document vector. We'll use the word similarities learned during training to construct a vector that will predict the words in the new document. We will use a dummy filename since the actual name is not important. What's important is that it's just one name. So, all of these words get connected together under that one name:

Once we get that new document vector, we can compare it with other document vectors and find which known document from the past is the most similar, as follows:
Thus, we can use Doc2Vec to find which documents are most similar to each other. ...
Read now
Unlock full access