January 2018
Beginner to intermediate
284 pages
8h 35m
English
In the Skip-Gram model, we generate pairs of words for training as follows:

A positive training pair can be generated as follows:

From these illustrations, one can easily see that the network learns its statistics from the number of times each (target, context) pairing shows up. For example, the model may see more samples of (York, New) or (New, York) rather ...
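The pair generation described above can be sketched in a few lines of Python. This is only a minimal illustration, not the book's own code: the function name `skipgram_pairs`, the window size of 1, and the two toy sentences are all assumptions made for the example.

```python
from collections import Counter


def skipgram_pairs(tokens, window=2):
    """Return (target, context) pairs for each token and its neighbours.

    `window` is the number of positions considered on each side of the
    target word (a hypothetical parameter name chosen for this sketch).
    """
    pairs = []
    for i, target in enumerate(tokens):
        lo = max(0, i - window)                 # clip at the sentence start
        hi = min(len(tokens), i + window + 1)   # clip at the sentence end
        for j in range(lo, hi):
            if j != i:                          # a word is not its own context
                pairs.append((target, tokens[j]))
    return pairs


# Toy corpus, assumed purely for illustration.
corpus = [
    "i love new york".split(),
    "new york is big".split(),
]

# Count how often each (target, context) pairing shows up across the corpus.
counts = Counter(
    pair
    for sentence in corpus
    for pair in skipgram_pairs(sentence, window=1)
)

for pair, n in counts.most_common(3):
    print(pair, n)
```

In this toy corpus the pairings (new, york) and (york, new) each occur twice, while every other pairing occurs once. Those counts are the kind of co-occurrence statistic the network picks up during training.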