We learned that, in the CBOW model, we try to predict the target word given the context words, so it takes some number of context words as an input and returns one target word as an output. In CBOW model with a single context word, we will have only one context word, that is, . So, the network takes only one context word as an input and returns one target word as an output.
Before going ahead, first, let's familiarize ourselves with the notations. All the unique words we have in our corpus is called the vocabulary ...