November 2018
Beginner to intermediate
182 pages
4h 48m
English
The simplest explanation is from the spaCy docs (spacy-101) itself.
First, the raw text is split on whitespace characters, similar to text.split (' '). Then, the tokenizer processes the text from left to right. On each substring, it performs two checks:

Read now
Unlock full access