December 2018
Beginner to intermediate
684 pages
21h 9m
English
To pass a larger number of documents through the processing pipeline, we can use spaCy's streaming API as follows:
iter_texts = (bbc_articles[i] for i in range(len(bbc_articles)))for i, doc in enumerate(nlp.pipe(iter_texts, batch_size=50, n_threads=8)): assert doc.is_parsed