January 2019
Intermediate to advanced
378 pages
8h 27m
English
We have all the URLs for our stories, but, unfortunately, this isn't enough to train on; we'll need the full article body. This in itself could become a huge challenge if we want to roll our own scraper, especially if we are going to be pulling stories from dozens of sites. We would need to write code to target the article body while carefully avoiding all the other site gunk that surrounds it. Fortunately, as far as we are concerned, there are a number of free services that will do this for us. I'm going to be using Embedly to do this, but there are a number of other services that you could use instead.
The first step is to sign up for Embedly API access. You can do that at https://app.embed.ly/signup ...
Read now
Unlock full access