February 2018
Beginner to intermediate
364 pages
10h 32m
English
You can the sample Python script. It will crawl a single Wikipedia page using Scrapy. The page it will crawl is the Python page at https://en.wikipedia.org/wiki/Python_(programming_language), and collect relevant links on that page.
When run you will see the similar output to the following:
/Users/michaelheydt/anaconda/bin/python3.6 /Users/michaelheydt/Dropbox/Packt/Books/PyWebScrCookbook/code/py/08/05_wikipedia_scrapy.pyparsing: https://en.wikipedia.org/wiki/Python_(programming_language)parsing: https://en.wikipedia.org/wiki/C_(programming_language)parsing: https://en.wikipedia.org/wiki/Object-oriented_programmingparsing: https://en.wikipedia.org/wiki/Ruby_(programming_language)parsing: https://en.wikipedia.org/wiki/Go_(programming_language) ...
Read now
Unlock full access