O'Reilly logo

Practical Web Scraping for Data Science: Best Practices and Examples with Python by Bart Baesens, Seppe vanden Broucke

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

© Seppe vanden Broucke and Bart Baesens 2018

Seppe vanden Broucke and Bart Baesens, Practical Web Scraping for Data Science, https://doi.org/10.1007/978-1-4842-3582-9_3

3. Stirring the HTML and CSS Soup

Seppe vanden Broucke and Bart Baesens2

(1)KU Leuven, Leuven, Belgium

(2)Dept of Decision Sci & Info Managem, KU Leuven Dept of Decision Sci & Info Managem, Leuven, Belgium

So far we have discussed the basics of HTTP and how you can perform HTTP requests in Python using the requests library. However, since most web pages are formatted using the Hypertext Markup Language (HTML), we need to understand how to extract information from such pages. As such, this chapter introduces you to HTML, as well as another core building block that is used to format ...

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, interactive tutorials, and more.

Start Free Trial

No credit card required