June 2025
Beginner to intermediate
473 pages
13h 30m
English
Of the three programming languages covered in this book, Python is best suited for web scraping. However, simple tasks can also be solved using PowerShell. The Invoke-WebRequest cmdlet returns a BasicHtmlWebResponseObject, which provides access to selected properties of the response to the request. This cmdlet does not give you a full DOM, but for some tasks, the included properties are sufficient.
|
Property |
Meaning |
|---|---|
|
Content |
HTML code (string) |
|
Headers |
The returned headers (dictionary with strings) |
|
Status |
The HTML status code of the response (integer) |
|
Images |
List of all images contained in the HTML code |
|
InputFields |
List of all input fields in the forms of the HTML page |
Read now
Unlock full access