Skip to Content
Data Wrangling with Python
book

Data Wrangling with Python

by Dr. Tirthajyoti Sarkar, Shubhadeep Roychowdhury
February 2019
Beginner to intermediate
452 pages
7h 6m
English
Packt Publishing
Content preview from Data Wrangling with Python

Chapter 5

Getting Comfortable with Different Kinds of Data Sources

Learning Objectives

By the end of this chapter, you will be able to:

  • Read CSV, Excel, and JSON files into pandas DataFrames
  • Read PDF documents and HTML tables into pandas DataFrames
  • Perform basic web scraping using powerful yet easy to use libraries such as Beautiful Soup
  • Extract structured and textual information from portals

In this chapter, you will be exposed to real-life data wrangling techniques, as applied to web scraping.

Introduction

So far in this book, we have focused on learning pandas DataFrame objects as the main data structure for the application of wrangling techniques. Now, we will learn about various techniques by which we can read data into a DataFrame ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Start your free trial

You might also like

Data Wrangling with Python

Data Wrangling with Python

Jacqueline Kazil, Katharine Jarmul
Python: End-to-end Data Analysis

Python: End-to-end Data Analysis

Phuong Vothihong, Martin Czygan, Ivan Idris, Magnus Vilhelm Persson, Luiz Felipe Martins

Publisher Resources

ISBN: 9781789800111Supplemental Content