O'Reilly logo

Mastering Python for Data Science by Samir Madhavan

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

Studying the Titanic

To perform the data analysis, we'll be using the Titanic dataset from Kaggle.

This dataset is simple to understand and does not require any domain understanding to derive insights.

This dataset contains the details of each passenger on the Titanic and also whether they survived or not.

The following are the field descriptions:

Field

Descriptions

survival

Survival(0 = No, 1 = Yes)

pclass

Passenger class(1 = 1st, 2 = 2nd, 3 = 3rd)

name

Name of the passenger

sex

Gender of the passenger

age

Age of the passenger

sibsp

Number of siblings/spouses aboard

parch

Number of parents/children aboard

ticket

Ticket number

fare

Passenger fare

cabin

Cabin

embarked

Port of embarkation

(C = Cherbourg, ...

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, interactive tutorials, and more.

Start Free Trial

No credit card required