Studying the Titanic

To perform the data analysis, we'll be using the Titanic dataset from Kaggle.

This dataset is simple to understand and does not require any domain understanding to derive insights.

This dataset contains the details of each passenger on the Titanic and also whether they survived or not.

The following are the field descriptions:

Field

Descriptions

survival

Survival(0 = No, 1 = Yes)

pclass

Passenger class(1 = 1st, 2 = 2nd, 3 = 3rd)

name

Name of the passenger

sex

Gender of the passenger

age

Age of the passenger

sibsp

Number of siblings/spouses aboard

parch

Number of parents/children aboard

ticket

Ticket number

fare

Passenger fare

cabin

Cabin

embarked

Port of embarkation

(C = Cherbourg, ...

Get Mastering Python for Data Science now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.