3

Starting Our Travel – Surviving the Titanic Disaster

In this chapter, we will start our journey around the data world. The first dataset we will analyze is from the competition Titanic - Machine Learning from Disaster (refer Reference 1 at the end of this chapter for a link to this dataset). It is a rather small dataset and, because it is related to a competition, it is split between train and test sets.

In this chapter, besides the competition approach, we will introduce our systematic approach to exploratory data analysis and apply it to get familiar with the data, understand it in more detail, and extract useful insights. We will also provide a short introduction to the process of using the results of data analysis to build model training ...

Get Developing Kaggle Notebooks now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.