© Karthik Ramasubramanian and Abhishek Singh 2017

Karthik Ramasubramanian and Abhishek Singh, Machine Learning Using R, 10.1007/978-1-4842-2334-5_2

2. Data Preparation and Exploration

Karthik Ramasubramanian and Abhishek Singh (1)

(1) New Delhi, Delhi, India

In our introductory chapter, we emphasized applying machine learning (ML) algorithms through a simplified process flow. In this chapter, we go deeper into the first block of that process flow: data exploration and preparation.

The subject of data exploration was formally introduced by John W. Tukey almost four decades ago in his book Exploratory Data Analysis (EDA). The methods discussed in the book were profound, and there aren’t many software programs that include all of ...
