© Karthik Ramasubramanian and Abhishek Singh 2019
Karthik Ramasubramanian and Abhishek Singh, Machine Learning Using R, https://doi.org/10.1007/978-1-4842-4215-5_2

2. Data Preparation and Exploration

Karthik Ramasubramanian (1) and Abhishek Singh (1)
(1) New Delhi, Delhi, India

The introductory chapter presented a simplified process flow for applying machine learning (ML) algorithms. In this chapter, we go deeper into the first block of that process flow: data exploration and preparation.

The subject of data exploration was formally introduced by John W. Tukey almost four decades ago in his book Exploratory Data Analysis (EDA). The methods discussed in the book were profound and there aren’t many software programs that ...
