© Karthik Ramasubramanian and Abhishek Singh 2019
Karthik Ramasubramanian and Abhishek Singh, Machine Learning Using R, https://doi.org/10.1007/978-1-4842-4215-5_2

2. Data Preparation and Exploration

Karthik Ramasubramanian (1) and Abhishek Singh (1)
(1) New Delhi, Delhi, India

In the introductory chapter, we presented a simplified process flow for applying machine learning (ML) algorithms. In this chapter, we go deeper into the first block of that process flow: data exploration and preparation.

The subject of data exploration was formally introduced by John W. Tukey almost four decades ago in his book Exploratory Data Analysis (EDA). The methods discussed in the book were profound, and there aren’t many software programs that ...
