Skip to Content
Hands-On Automated Machine Learning
book

Hands-On Automated Machine Learning

by Sibanjan Das, Umit Mert Cakmak
April 2018
Beginner to intermediate content levelBeginner to intermediate
282 pages
6h 52m
English
Packt Publishing
Content preview from Hands-On Automated Machine Learning

Data understanding

This is the phase where you develop an understanding of the data sources that you will use throughout the project.

In this phase, you are typically trying to address the following items:

  • Clearing data access and authorization issues.
  • Loading data into a platform of preference for initial analysis.
  • Being aware of sensitive information and performing necessary operations, such as anonymizing or deletion of sensitive data.
  • Identifying datasets to be used.
  • Identifying data schema and getting field descriptions.
  • Determining the quantity for each dataset and identifying discrepancies. For example, you check if the variables that are present in different tables have the same datatype, for example, a variable could be an integer ...
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Start your free trial

You might also like

Automated Machine Learning

Automated Machine Learning

Adnan Masood
R: Unleash Machine Learning Techniques

R: Unleash Machine Learning Techniques

Raghav Bali, Dipanjan Sarkar, Brett Lantz, Cory Lesmeister

Publisher Resources

ISBN: 9781788629898Supplemental Content