Skip to Content
Practical Automated Machine Learning on Azure
book

Practical Automated Machine Learning on Azure

by Deepak Mukunthu, Parashar Shah, Wee Hyong Tok
September 2019
Intermediate to advanced
196 pages
4h 11m
English
O'Reilly Media, Inc.
Content preview from Practical Automated Machine Learning on Azure

Chapter 4. Feature Engineering and Automated Machine Learning

Feature engineering is one of the most important parts of the data science process. If you ask data scientists to break down the time spent in each stage of the data science process, you’ll often hear that they spend a significant amount of time understanding and exploring the data, and doing feature engineering. Most experienced data scientists do not jump into model building. Rather, they first spend time doing feature engineering.

But what is feature engineering? With feature engineering, you can transform your original data into a form that is more easily understood by the machine learning algorithms. For example, you might perform data processing, add new features (e.g., additional data columns that combine values from existing columns), or you might transform the features from their original domain to a different domain. You might also remove features that are not useful or relevant to the model. When doing feature engineering, you will generate new features, transform existing features, or select a subset of features.

To illustrate how you can transform features, let’s consider a simple example of working with categorical features (otherwise known as categorical variables). Suppose that you have a dataset for an airline customer program with a feature called Status, which determines the status of the customers (e.g., based on how often the customer flies, total miles traveled, and others). Status contains the ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

AirBnbBlueOriginElectronic ArtsHomeDepotNasdaqRakutenTata Consultancy Services

QuotationMarkO’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.
Julian F.
Head of Cybersecurity
QuotationMarkI wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.
Addison B.
Field Engineer
QuotationMarkI’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.
Amir M.
Data Platform Tech Lead
QuotationMarkI'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.
Mark W.
Embedded Software Engineer

You might also like

Automated Machine Learning with Microsoft Azure

Automated Machine Learning with Microsoft Azure

Dennis Sawyers
Mastering Azure Machine Learning

Mastering Azure Machine Learning

Christoph Körner, Kaijisse Waaijer
Mastering Azure Machine Learning - Second Edition

Mastering Azure Machine Learning - Second Edition

Christoph Körner, Marcel Alsdorf
Azure AI Services at Scale for Cloud, Mobile, and Edge

Azure AI Services at Scale for Cloud, Mobile, and Edge

Simon Bisson, Mary Branscombe, Chris Hoder, Anand Raman

Publisher Resources

ISBN: 9781492055587Errata Page