Chapter 2Data Collection and Preprocessing
WHAT'S IN THIS CHAPTER
- Sources to obtain training data
- Techniques to explore data
- Techniques to impute missing values
- Feature engineering techniques
In the previous chapter, you were given a general overview of machine learning, and learned about the different types of machine learning systems. In this chapter you will learn to use NumPy, Pandas, and Scikit-learn to perform common feature engineering tasks.
- NOTE To follow along with this chapter ensure you have installed Anaconda Navigator and Jupyter Notebook as described in Appendix A.
You can download the code files for this chapter from
www.wiley.com/go/machinelearningawscloud
or from GitHub using the following URL:
Get Machine Learning in the AWS Cloud now with the O’Reilly learning platform.
O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.