12 Data distributions
This chapter covers
- Applying statistical principles of distributions in machine learning
- Understanding the differences between curated and uncurated datasets
- Using population, sampling, and subpopulation distributions
- Applying distribution concepts when training a model
As a data scientist and educator, I get a lot of questions from software engineers on how to improve the accuracy of a model. The five basic answers I give out to increase the accuracy of a model are as follows:
-
Increase training time.
-
Increase the depth (or width) of the model.
-
Add regularization.
-
Expand the dataset with data augmentation.
-
Increase hyperparameter tuning.
These are the five most likely places to address, and often working ...
Get Deep Learning Patterns and Practices now with the O’Reilly learning platform.
O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.