12 Data distributions

This chapter covers

Applying statistical principles of distributions in machine learning
Understanding the differences between curated and uncurated datasets
Using population, sampling, and subpopulation distributions
Applying distribution concepts when training a model

As a data scientist and educator, I get a lot of questions from software engineers on how to improve the accuracy of a model. The five basic answers I give out to increase the accuracy of a model are as follows:

Increase training time.
Increase the depth (or width) of the model.
Add regularization.
Expand the dataset with data augmentation.
Increase hyperparameter tuning.

These are the five most likely places to address, and often working ...

Get Deep Learning Patterns and Practices now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.

Start your free trial

Deep Learning Patterns and Practices by Andrew Ferlitsch

12 Data distributions

Don’t leave empty-handed

It’s yours, free.

Check it out now on O’Reilly