6

Data Imbalance in Deep Learning

Class imbalanced data is a common issue for deep learning models. When one or more classes have significantly fewer samples, the performance of deep learning models can suffer as they tend to prioritize learning from the majority class, resulting in poor generalization for the minority class(es).

A lot of real-world data is imbalanced, which presents challenges to deep learning classification tasks. Figure 6.1 shows some common categories of imbalanced data problems in various deep learning applications:

Figure 6.1 – Some common categories of imbalanced data problems

We will cover the following topics in ...

Get Machine Learning for Imbalanced Data now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.