book

Python: Real World Machine Learning

by Prateek Joshi, John Hearty, Bastiaan Sjardin, Luca Massaron, Alberto Boschetti

November 2016

Beginner to intermediate

941 pages

21h 55m

English

Packt Publishing

Read now

Unlock full access

Content preview from Python: Real World Machine Learning

Tackling class imbalance

Until now, we dealt with problems where we had a similar number of datapoints in all our classes. In the real world, we might not be able to get data in such an orderly fashion. Sometimes, the number of datapoints in one class is a lot more than the number of datapoints in other classes. If this happens, then the classifier tends to get biased. The boundary won't reflect of the true nature of your data just because there is a big difference in the number of datapoints between the two classes. Therefore, it becomes important to account for this discrepancy and neutralize it so that our classifier remains impartial.

How to do it…

Let's load the data:

input_file = 'data_multivar_imbalance.txt' X, y = utilities.load_data(input_file) ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Start your free trial

Interpretable Machine Learning with Python

Publisher Resources

ISBN: 9781787123212Supplemental Content Purchase Link

Python: Real World Machine Learning

by Prateek Joshi, John Hearty, Bastiaan Sjardin, Luca Massaron, Alberto Boschetti

Tackling class imbalance

How to do it…

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

You might also like

Interpretable Machine Learning with Python

Large Scale Machine Learning with Python

Hands-On Machine Learning with scikit-learn and Scientific Python Toolkits

Python Machine Learning Cookbook - Second Edition

Publisher Resources