Basic naive Bayes classifier baseline

As per the rules of the challenge, the participants had to outperform the basic naive Bayes classifier to qualify for prizes, which makes an assumption that features are independent (refer to Chapter 1, Applied Machine Learning Quick Start).

The KDD Cup organizers run the vanilla naive Bayes classifier, without any feature selection or hyperparameter adjustments. For the large dataset, the overall scores of the naive Bayes on the test set were as follows:

Churn problem: AUC = 0.6468
Appetency problem: AUC = 0.6453
Upselling problem: AUC=0.7211

Note that the baseline results are reported for large dataset only. Moreover, while both training and test datasets are provided at the KDD Cup site, the actual true labels ...

Get Deep Learning: Practical Neural Networks with Java now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.

Start your free trial

Deep Learning: Practical Neural Networks with Java by Yusuke Sugomori, Boštjan Kaluža, Fábio M. Soares, Alan M. F. Souza

Basic naive Bayes classifier baseline

Don’t leave empty-handed

It’s yours, free.

Check it out now on O’Reilly