Machine Learning in Java - Second Edition

by AshishSingh Bhatia, Bostjan Kaluza

November 2018

Intermediate to advanced

300 pages

7h 42m

English

Packt Publishing

Content preview from Machine Learning in Java - Second Edition

Data preprocessing

First, we will use Weka's built-in weka.filters.unsupervised.attribute.RemoveUseless filter, which works as its name suggests: it removes attributes that do not vary at all or that vary too much. All constant attributes are removed, along with any attribute that exceeds the maximum variance percentage. This test is applied only to nominal attributes, and its threshold is set with the -M parameter. The default is 99%, which means that a nominal attribute is removed if its number of distinct values amounts to more than 99% of the instances (an ID-like attribute, for example), as follows:

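import weka.filters.Filter;
import weka.filters.unsupervised.attribute.RemoveUseless;

RemoveUseless removeUseless = new RemoveUseless();
removeUseless.setOptions(new String[] { "-M", "99" }); // maximum variance percentage (nominal attributes only)
removeUseless.setInputFormat(data); // data is a weka.core.Instances object
data = Filter.useFilter(data, removeUseless); // returns a filtered copy of the dataset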
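
To make the removal rule concrete, the following plain-Java sketch applies the same two checks to a single nominal column. It is an illustration of the rule as described above, not Weka's implementation: the class name UselessAttributeSketch, its methods, and the convention of using null for a missing value are all assumptions made for this example.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Objects;

// Illustrative sketch only: mimics the rule described in the text, not Weka's source code.
class UselessAttributeSketch {

    // Number of distinct non-missing values in the column (null marks a missing value).
    static long distinctCount(List<String> column) {
        return column.stream().filter(Objects::nonNull).distinct().count();
    }

    // Distinct values as a percentage of the non-missing values.
    static double distinctPercent(List<String> column) {
        long present = column.stream().filter(Objects::nonNull).count();
        return present == 0 ? 0.0 : 100.0 * distinctCount(column) / present;
    }

    // A column is treated as useless if it is constant (or entirely missing),
    // or if its distinct-value percentage exceeds the threshold.
    static boolean isUseless(List<String> column, double maxVariancePercent) {
        if (distinctCount(column) < 2) {
            return true;
        }
        return distinctPercent(column) > maxVariancePercent;
    }

    public static void main(String[] args) {
        List<String> constant = Arrays.asList("a", "a", "a", "a");
        List<String> ids = Arrays.asList("id1", "id2", "id3", "id4");
        List<String> colors = Arrays.asList("red", "blue", "red", "blue");

        System.out.println(isUseless(constant, 99.0)); // true: only one distinct value
        System.out.println(isUseless(ids, 99.0));      // true: 100% distinct values
        System.out.println(isUseless(colors, 99.0));   // false: 50% distinct values
    }
}
```

With the default threshold of 99, the constant column and the ID-like column would be dropped, while the two-valued column would be kept.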

Next, we will replace all of the missing ...

Publisher Resources

ISBN: 9781788474399