Skip to Content
Practical Applications of Data Mining
book

Practical Applications of Data Mining

by Sang C. Suh
January 2011
Intermediate to advanced
420 pages
12h 32m
English
Jones & Bartlett Learning
Content preview from Practical Applications of Data Mining
3.4 divide-and-Conquer approaCh 105
Suppose we need to determine if further testing on attribute X is needed.
Then, the chi-square value is calculated using the value of the attribute in the
formula above. If the value is lower than a pre-assigned threshold, say 95%,
we cannot reject that the test on attribute X is irrelevant to the classification.
Further dividing of the datasets is not needed. If the value is higher than the
threshold, the test on the attribute is necessary. If no attribute is found to be
relevant, then the tree should stop growing at the point of the subtree. The
decision tree built this way will avoid over-fitting.
A Problem ...
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Start your free trial

You might also like

Data Mining

Data Mining

Nong Ye
Data Mining and Machine Learning Applications

Data Mining and Machine Learning Applications

Rohit Raja, Kapil Kumar Nagwanshi, Sandeep Kumar, K. Ramya Laxmi
R Data Mining

R Data Mining

Enrico Pegoraro, Andrea Cirillo

Publisher Resources

ISBN: 9780763785871