Skip to Content
Practical Applications of Data Mining
book

Practical Applications of Data Mining

by Sang C. Suh
January 2011
Intermediate to advanced
420 pages
12h 32m
English
Jones & Bartlett Learning
Content preview from Practical Applications of Data Mining
162 Chapter 4 StatiStiCS for Data Mining
(Query 11)
Find attribute_1 times attribute_2 divided by attribute_3 from the table.
(SQL 11)
Select attribute_1 × attribute_2 / attribute_3 from the table.
These queries compute the value of the expected frequency based on
three groups of data: attribute_1, attribute_2, and attribute_3, where their size
must be the same. Finally, we define the c
2
-based query as follows, which
involves two subqueries, one for the observed variable and the other for the
expected variable:
(Query 12)
Are attribute_1 and attribute_2 independent in the table?
(SQL 12)
Select chi-square() from the table where variable is observed and variable is
expected.
From the previous example dealing with price and feature attributes, w
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Start your free trial

You might also like

Data Mining

Data Mining

Nong Ye
Data Mining and Machine Learning Applications

Data Mining and Machine Learning Applications

Rohit Raja, Kapil Kumar Nagwanshi, Sandeep Kumar, K. Ramya Laxmi
R Data Mining

R Data Mining

Enrico Pegoraro, Andrea Cirillo

Publisher Resources

ISBN: 9780763785871