O'Reilly logo

Illuminating Statistical Analysis Using Scenarios and Simulations by Jeffrey E. Kottemann

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

GData Mining

Data mining aims to discover and make use of patterns and relationships that are manifest in data. There are many different methods that can be used for data mining, and linear regression is one. In a data mining context, we could use linear regression to build a model to predict numbers of interest. Then, once the model is built using a set of cases, we would assess how well the model can predict a new set of cases. Further, we'll also want to compare the model's performance to that of models constructed using other predictive analytics methods, such as neural networks and decision trees. Let's go through this process a step at a time.

We will no longer dwell on img-values, but instead we'll focus on measures of fit such as Adjusted img-squared. Further, we will not dwell so much on the adjusted img-squared we get when we build a model, but instead we'll focus more on the adjusted img-squared we get when we use the model to predict new cases.

Below is the regression equation we developed in Chapter 43. I will call it “The Model.”

The above adjusted -squared value of 0.67 tells ...

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, interactive tutorials, and more.

Start Free Trial

No credit card required