Skip to Content
Microsoft SQL Server 2012 Bible
book

Microsoft SQL Server 2012 Bible

by Adam Jorgensen, Jorge Segarra, Patrick LeBlanc, Jose Chinchilla, Aaron Nelson
August 2012
Intermediate to advanced
1416 pages
33h 39m
English
Wiley
Content preview from Microsoft SQL Server 2012 Bible

Algorithms

When working with data mining, it is useful to understand mining algorithm basics and when to apply each algorithm. Table 57.2 summarizes common algorithms used for the problem categories presented in this chapter's introduction.

Table 57.2 Common Mining Algorithm Usage

Problem Type Primary Algorithms
Segmentation Clustering, Sequence Clustering
Classification Decision Trees, Naive Bayes, Neural Network, Logistic Regression
Association Association Rules, Decision Trees
Estimation Decision Trees, Linear Regression, Logistic Regression, Neural Network
Forecasting Time Series
Sequence Analysis Sequence Clustering

These are guidelines only because not every data mining problem falls into these categories. In addition, there may be other algorithms that you can apply to the listed problem types.

Decision Trees

The decision trees algorithm is the most accurate for many problems. It operates by building a decision tree beginning with the All node, corresponding to all the training cases, as shown in Figure 57.3. Then an attribute is chosen to split those cases into groups, which then separate based on another attribute, and so on. The goal is to generate leaf nodes with a single predictable outcome. For example, if the goal is to identify who will purchase a bike, then leaf nodes should contain cases that are either bike buyers or not bike buyers, but no combinations (or as close to that goal as possible).

Figure 57.3 This is a great example of the decision tree ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

AirBnbBlueOriginElectronic ArtsHomeDepotNasdaqRakutenTata Consultancy Services

QuotationMarkO’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.
Julian F.
Head of Cybersecurity
QuotationMarkI wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.
Addison B.
Field Engineer
QuotationMarkI’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.
Amir M.
Data Platform Tech Lead
QuotationMarkI'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.
Mark W.
Embedded Software Engineer

You might also like

Microsoft SQL Server 2012 Internals

Microsoft SQL Server 2012 Internals

Bob Beauchemin Kalen Delaney Conor Cunningham, Jonathan Kehayias, Benjamin Nevarez, and Paul S. Randal
SQL Server 2012 T-SQL Recipes: A Problem-Solution Approach

SQL Server 2012 T-SQL Recipes: A Problem-Solution Approach

Jason Brimhall, David Dye, Jonathan Gennick, Andy Roberts, Wayne Sheffield

Publisher Resources

ISBN: 9781118282175Purchase book