Microsoft SQL Server 2012 Bible

by Adam Jorgensen, Jorge Segarra, Patrick LeBlanc, Jose Chinchilla, Aaron Nelson
August 2012
Intermediate to advanced
1416 pages
33h 39m
English
Wiley

The Data Mining Process

A traditional practice in data mining is to train a data mining model using existing data for which an outcome is already known and then use that model to predict the outcome of new data. This requires several steps, only some of which happen within Analysis Services:

  • Business and data understanding: Understand the important questions and the available data to answer those questions. Insights gained must be relevant to business goals to be of use. Data must be of acceptable quality and relevance to obtain reliable answers.
  • Prepare data: Preparing data can be a simple or difficult task depending on the current state of the data. Some of the tasks to consider include the following:
    • Eliminate rows of low data quality. The measure of quality is domain-specific. Eliminate rows whose values fall outside expected norms, or that fail any test showing the row describes an impossible or highly improbable case.
    • Eliminate duplicates, invalid values, or inconsistent values.
    • Denormalize data by creating views that present a single “case” table.
    • Smooth erratic time series data to remove dramatic variations.
    • Add derived attributes, such as profit, which can be useful in the modeling process.
  • Model: You build Analysis Services models by first defining a data mining structure that specifies the tables to use as input. Then, add data mining models (different algorithms) to the structure. Use the training data to simultaneously train all the models within the structure.
  • Evaluate ...
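
The data preparation tasks listed above can be sketched outside Analysis Services. The following is a minimal illustration in Python, not the book's method: the column names (`order_id`, `revenue`, `cost`, `quantity`), the plausibility rules, and the helper functions are all hypothetical. In practice this work is often done in SQL views over the source tables.

```python
from statistics import mean

# Hypothetical source rows; the column names are assumptions for illustration.
rows = [
    {"order_id": 1, "revenue": 120.0, "cost": 80.0, "quantity": 3},
    {"order_id": 2, "revenue": 95.0, "cost": 60.0, "quantity": 2},
    {"order_id": 2, "revenue": 95.0, "cost": 60.0, "quantity": 2},   # duplicate
    {"order_id": 3, "revenue": 50.0, "cost": 30.0, "quantity": -1},  # impossible
    {"order_id": 4, "revenue": 200.0, "cost": 150.0, "quantity": 5},
]


def is_plausible(row):
    """Domain-specific quality test (assumed rules): reject impossible cases."""
    return row["quantity"] > 0 and row["revenue"] >= 0 and row["cost"] >= 0


def prepare_cases(source_rows):
    """Drop low-quality rows and duplicates, then add a derived attribute."""
    seen_ids = set()
    cases = []
    for row in source_rows:
        if not is_plausible(row):
            continue  # eliminate rows that fail the quality test
        if row["order_id"] in seen_ids:
            continue  # eliminate duplicates, keeping the first occurrence
        seen_ids.add(row["order_id"])
        case = dict(row)
        case["profit"] = row["revenue"] - row["cost"]  # derived attribute
        cases.append(case)
    return cases


def moving_average(values, window=3):
    """Smooth a series with a trailing moving average over `window` points."""
    smoothed = []
    for i in range(len(values)):
        start = max(0, i - window + 1)
        smoothed.append(mean(values[start:i + 1]))
    return smoothed


cases = prepare_cases(rows)
smoothed_sales = moving_average([10, 50, 12, 11, 48, 13])
```

Here `cases` plays the role of the single “case” table, with one cleaned row per order, and `smoothed_sales` shows how a trailing average damps sharp swings in an erratic series. The window size is a tuning choice that depends on the data.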

ISBN: 9781118282175