Mastering Data Warehouse Aggregates: Solutions for Star Schema Performance

Author: Christopher Adamson
ISBN: 9780471777090

July 2006

Intermediate to advanced

378 pages

9h 38m

English

Wiley

Content preview from Mastering Data Warehouse Aggregates: Solutions for Star Schema Performance

1.4. Summary

This chapter has laid the foundation for the chapters to come, reviewing the basics of star schema design, introducing the aggregate table and aggregate navigator, defining some standard vocabulary, and establishing some guiding principles for invisible aggregates.

While operational systems focus on process execution, data warehouse systems focus on process evaluation. These contrasting purposes lead to distinct operational profiles, which in turn suggest different principles to guide schema design.
The principles of dimensional modeling govern the development of warehouse systems. Process evaluation is enabled by identifying the facts that measure a business process and the dimensions that give them context. These attributes are grouped into tables that form a star schema design.
Dimension tables contain sets of dimensional attributes. They drive access to the facts, constrain queries, and serve as row headers on reports. The use of a surrogate key permits the dimension table to track history, regardless of how changes are handled in operational systems.
Facts are placed in fact tables, along with foreign key references to the appropriate dimension tables. The grain of a fact table identifies the level of detail represented by each row. It is set at the lowest level possible, as determined by available data.
Although the specific questions asked by end users are unpredictable and change over time, queries follow a standard pattern. Questions that cross subject areas ...
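The star schema ideas summarized above can be illustrated with a minimal sketch using Python's built-in sqlite3 module. Everything named here is a hypothetical chosen for illustration and does not come from the book: the tables product_dim, day_dim, and sales_fact, their columns, the sample rows, and the helper functions build_star and sales_by_category. The sketch assumes a grain of one fact row per product per day, and it assumes that a change to a product's category is recorded by adding a new dimension row under a new surrogate key.

```python
import sqlite3


def build_star(conn):
    """Create a tiny, hypothetical star schema and load sample rows."""
    conn.executescript("""
        CREATE TABLE product_dim (
            product_key  INTEGER PRIMARY KEY,  -- surrogate key
            sku          TEXT NOT NULL,        -- natural key from the operational system
            product_name TEXT NOT NULL,
            category     TEXT NOT NULL
        );
        CREATE TABLE day_dim (
            day_key   INTEGER PRIMARY KEY,     -- surrogate key
            full_date TEXT NOT NULL,
            month     TEXT NOT NULL
        );
        -- Assumed grain: one row per product per day.
        CREATE TABLE sales_fact (
            day_key       INTEGER NOT NULL REFERENCES day_dim(day_key),
            product_key   INTEGER NOT NULL REFERENCES product_dim(product_key),
            quantity_sold INTEGER NOT NULL,
            sales_dollars REAL NOT NULL
        );
    """)
    conn.executemany("INSERT INTO product_dim VALUES (?, ?, ?, ?)", [
        (1, "SKU-100", "Widget", "Hardware"),
        (2, "SKU-100", "Widget", "Tools"),   # same SKU after a category change
        (3, "SKU-200", "Gadget", "Hardware"),
    ])
    conn.executemany("INSERT INTO day_dim VALUES (?, ?, ?)", [
        (1, "2006-01-15", "2006-01"),
        (2, "2006-02-10", "2006-02"),
    ])
    conn.executemany("INSERT INTO sales_fact VALUES (?, ?, ?, ?)", [
        (1, 1, 5, 50.0),   # Widget sold while it was in Hardware
        (1, 3, 2, 40.0),
        (2, 2, 3, 30.0),   # Widget sold after it moved to Tools
        (2, 3, 1, 20.0),
    ])


def sales_by_category(conn):
    """Constrain on dimension attributes, group by one, and sum a fact."""
    return conn.execute("""
        SELECT p.category, SUM(f.sales_dollars)
        FROM sales_fact f
        JOIN product_dim p ON p.product_key = f.product_key
        JOIN day_dim d     ON d.day_key = f.day_key
        WHERE d.month IN ('2006-01', '2006-02')
        GROUP BY p.category
        ORDER BY p.category
    """).fetchall()


conn = sqlite3.connect(":memory:")
build_star(conn)
print(sales_by_category(conn))
```

With this sample data the query returns 110.0 for Hardware and 30.0 for Tools. The January Widget sale stays under Hardware because its fact row points at surrogate key 1, while the February sale points at surrogate key 2. That is one way a surrogate key lets the dimension preserve history even when the operational system simply overwrites the category.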
