Skip to Content
Mastering Data Warehouse Aggregates: Solutions for Star Schema Performance
book

Mastering Data Warehouse Aggregates: Solutions for Star Schema Performance

by Christopher Adamson
July 2006
Intermediate to advanced
378 pages
9h 38m
English
Wiley
Content preview from Mastering Data Warehouse Aggregates: Solutions for Star Schema Performance

8.4. Summary

Advanced dimensional design techniques affect the design of dimensional aggregates in a variety of ways:

  • The semi-additive facts of periodic snapshots must not be summed across time periods. Invisible aggregates can summarize these facts across other dimensions. Derived schemas may average the semi-additive fact across periods, but cannot be further summarized.

  • An accumulating snapshot is generally a poor candidate for aggregation; omission of numerous milestones may be required to produce any savings in size, and the resulting aggregate can include lag times only as averages.

  • A factless fact table that models a transaction or event can be aggregated, producing a transaction count. Symmetry with the base schema can be restored by including this fact in the base schema with a constant value of 1.

  • Coverage tables are usually poor candidates for aggregation because summarization destroys the business definition of the relationship the table represents.

  • Aggregates of transaction dimensions usually omit timestamps. Their presence in an aggregate is likely to limit the number of rows summarized, and updates to the expiration timestamp make incremental loads complex.

  • Bridge tables are often the source of derived schemas that eliminate repeating values; these schemas in turn are summarized by invisible aggregates. If the bridge does not include an allocation factor, it may be possible to summarize a bridge table itself, or limit aggregation to other dimensions in the schema.

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

AirBnbBlueOriginElectronic ArtsHomeDepotNasdaqRakutenTata Consultancy Services

QuotationMarkO’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.
Julian F.
Head of Cybersecurity
QuotationMarkI wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.
Addison B.
Field Engineer
QuotationMarkI’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.
Amir M.
Data Platform Tech Lead
QuotationMarkI'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.
Mark W.
Embedded Software Engineer

You might also like

Usage-Driven Database Design: From Logical Data Modeling through Physical Schema Definition

Usage-Driven Database Design: From Logical Data Modeling through Physical Schema Definition

George Tillmann

Publisher Resources

ISBN: 9780471777090Purchase book