book

Mastering Data Warehouse Aggregates: Solutions for Star Schema Performance

Name: Mastering Data Warehouse Aggregates: Solutions for Star Schema Performance
Author: Christopher Adamson
ISBN: 9780471777090

by Christopher Adamson

July 2006

Intermediate to advanced

378 pages

9h 38m

English

Wiley

Read now

Unlock full access

Content preview from Mastering Data Warehouse Aggregates: Solutions for Star Schema Performance

5.3. Loading the Aggregate Schema

The process of loading the aggregate schema is very similar to the process of loading the base schema. This should come as no surprise; an aggregate star schema is a star schema itself, albeit with a different grain. And pre-joined aggregates share similar properties.

As it turns out, much of the complexity involved in processing the base schema is eliminated during aggregate processing. Examples include multiple-source loads, changed data identification, and the transformation of data for processing one row at a time. Chapter 6 shows how these complexities are eliminated by sourcing the aggregate schema from the base schema.

But before examining the specific processes that load aggregate tables, it is important to consider how aggregate processing fits into the overall load process.

Aggregate loads usually follow the same approach as base schema loads, where a separate program, or process, is developed for each table. The presence of aggregates requires that the load process for the base schema manage the availability of aggregates, taking them off-line during processes that update base tables. This also affects the frequency of aggregate loads. The use of RDBMS features such as materialized views or materialized query tables eliminates the need to design a load process, but availability and load frequency must still be attended to.

Additionally, a choice must be made on the approach to aggregate loads. They may be rebuilt entirely with each ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

O’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.

Julian F.

Head of Cybersecurity

I wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.

Addison B.

Field Engineer

I’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.

Amir M.

Data Platform Tech Lead

I'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.

Mark W.

Embedded Software Engineer

Usage-Driven Database Design: From Logical Data Modeling through Physical Schema Definition

Publisher Resources

ISBN: 9780471777090Purchase book

Cloud Computing

Data Engineering

Data Science

AI & ML

Programming Languages

Software Architecture

IT/Ops

Security

Design

Business

Soft Skills