Skip to Main Content
Getting Started with Kudu
book

Getting Started with Kudu

by Jean-Marc Spaggiari, Mladen Kovacevic, Brock Noland, Ryan Bosshart
July 2018
Beginner to intermediate content levelBeginner to intermediate
156 pages
4h 2m
English
O'Reilly Media, Inc.
Content preview from Getting Started with Kudu

Chapter 6. Table and Schema Design

In this chapter, we cover schema design in Kudu with the goal of explaining the basic concepts and primitives to make your project successful. An ideal schema would result in read and write operations spreading evenly across the cluster and also result in the minimum amount of data being processed during query evaluation. It’s our belief that by understanding the basics described in this chapter, you will be closer to building an ideal schema and thus be on the pathway to success.

The Kudu project itself has fantastic schema design documentation, so even though there is some overlap, we will also focus on topics of particular importance and provide additional background.

In any data storage system, schema design is extremely important and the cause of many headaches and showstoppers. Poor schema design in relational databases can cause issues ranging from intensive resource consumption to data corruption. HBase and Cassandra require extensive knowledge of how the data will be accessed prior to designing a schema, and a deficiency here is the most common cause of project blockers due to slow query performance—almost always due to intensive resource consumption. In Kudu, schema design is as important, but Kudu provides some features these other systems don’t provide to make a larger range of use cases possible.

Schema Design Basics

This section provides basics of Kudu schema design for readers who have not read the official schema design documentation: ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Start your free trial

You might also like

Building a Near Real-Time Analytical Application with Kudu

Building a Near Real-Time Analytical Application with Kudu

Ryan Bosshart

Publisher Resources

ISBN: 9781491980248Errata Page