O'Reilly logo

Architecting Modern Data Platforms by Lars George, Paul Wilkinson, Ian Buss, Jan Kunigk

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

Part I. Infrastructure

The defining characteristics of big data—volume, variety, and velocity—don’t just apply to the information stored within a modern data platform; they also apply to the knowledge required to build and use one effectively.

The topics touched upon are varied and deep, ranging from hardware selection and datacenter management through to statistics and machine learning. Even from just a platform architecture perspective, which is the scope of this book, the body of knowledge required is considerable. With such a wide selection of topics to cover, we have decided to present the material in parts.

In this first part, our intention is to equip the reader with foundational knowledge and understanding relating to infrastructure, both physical and organizational. Some chapters will be a deep dive into subjects such as compute and storage technologies, while others provide a high-level overview of subjects such as datacenter considerations and organizational challenges.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, interactive tutorials, and more.

Start Free Trial

No credit card required