Skip to Main Content
The Cloud Data Lake
book

The Cloud Data Lake

by Rukmani Gopalan
December 2022
Beginner to intermediate content levelBeginner to intermediate
244 pages
7h
English
O'Reilly Media, Inc.
Book available
Content preview from The Cloud Data Lake

Chapter 1. Big Data—Beyond the Buzz

Without big data, you are blind and deaf and in the middle of a freeway.

Geoffrey Moore

If we were playing workplace bingo, there is a big chance you would win by crossing off all these terms that you have heard in your organization in the past three months: digital transformation, data strategy, transformational insights, data lake, warehouse, data science, machine learning, and intelligence. It is now common knowledge that data is a key ingredient for organizations to succeed, and organizations that rely on data and AI clearly outperform their contenders. According to an IDC study sponsored by Seagate, the amount of data that is captured, collected, or replicated is expected to grow to 175 zettabytes (ZB) by the year 2025. This data that is captured, collected, or replicated is referred to as the Global DataSphere. This data comes from three classes of sources:

The core

Traditional or cloud-based datacenters

The edge

Hardened infrastructure, such as cell towers

The endpoints

PCs, tablets, smartphones, and Internet of Things (IoT) devices

This study also predicts that 49% of this Global DataSphere will be residing in public cloud environments by the year 2025.

If you have ever wondered, “Why does this data need to be stored? What is it good for?” the answer is very simple. Think of all of this data as pieces of words strewn around the globe in different languages, each sharing a sliver of information, like pieces of a puzzle. Stitching ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Start your free trial

You might also like

The Enterprise Big Data Lake

The Enterprise Big Data Lake

Alex Gorelik
Designing Cloud Data Platforms

Designing Cloud Data Platforms

Lynda Partner, Danil Zburivsky

Publisher Resources

ISBN: 9781098116576Errata Page