Skip to Content
The Cloud Data Lake
book

The Cloud Data Lake

by Rukmani Gopalan
December 2022
Beginner to intermediate
244 pages
7h
English
O'Reilly Media, Inc.
Content preview from The Cloud Data Lake

Chapter 1. Big Data—Beyond the Buzz

Without big data, you are blind and deaf and in the middle of a freeway.

Geoffrey Moore

If we were playing workplace bingo, there is a big chance you would win by crossing off all these terms that you have heard in your organization in the past three months: digital transformation, data strategy, transformational insights, data lake, warehouse, data science, machine learning, and intelligence. It is now common knowledge that data is a key ingredient for organizations to succeed, and organizations that rely on data and AI clearly outperform their contenders. According to an IDC study sponsored by Seagate, the amount of data that is captured, collected, or replicated is expected to grow to 175 zettabytes (ZB) by the year 2025. This data that is captured, collected, or replicated is referred to as the Global DataSphere. This data comes from three classes of sources:

The core

Traditional or cloud-based datacenters

The edge

Hardened infrastructure, such as cell towers

The endpoints

PCs, tablets, smartphones, and Internet of Things (IoT) devices

This study also predicts that 49% of this Global DataSphere will be residing in public cloud environments by the year 2025.

If you have ever wondered, “Why does this data need to be stored? What is it good for?” the answer is very simple. Think of all of this data as pieces of words strewn around the globe in different languages, each sharing a sliver of information, like pieces of a puzzle. Stitching ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

AirBnbBlueOriginElectronic ArtsHomeDepotNasdaqRakutenTata Consultancy Services

QuotationMarkO’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.
Julian F.
Head of Cybersecurity
QuotationMarkI wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.
Addison B.
Field Engineer
QuotationMarkI’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.
Amir M.
Data Platform Tech Lead
QuotationMarkI'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.
Mark W.
Embedded Software Engineer

You might also like

The Enterprise Big Data Lake

The Enterprise Big Data Lake

Alex Gorelik
Designing Cloud Data Platforms

Designing Cloud Data Platforms

Lynda Partner, Danil Zburivsky

Publisher Resources

ISBN: 9781098116576Errata Page