Chapter 7: Big Data Platforms

Cloud Object Storage

At the heart of a Big Data platform is a cloud object storage system.

Amazon S3

The best-known cloud object storage system is Amazon S3.
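As a rough illustration of the object storage model (a flat mapping from bucket and key to bytes), the sketch below writes an object, reads it back, and lists keys by prefix. The method names `put_object`, `get_object`, and `list_objects_v2` and their `Bucket`, `Key`, `Body`, and `Prefix` parameters follow the boto3 S3 client. Everything else is an assumption for illustration: the `InMemoryS3` class is a hypothetical stand-in so the example runs without AWS credentials, and the bucket name, key, and `round_trip` helper are made up.

```python
import io


class InMemoryS3:
    """Hypothetical stand-in that mimics three boto3 S3 client calls."""

    def __init__(self):
        # Object storage is a flat namespace: (bucket, key) -> bytes.
        self._objects = {}

    def put_object(self, Bucket, Key, Body):
        self._objects[(Bucket, Key)] = bytes(Body)
        return {}

    def get_object(self, Bucket, Key):
        # boto3 returns a streaming body; BytesIO offers the same .read().
        return {"Body": io.BytesIO(self._objects[(Bucket, Key)])}

    def list_objects_v2(self, Bucket, Prefix=""):
        keys = sorted(
            k for (b, k) in self._objects if b == Bucket and k.startswith(Prefix)
        )
        return {"KeyCount": len(keys), "Contents": [{"Key": k} for k in keys]}


def round_trip(client, bucket, key, data):
    """Store bytes under a key, read them back, and list keys sharing its prefix."""
    client.put_object(Bucket=bucket, Key=key, Body=data)
    body = client.get_object(Bucket=bucket, Key=key)["Body"].read()
    prefix = key.split("/")[0]
    listing = client.list_objects_v2(Bucket=bucket, Prefix=prefix)
    keys = [obj["Key"] for obj in listing.get("Contents", [])]
    return body, keys


# With AWS credentials configured, a real client would be: boto3.client("s3")
client = InMemoryS3()
body, keys = round_trip(client, "example-bucket", "raw/events.csv", b"id,value\n1,42\n")
print(body, keys)
```

Note that the "folder" in `raw/events.csv` is only a naming convention: the listing call filters keys by string prefix rather than walking a directory tree.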

[![](https://img.youtube.com/vi/1gauWMpmf_E/0.jpg)](https://youtu.be/1gauWMpmf_E)

The Three “Vs” of Big Data: Variety, Velocity and Volume

There are many ways to define Big Data. One is that Big Data is data that is too large to process on your laptop. Another is the Three “Vs” of Big Data: variety, velocity, and volume.

Big Data Challenges

Variety

Many types of data.

  • Unstructured text
  • CSV files
  • Binary files
  • Big data file formats, such as Apache Parquet
  • Database files
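To make the variety challenge concrete, here is a minimal sketch that handles four of the data types listed above with a different reader for each, using only the Python standard library. The file names, the `events` table, and the `summarize` helper are hypothetical. Apache Parquet is left out because reading it needs a third-party library (for example pyarrow), which this sketch does not assume is installed.

```python
import csv
import sqlite3
import struct
import tempfile
from pathlib import Path


def summarize(path):
    """Return a short description of a file, chosen by its extension."""
    suffix = path.suffix.lower()
    if suffix == ".txt":  # unstructured text
        return f"text, {len(path.read_text().split())} words"
    if suffix == ".csv":  # tabular text with a header row
        with path.open(newline="") as f:
            rows = list(csv.DictReader(f))
        return f"csv, {len(rows)} rows"
    if suffix == ".bin":  # raw binary: three little-endian 32-bit ints
        values = struct.unpack("<3i", path.read_bytes())
        return f"binary, ints {values}"
    if suffix == ".db":  # database file (SQLite)
        conn = sqlite3.connect(path)
        try:
            (count,) = conn.execute("SELECT COUNT(*) FROM events").fetchone()
        finally:
            conn.close()
        return f"sqlite, {count} rows"
    return "unknown format"


with tempfile.TemporaryDirectory() as tmp:
    root = Path(tmp)
    # Create one small sample file per data type.
    (root / "notes.txt").write_text("free form log line")
    (root / "events.csv").write_text("id,value\n1,10\n2,20\n")
    (root / "ints.bin").write_bytes(struct.pack("<3i", 1, 2, 3))
    conn = sqlite3.connect(root / "events.db")
    conn.execute("CREATE TABLE events (id INTEGER, value INTEGER)")
    conn.executemany("INSERT INTO events VALUES (?, ?)", [(1, 10), (2, 20)])
    conn.commit()
    conn.close()

    summaries = {p.name: summarize(p) for p in sorted(root.iterdir())}

for name, summary in summaries.items():
    print(name, "->", summary)
```

Each format needs its own parsing logic and its own assumptions about layout, which is why variety is a challenge in its own right, independent of data size.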

Velocity
