Skip to Main Content
20200406PAIML-Raw-Unedited
book

20200406PAIML-Raw-Unedited

by Noah Gift, Alfredo Deza
December 2020
Intermediate to advanced content levelIntermediate to advanced
196 pages
3h 22m
English
Pragmatic AI Labs
Content preview from 20200406PAIML-Raw-Unedited

Chapter7: Big Data Platforms

Cloud Object Storage

At the heart of a Big Data is a Cloud Object storage system.

Amazon S3

The best known cloud object storage system is Amazon S3.

[

![](https://img.youtube.com/vi/1gauWMpmf_E/0.jpg)

](https://youtu.be/1gauWMpmf_E

The Three “Vs” of Big Data: Variety, Velocity and Volume

There are many ways to define Big Data. One way of describing Big Data is it is data that it too large to process on your laptop. Another method is to the Three “Vs” of Big Data.

Big Data Challenges

Variety

Many types of data.

  • Unstructured text
  • CSV files
  • binary files
  • big data files: Apache Parquet
  • Database files

Velocity

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Start your free trial

You might also like

Reinventing the Organization for GenAI and LLMs

Reinventing the Organization for GenAI and LLMs

Ethan Mollick
Building Sensor Networks

Building Sensor Networks

Ioanis Nikolaidis, Krzysztof Iniewski

Publisher Resources

ISBN: 20200406PAIMLOtherPublisher Website