Chapter 8

Big Data

Abstract

“Big Data” describes much more than the amounts of data involved. The lack of structure in the data and the way the data are used require different approaches to building storage. Although the field is still in its early days, Big Data will rapidly become an issue for most enterprises and medium-sized businesses.

Keywords

Analytics; Apache; Big Data; Cassandra; Ceph; GlusterFS; GoogleFS; Hadoop; HBase; HDFS; In-memory database; Lustre; NVMe; OpenStack Swift; PVFS; Quantcast; Spark; Stream processing
The advent of cheap storage and analytics hardware and software is making it possible to gather and utilize much more of the data that we threw away in the past. This is often called “Big Data [1]”, but Big Data is really a misnomer. It should be “Lots of ...
