April 2017
Beginner to intermediate
432 pages
10h 53m
English
Chapter 13
IN THIS CHAPTER
Understanding why simply bigger, larger, and faster isn’t always the right solution
Looking inside the storage and computational approaches of Internet companies
Figuring out how using clusters of commodity hardware reduces costs
Reducing complex algorithms into separable parallel operations by MapReduce
Managing immense amounts of data using streaming or sampling strategies has clear advantages (as discussed in Chapter 12) when you have to deal with massive data processing. Using streaming and sampling algorithms helps you obtain a result even when your computational power is limited (for instance, when using your own computer). However, some costs are associated with these approaches:
Read now
Unlock full access