Skip to Content
MapReduce Design Patterns
book

MapReduce Design Patterns

by Donald Miner, Adam Shook
December 2012
Intermediate to advanced content levelIntermediate to advanced
247 pages
6h 48m
English
O'Reilly Media, Inc.
Content preview from MapReduce Design Patterns

Index

A note on the digital index

A link in an index entry is displayed as the section title in which that entry appears. Because some sections have multiple index markers, it is not unusual for an entry to have several links to the same section. Clicking on any link will take you directly to the place in the text in which the marker appears.

A

access dates, partitioning users by, Partitioning users by last access dateReducer code, Querying for user reputation by last access dateDriver code
anonymizing data, MotivationReducer code, Anonymous comments and distinct usersDriver code
antijoin operations, A Refresher on Joins
Apache Hadoop (see Hadoop)
audio, trends in nature of data, Images, Audio, and Video
averages, calculating, Average exampleData flow diagram

B

BigTable design (Google), Google BigTable
binning pattern
description, Pattern DescriptionPerformance analysis
examples, Binning by Hadoop-related tagsMapper code
Bloom filtering pattern
description, Pattern DescriptionPerformance analysis
examples, Hot listMapper Code
reduce side joins with, Reputable user and comment joinComment mapper code
Bloom filters
about, Overview
downsides, Downsides
tweaking, Tweaking Your Bloom Filter
use cases, Use CasesGoogle BigTable
Bloom, Burton Howard, Overview
BloomFilter class, Bloom filter training

C

Cartesian product pattern
description, Pattern DescriptionPerformance Analysis
examples, Comment ComparisonMapper code
Cartesian products, A Refresher on Joins
chain folding
about, Chain FoldingChain Folding ...
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Start your free trial

You might also like

Microservices Patterns

Microservices Patterns

Chris Richardson
Java Concurrency in Practice

Java Concurrency in Practice

Brian Goetz, Tim Peierls, Joshua Bloch, Joseph Bowbeer, David Holmes, Doug Lea
Machine Learning Design Patterns

Machine Learning Design Patterns

Valliappa Lakshmanan, Sara Robinson, Michael Munn

Publisher Resources

ISBN: 9781449341954Errata PageSupplemental Content