Main Page

From MapReduce Design Patterns
Jump to: navigation, search

Design Patterns

Summarization Patterns
Numerical Summarizations
Inverted Index
Counting with Counters
Filtering Patterns
Filtering
Bloom Filtering
Top Ten
Distinct
Data Organization Patterns
Structured to Hierarchical
Partitioning
Binning
Total Order Sorting
Shuffling
Join Patterns
Reduce Side Join
Reduce Side Join with Bloom Filter
Replicated Join
Composite Join
Cartesian Product
Metapatterns
Job Chaining
Chain Folding
Job Merging
Input and Output Patterns
Generating Data
External Source Output
External Source Input
Partition Pruning


About This Project

MapReducePatterns.com is an open and collaborative documentation project in which the Hadoop community can rally around a well-defined set of standard MapReduce design patterns.

Check the To Do List to see how you may be able to contribute to this project or see what we are currently working on.


Pattern levels

List of official patterns.

List of candidate patterns.

List of patterns in development.


Useful and related links

The MapReduce Design Patterns book

Repository for all "MapReduce Design Patterns" book code examples

https://github.com/adamjshook/mapreducepatterns

Blog post on why the time is right for MapReduce design patterns.

A presentation on MapReduce design patterns given at Strata.

Personal tools
Namespaces

Variants
Actions
Navigation
Toolbox