Graph explorer

Hadoop Performance Models

Hadoop MapReduce is now a popular choice for performing large-scale data analytics. This technical report describes a detailed set of mathematical performance models for describing the execution of a MapReduce job on Hadoop. The models describe dataflow and cost information at the fine granularity of phases within the map and reduce tasks of a job execution. The models can be used to estimate the performance of MapReduce jobs as well as to find the optimal configuration settings to use when running the jobs.

3 nodes2 linksoverview mapHadoop Performance Models
3 nodes2 links
Hadoop Performance Models3 visible / 3 total nodes / 2 links
AuthorshipTopic signalWHadoop Performance Modelspreprint / 2011AHerodotos HerodotouResearcherTDistributed, Parallel, ...4102 works
PaperSignal 102 links

Hadoop Performance Models

preprint / 2011

Open