Skip to main navigation Skip to search Skip to main content

Towards a cost-efficient mapreduce: Mitigating power peaks for hadoop clusters

  • Nan Zhu
  • , Xue Liu*
  • , Jie Liu
  • , Yu Hua
  • *Corresponding author for this work
  • McGill University
  • Microsoft USA
  • Huazhong University of Science and Technology

Research output: Contribution to journalArticlepeer-review

Abstract

Distributed data processing system is becoming one of the most important components for data-intensive computational tasks in the enterprise software infrastructure. Deploying and operating such systems require large amount of costs, including hardware costs to build clusters and energy costs to run clusters. To make these systems sustainable and scalable, power management has been an important research problem. In this paper, we take Hadoop as an example to illustrate the power peak problem which causes power inefficiency and provides in-depth analysis to explain issues with existing system designs. We propose a novel power capping module in the Hadoop scheduler to mitigate power peaks. Extensive simulation studies show that our proposed solution can effectively smooth the power consumption curve and mitigate temporary power peaks for Hadoop clusters.

Original languageEnglish
Pages (from-to)24-32
Number of pages9
JournalTsinghua Science and Technology
Volume19
Issue number1
DOIs
StatePublished - Feb 2014
Externally publishedYes

Keywords

  • MapReduce
  • Power management
  • Power peaks

Fingerprint

Dive into the research topics of 'Towards a cost-efficient mapreduce: Mitigating power peaks for hadoop clusters'. Together they form a unique fingerprint.

Cite this