Skip to main navigation Skip to search Skip to main content

Key based reducer placement for data analytics across data centers considering Bi-level resource provision in cloud computing

  • School of Computer Science and Technology, Harbin Institute of Technology
  • Public Service Platform of Mobile Internet Application Security Industry
  • Shenzhen Key Laboratory of Internet Information Collaboration
  • Shenzhen Applied Technology Engineering Laboratory for Internet Multimedia Application

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Due to the distribution characteristic of the data source, such as astronomy and sales, or the legal prohibition, it is not always practical to store the world-wide data in only one data center (DC). Hadoop is a commonly accepted framework for big data analytics. But it can only deal with data within one DC. The distribution of data necessitates the study of Hadoop across DCs. In this situation, though we can place mapper in the local DCs, where to place reducers is a great challenge, since each reducer almost needs to process all map output across all involved DCs. Aiming to reduce costs, a key based scheme is proposed which can respect the locality principle of traditional Hadoop as much as possible while realizing deployment of reducers with lower cost. Considering both data center level and server level resource provision, a bi-level programming is used to formalize the problem and it is solved by a tailored two level group genetic algorithm (TLGGA). Extensive simulations demonstrate the effectiveness of TLGGA. It can outperform both the baseline and the state-of-the-art mechanisms by 49% and 40%, respectively.

Original languageEnglish
Title of host publicationIoTBD 2016 - Proceedings of the International Conference on Internet of Things and Big Data
EditorsMuthu Ramachandran, Gary Wills, Robert Walters, Victor Mendez Munoz, Victor Chang
PublisherSciTePress
Pages243-254
Number of pages12
ISBN (Electronic)9789897581830
DOIs
StatePublished - 2016
Externally publishedYes
EventInternational Conference on Internet of Things and Big Data, IoTBD 2016 - Rome, Italy
Duration: 23 Apr 201625 Apr 2016

Publication series

NameIoTBD 2016 - Proceedings of the International Conference on Internet of Things and Big Data

Conference

ConferenceInternational Conference on Internet of Things and Big Data, IoTBD 2016
Country/TerritoryItaly
CityRome
Period23/04/1625/04/16

Keywords

  • Distributed cloud
  • Hadoop across data centers
  • Reducer placement
  • Resource provision

Fingerprint

Dive into the research topics of 'Key based reducer placement for data analytics across data centers considering Bi-level resource provision in cloud computing'. Together they form a unique fingerprint.

Cite this