On an Approximation Algorithm Combined with D3QN for HDFS Data Block Recovery in Heterogeneous Hadoop Clusters

Yijie Zhang, Chase Q. Wu, Aiqin Hou

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Hadoop stands as a cornerstone in the realm of big data processing, with its Hadoop Distributed File System (HDFS) serving as a pivotal layer ensuring fault tolerance and high throughput data storage. Through mechanisms such as block replication and cluster-wide distribution, HDFS facilitates parallel computing in higher layers. However, the inherent heterogeneity within Hadoop clusters introduces complexities, particularly concerning the reliability of stored data. The failure of DataNodes within heterogeneous clusters poses a significant risk, potentially leading to data loss and compromising data reliability. Notably, the default block recovery strategy within HDFS overlooks the varying capacities of data nodes and the diverse patterns of data access, rendering it inadequate for heterogeneous environments. To address this gap, we first propose a novel approach for block recovery selection based on dueling double deep Q-networks, augmented with Gaussian Process Regression. We further formulate block recovery placement as an optimization problem in heterogeneous clusters, show its NP-completeness, and design an approximation algorithm that leverages linear programming-based iterative rounding (LPIR-BR), which offers a robust performance guarantee. Extensive experimental results validates the efficacy of LPIR-BR, showcasing its superiority over existing algorithms and affirming the soundness of our theoretical framework.

Original languageEnglish (US)
Title of host publicationIntelligent Systems and Applications - Proceedings of the 2024 Intelligent Systems Conference IntelliSys Volume 1
EditorsKohei Arai
PublisherSpringer Science and Business Media Deutschland GmbH
Pages381-401
Number of pages21
ISBN (Print)9783031663284
DOIs
StatePublished - 2024
EventIntelligent Systems Conference, IntelliSys 2024 - Amsterdam, Netherlands
Duration: Sep 5 2024Sep 6 2024

Publication series

NameLecture Notes in Networks and Systems
Volume1065 LNNS
ISSN (Print)2367-3370
ISSN (Electronic)2367-3389

Conference

ConferenceIntelligent Systems Conference, IntelliSys 2024
Country/TerritoryNetherlands
CityAmsterdam
Period9/5/249/6/24

All Science Journal Classification (ASJC) codes

  • Control and Systems Engineering
  • Signal Processing
  • Computer Networks and Communications

Keywords

  • Approximation algorithm
  • Big data
  • Block distribution
  • Dueling double deep Q-network
  • Hadoop distributed file system
  • Performance bound
  • Reliability

Fingerprint

Dive into the research topics of 'On an Approximation Algorithm Combined with D3QN for HDFS Data Block Recovery in Heterogeneous Hadoop Clusters'. Together they form a unique fingerprint.

Cite this