Harnessing Data Movement in Virtual Clusters for In-Situ Execution

Dan Huang, Qing Liu, Scott Klasky, Jun Wang, Jong Youl Choi, Jeremy Logan, Norbert Podhorszki

Research output: Contribution to journalArticlepeer-review

10 Scopus citations

Abstract

As a result of increasing data volume and velocity, Big Data science at exascale has shifted towards the in-situ paradigm, where large scale simulations run concurrently alongside data analytics. With in-situ, data generated from simulations can be processed while still in memory, thereby avoiding the slow storage bottleneck. However, running simulations and analytics together on shared resources will likely result in substantial contention if left unmanaged, as demonstrated in this work, leading to much reduced efficiency of simulations and analytics. Recently, virtualization technologies such as Linux containers have been widely applied to data centers and physical clusters to provide highly efficient and elastic resource provisioning for consolidated workloads including scientific simulations and data analytics. In this paper, we investigate to facilitate network traffic manipulation and reduce mutual interference on the network for in-situ applications in virtual clusters. In order to dynamically allocate the network bandwidth when it is needed, we adopt SARIMA-based techniques to analyze and predict MPI traffic issued from simulations. Although this can be an effective technique, the naïve usage of network virtualization can lead to performance degradation for bursty asynchronous transmissions within an MPI job. We analyze and resolve this performance degradation in virtual clusters.

Original languageEnglish (US)
Article number8451897
Pages (from-to)615-629
Number of pages15
JournalIEEE Transactions on Parallel and Distributed Systems
Volume30
Issue number3
DOIs
StatePublished - Mar 1 2019

All Science Journal Classification (ASJC) codes

  • Signal Processing
  • Hardware and Architecture
  • Computational Theory and Mathematics

Keywords

  • ARIMA
  • In-situ applications
  • MPI
  • collective communication
  • virtual network
  • virtual switch

Fingerprint

Dive into the research topics of 'Harnessing Data Movement in Virtual Clusters for In-Situ Execution'. Together they form a unique fingerprint.

Cite this