Adaptive performance control of computing systems via distributed cooperative control: Application to power management in computing clusters

Mianyu Wang, Nagarajan Kandasamy, Allon Guez, Moshe Kam

Research output: Chapter in Book/Report/Conference proceedingConference contribution

24 Scopus citations

Abstract

Advanced control and optimization techniques offer a theoretically sound basis to enable self-managing behavior in distributed computing models such as utility computing. To tractably solve the performance management problems of interest, including resource allocation and provisioning in such distributed computing environments, we develop a fully decentralized control framework wherein the optimization problem for the system is first decomposed into sub-problems, and each sub-problem is solved separately by individual controllers to achieve the overall performance objectives. Concepts from optimal control theory are used to implement individual controllers. The proposed framework is highly scalable, naturally tolerates controller failures, and allows for the dynamic addition/removal of controllers during system operation. As a case study, we apply the control framework to minimize the power consumed by a computing cluster subject to a dynamic workload while satisfying the specified quality-of-service goals. Simulations using real-world workload traces show that the proposed technique has very low control overhead, and adapts quickly to both workload variations and controller failures.

Original languageEnglish (US)
Title of host publicationProceedings - 3rd International Conference on Autonomic Computing, ICAC 2006
Pages165-174
Number of pages10
StatePublished - 2006
Externally publishedYes
Event3rd International Conference on Autonomic Computing, ICAC 2006 - Dublin, Ireland
Duration: Jun 13 2006Jun 16 2006

Publication series

NameProceedings - 3rd International Conference on Autonomic Computing, ICAC 2006
Volume2006

Other

Other3rd International Conference on Autonomic Computing, ICAC 2006
Country/TerritoryIreland
CityDublin
Period6/13/066/16/06

All Science Journal Classification (ASJC) codes

  • Computer Science Applications
  • Software
  • General Mathematics

Fingerprint

Dive into the research topics of 'Adaptive performance control of computing systems via distributed cooperative control: Application to power management in computing clusters'. Together they form a unique fingerprint.

Cite this