Experience with multi-tier grid MySQL database service resiliency at BNL

Tomasz Wlodek, Michael Ernst, John Hover, Dimitrios Katramatos, Jay Packard, Yuri Smirnov, Dantong Yu

Research output: Contribution to journalConference articlepeer-review


We describe the use of F5's BIG-IP smart switch technology (3600 Series and Local Traffic Manager v9.0) to provide load balancing and automatic fail-over to multiple Grid services (GUMS, VOMS) and their associated back-end MySQL databases. This resiliency is introduced in front of the external application servers and also for the back-end database systems, which is what makes it "multi-tier". The combination of solutions chosen to ensure high availability of the services, in particular the database replication and fail-over mechanism, are discussed in detail. The paper explains the design and configuration of the overall system, including virtual servers, machine pools, and health monitors (which govern routing), as well as the master-slave database scheme and fail-over policies and procedures. Pre-deployment planning and stress testing will be outlined. Integration of the systems with our Nagios-based facility monitoring and alerting is also described. And application characteristics of GUMS and VOMS which enable effective clustering will be explained. We then summarize our practical experiences and real-world scenarios resulting from operating a major US Grid center, and assess the applicability of our approach to other Grid services in the future.

Original languageEnglish (US)
Article number42044
JournalJournal of Physics: Conference Series
Issue numberPART 4
StatePublished - 2011
EventInternational Conference on Computing in High Energy and Nuclear Physics, CHEP 2010 - Taipei, Taiwan, Province of China
Duration: Oct 18 2010Oct 22 2010

All Science Journal Classification (ASJC) codes

  • General Physics and Astronomy


Dive into the research topics of 'Experience with multi-tier grid MySQL database service resiliency at BNL'. Together they form a unique fingerprint.

Cite this