Abstract
We describe the use of F5's BIG-IP smart switch technology (3600 Series and Local Traffic Manager v9.0) to provide load balancing and automatic fail-over for multiple Grid services (GUMS, VOMS) and their associated back-end MySQL databases. Resiliency is introduced both in front of the external application servers and for the back-end database systems, which is what makes the approach "multi-tier". The combination of solutions chosen to ensure high availability of the services, in particular the database replication and fail-over mechanism, is discussed in detail. The paper explains the design and configuration of the overall system, including virtual servers, machine pools, and health monitors (which govern routing), as well as the master-slave database scheme and the fail-over policies and procedures. Pre-deployment planning and stress testing are outlined, and the integration of the systems with our Nagios-based facility monitoring and alerting is described. The application characteristics of GUMS and VOMS that enable effective clustering are also explained. We then summarize our practical experiences and real-world scenarios from operating a major US Grid center, and assess the applicability of our approach to other Grid services in the future.
Original language | English (US) |
---|---|
Article number | 42044 |
Journal | Journal of Physics: Conference Series |
Volume | 331 |
Issue number | PART 4 |
State | Published - 2011 |
Event | International Conference on Computing in High Energy and Nuclear Physics, CHEP 2010 - Taipei, Taiwan, Province of China; Duration: Oct 18 2010 → Oct 22 2010 |
All Science Journal Classification (ASJC) codes
- General Physics and Astronomy