Disaster recovery and data centre operational continuity

J. Caballero Bejar, C. Caramarcu, J. De Stefano, M. Ernst, J. Fetzko, C. Gamboa, C. Hollowell, J. Hover, S. Kandasamy, M. Karasawa, Z. Liu, S. Misawa, W. Strecker-Kellogg, O. Rind, J. Smith, T. Wlodek, A. Wong, D. Yu, A. Zaytsev, X. Zhao

Research output: Contribution to journal › Conference article › peer-review

Abstract

The RHIC and ATLAS Computing Facility (RACF) at Brookhaven Lab is a dedicated data center serving the needs of the RHIC and US ATLAS community. Since it began operations in the mid-1990s, it has operated continuously with few unplanned downtimes. In the past 15 months, Brookhaven Lab has been affected by two hurricanes and a record-breaking snowstorm. In this presentation, we discuss lessons learned regarding preparedness for natural or man-made disasters, operational continuity, remote access and safety protocols, including overall operational procedures developed as a result of these recent events.

Original language: English (US)
Article number: 062052
Journal: Journal of Physics: Conference Series
Volume: 513
Issue number: TRACK 6
State: Published - 2014
Event: 20th International Conference on Computing in High Energy and Nuclear Physics, CHEP 2013 - Amsterdam, Netherlands
Duration: Oct 14 2013 to Oct 18 2013

All Science Journal Classification (ASJC) codes

  • Physics and Astronomy (all)
