Disaster recovery and data centre operational continuity

  • J. Caballero Bejar
  • C. Caramarcu
  • J. De Stefano
  • M. Ernst
  • J. Fetzko
  • C. Gamboa
  • C. Hollowell
  • J. Hover
  • S. Kandasamy
  • M. Karasawa
  • Z. Liu
  • S. Misawa
  • W. Strecker-Kellogg
  • O. Rind
  • J. Smith
  • T. Wlodek
  • A. Wong
  • D. Yu
  • A. Zaytsev
  • X. Zhao

Research output: Contribution to journal › Conference article › peer-review

1 Scopus citation

Abstract

The RHIC and ATLAS Computing Facility (RACF) at Brookhaven Lab is a dedicated data center serving the needs of the RHIC and US ATLAS community. Since it began operations in the mid-1990s, it has operated continuously with few unplanned downtimes. In the past 15 months, Brookhaven Lab has been affected by two hurricanes and a record-breaking snowstorm. In this presentation, we discuss lessons learned regarding preparedness for natural or man-made disasters, operational continuity, remote access and safety protocols, including the overall operational procedures developed as a result of these recent events.

Original language: English (US)
Article number: 062052
Journal: Journal of Physics: Conference Series
Volume: 513
Issue number: TRACK 6
State: Published - 2014
Event: 20th International Conference on Computing in High Energy and Nuclear Physics, CHEP 2013 - Amsterdam, Netherlands
Duration: Oct 14 2013 to Oct 18 2013

All Science Journal Classification (ASJC) codes

  • General Physics and Astronomy
