Abstract
Massively parallel storage systems are becoming more and more prevalent on HPC systems due to the emergence of a new generation of data-intensive applications. To achieve the level of I/O throughput and capacity that is demanded by data intensive applications, storage systems typically deploy a large number of storage devices (also known as LUNs or data stores). In doing so, parallel applications are allowed to access storage concurrently, and as a result, the aggregate I/O throughput can be linearly increased with the number of storage devices, reducing the application’s end-to-end time. For a production system where storage devices are shared between multiple applications, contention is often a major problem leading to a significant reduction in I/O throughput. In this paper, we describe our efforts to resolve this issue in the context of HPC using a balanced re-routing + throttling approach. The proposed scheme re-routes I/O requests to a less congested storage location in a controlled manner so that write performance is improved while limiting the impact on read.
| Original language | English (US) |
|---|---|
| State | Published - 2013 |
| Externally published | Yes |
| Event | 5th USENIX Workshop on Hot Topics in Storage and File Systems, HotStorage 2013 - San Jose, United States Duration: Jun 27 2013 → Jun 28 2013 |
Conference
| Conference | 5th USENIX Workshop on Hot Topics in Storage and File Systems, HotStorage 2013 |
|---|---|
| Country/Territory | United States |
| City | San Jose |
| Period | 6/27/13 → 6/28/13 |
All Science Journal Classification (ASJC) codes
- Computer Networks and Communications
- Hardware and Architecture
- Information Systems
- Software