TY - GEN
T1 - HP-MDR
T2 - 2025 International Conference for High Performance Computing, Networking, Storage, and Analysis, SC 2025
AU - Li, Yanliang
AU - Li, Wenbo
AU - Gong, Qian
AU - Liu, Qing
AU - Podhorszki, Norbert
AU - Klasky, Scott
AU - Liang, Xin
AU - Chen, Jieyang
N1 - Publisher Copyright:
© 2025 Copyright held by the owner/author(s).
PY - 2025/11/15
Y1 - 2025/11/15
N2 - Scientific applications produce vast amounts of data, posing grand challenges in the underlying data management and analytic tasks. Progressive compression is a promising way to address this problem, as it allows for on-demand data retrieval with significantly reduced data movement cost. However, most existing progressive methods are designed for CPUs, leaving a gap for them to unleash the power of today's heterogeneous computing systems with GPUs.In this work, we propose HP-MDR, a high-performance and portable data refactoring and progressive retrieval framework for GPUs. Our contributions are four-fold: (1) We carefully optimize the bitplane encoding and lossless encoding, two key stages in progressive methods, to achieve high performance on GPUs; (2) We propose pipeline optimization and incorporate it with data refactoring and progressive retrieval workflows to further enhance the performance for large data process; (3) We leverage our framework to enable high-performance data retrieval with guaranteed error control for common Quantities of Interest; (4) We evaluate HP-MDR and compare it with state of the arts using five real-world datasets. Experimental results demonstrate that HP-MDR delivers an average 13.68× and 6.31× throughput in data refactoring and progressive retrieval tasks, respectively. It also leads to 11.22× throughput for recomposing required data representations under Quantity-of-Interest error control and 6.04× performance for the corresponding end-to-end data retrieval, when compared with state-of-the-art solutions.
AB - Scientific applications produce vast amounts of data, posing grand challenges in the underlying data management and analytic tasks. Progressive compression is a promising way to address this problem, as it allows for on-demand data retrieval with significantly reduced data movement cost. However, most existing progressive methods are designed for CPUs, leaving a gap for them to unleash the power of today's heterogeneous computing systems with GPUs.In this work, we propose HP-MDR, a high-performance and portable data refactoring and progressive retrieval framework for GPUs. Our contributions are four-fold: (1) We carefully optimize the bitplane encoding and lossless encoding, two key stages in progressive methods, to achieve high performance on GPUs; (2) We propose pipeline optimization and incorporate it with data refactoring and progressive retrieval workflows to further enhance the performance for large data process; (3) We leverage our framework to enable high-performance data retrieval with guaranteed error control for common Quantities of Interest; (4) We evaluate HP-MDR and compare it with state of the arts using five real-world datasets. Experimental results demonstrate that HP-MDR delivers an average 13.68× and 6.31× throughput in data refactoring and progressive retrieval tasks, respectively. It also leads to 11.22× throughput for recomposing required data representations under Quantity-of-Interest error control and 6.04× performance for the corresponding end-to-end data retrieval, when compared with state-of-the-art solutions.
KW - High-performance computing
KW - advanced GPUs
KW - progressive compression
KW - scientific data
UR - https://www.scopus.com/pages/publications/105023963671
UR - https://www.scopus.com/pages/publications/105023963671#tab=citedBy
U2 - 10.1145/3712285.3759845
DO - 10.1145/3712285.3759845
M3 - Conference contribution
AN - SCOPUS:105023963671
T3 - Proceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis, SC 2025
SP - 2076
EP - 2093
BT - Proceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis, SC 2025
PB - Association for Computing Machinery, Inc
Y2 - 16 November 2025 through 21 November 2025
ER -