TY - GEN
T1 - Performance-energy tradeoffs for matrix multiplication on FPGA-based mixed-mode chip multiprocessors
AU - Wang, Xiaofang
AU - Ziavras, Sotirios G.
N1 - Copyright:
Copyright 2008 Elsevier B.V., All rights reserved.
PY - 2007
Y1 - 2007
N2 - Chip multiprocessing has demonstrated to be a promising approach in microprocessor design. With ever-increasing concerns for energy consumption, performance-energy trade-offs are often necessary, especially in the design of real-time embedded systems. This paper presents our performance and energy study on an in-house developed FPGA-based mixed-mode chip multiprocessor, where the SIMD (Single-Instruction, Multiple-Data), MIMD (Multiple-Instruction, Multiple-Data) and M-SIMD (Multiple-SIMD) computing modes can coexist simultaneously in one system. We propose performance-energy trade-off techniques based on the observation that SIMD and MIMD tasks involve substantially different granularities of computation and communication, which result in different time and energy behaviors; this provides us with opportunities to realize various performance-energy objectives. Generalized matrix-matrix multiplication (MMM) is employed as an example to illustrate our approach. Experimental results on a Xilinx Virtex II XC2V6000-5 FPGA demonstrate the effectiveness of the proposed approach.
AB - Chip multiprocessing has demonstrated to be a promising approach in microprocessor design. With ever-increasing concerns for energy consumption, performance-energy trade-offs are often necessary, especially in the design of real-time embedded systems. This paper presents our performance and energy study on an in-house developed FPGA-based mixed-mode chip multiprocessor, where the SIMD (Single-Instruction, Multiple-Data), MIMD (Multiple-Instruction, Multiple-Data) and M-SIMD (Multiple-SIMD) computing modes can coexist simultaneously in one system. We propose performance-energy trade-off techniques based on the observation that SIMD and MIMD tasks involve substantially different granularities of computation and communication, which result in different time and energy behaviors; this provides us with opportunities to realize various performance-energy objectives. Generalized matrix-matrix multiplication (MMM) is employed as an example to illustrate our approach. Experimental results on a Xilinx Virtex II XC2V6000-5 FPGA demonstrate the effectiveness of the proposed approach.
UR - http://www.scopus.com/inward/record.url?scp=34548131030&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=34548131030&partnerID=8YFLogxK
U2 - 10.1109/ISQED.2007.119
DO - 10.1109/ISQED.2007.119
M3 - Conference contribution
AN - SCOPUS:34548131030
SN - 0769527957
SN - 9780769527956
T3 - Proceedings - Eighth International Symposium on Quality Electronic Design, ISQED 2007
SP - 386
EP - 391
BT - Proceedings - Eighth International Symposium on Quality Electronic Design, ISQED 2007
T2 - 8th International Symposium on Quality Electronic Design, ISQED 2007
Y2 - 26 March 2007 through 28 March 2007
ER -