Abstract
Recent advances in field-programmable gate array (FPGA) technologies have made feasible the implementation of low-cost parallel computing platforms for high-performance matrix computations. Compared to conventional multiprocessor systems, the resulting multiprocessors-on-a-programmable-chip (MPoPC) can provide unique advantages and opportunities in both software and hardware. It is shown in this paper that the performance of an MPoPC can be improved dramatically by adapting slightly intellectual property (IP)-based processing elements, and customising the memory and the interconnection network. The parallel LU factorisation of large, sparse doubly-bordered block diagonal (DBBD) matrices is employed as an application example. To enhance further the performance by software techniques, a run-time load balancing strategy for this algorithm is proposed and analysed. Extensive experimental results on benchmark matrices of size up to 7,917 x 7,917 for power networks demonstrate the effectiveness of our effort.
Original language | English (US) |
---|---|
Pages (from-to) | 181-191 |
Number of pages | 11 |
Journal | International Journal of Computational Science and Engineering |
Volume | 10 |
Issue number | 1 |
DOIs | |
State | Published - Jan 1 2015 |
All Science Journal Classification (ASJC) codes
- Software
- Modeling and Simulation
- Hardware and Architecture
- Computational Mathematics
- Computational Theory and Mathematics
Keywords
- DBBD
- Doubly-bordered block diagonal
- Dynamic load balancing
- FPGA
- Field-programmable gate array
- MPoPC
- Multiprocessors-on-a-programmable-chip
- Parallel LU factorisation