Remote memory access: A case for portable, efficient and library independent parallel programming

Research output: Contribution to journalArticlepeer-review

6 Scopus citations


In this work we make a strong case for remote memory access (RMA) as the effective way to program a parallel computer by proposing a framework that supports RMA in a library independent, simple and intuitive way. If one uses our approach the parallel code one writes will run transparently under MPI-2 enabled libraries but also bulk-synchronous parallel libraries. The advantage of using RMA is code simplicity, reduced programming complexity, and increased efficiency. We support the latter claims by implementing under this framework a collection of benchmark programs consisting of a communication and synchronization performance assessment program, a dense matrix multiplication algorithm, and two variants of a parallel radix-sort algorithm and examine their performance on a LINUX-based PC cluster under three different RMA enabled libraries: LAM MPI, BSPlib, and PUB. We conclude that implementations of such parallel algorithms using RMA communication primitives lead to code that is as efficient as the message-passing equivalent code and in the case of radix-sort substantially more efficient. In addition our work can be used as a comparative study of the relevant capabilities of the three libraries.

Original languageEnglish (US)
Pages (from-to)169-183
Number of pages15
JournalScientific Programming
Issue number3
StatePublished - 2004

All Science Journal Classification (ASJC) codes

  • Software
  • Computer Science Applications


Dive into the research topics of 'Remote memory access: A case for portable, efficient and library independent parallel programming'. Together they form a unique fingerprint.

Cite this