EaseMiss: HW/SW Co-Optimization for Efficient Large Matrix-Matrix Multiply Operations

Ali Nezhadi, Shaahin Angizi, Arman Roohi

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Scopus citations

Abstract

Due to the essential role of matrix multiplication in many scientific applications, especially in data and compute -intensive applications, we explore the efficiency of highly used matrix production algorithms. This paper proposes an HW/SW co-optimization technique, entitled EaseMiss, to reduce the cache miss ratio for large general matrix-matrix multiplications. First, we revise the algorithms by applying three software optimization techniques to improve performance. Choosing the proper algorithms to achieve the best performance is examined and formulated. By leveraging the proposed optimizations, the number of cache misses decreases by a factor of 3 in a conventional data cache. To further improve, we then propose SPLiTCACHE to virtually split data cache regarding matrices' dimensions for better data reuse. This method can be easily embedded into conventional general-purpose processors or GPUs at the cost of negligible logical circuit overhead. After using the correct and valid splitting, the obtained results show that the cache misses reduce by a factor of 2 compared to the conventional data cache on average in the machine learning workloads.

Original languageEnglish (US)
Title of host publicationProceedings of the 2022 IEEE Dallas Circuits and Systems Conference, DCAS 2022
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781665498852
DOIs
StatePublished - 2022
Event15th IEEE Dallas Circuits and Systems Conference, DCAS 2022 - Richardson, United States
Duration: Jun 17 2022Jun 19 2022

Publication series

NameProceedings of the 2022 IEEE Dallas Circuits and Systems Conference, DCAS 2022

Conference

Conference15th IEEE Dallas Circuits and Systems Conference, DCAS 2022
Country/TerritoryUnited States
CityRichardson
Period6/17/226/19/22

All Science Journal Classification (ASJC) codes

  • Artificial Intelligence
  • Computer Science Applications
  • Hardware and Architecture
  • Electrical and Electronic Engineering
  • Safety, Risk, Reliability and Quality

Fingerprint

Dive into the research topics of 'EaseMiss: HW/SW Co-Optimization for Efficient Large Matrix-Matrix Multiply Operations'. Together they form a unique fingerprint.

Cite this