Skip to main navigation Skip to search Skip to main content

CPU versus GPU: which can perform matrix computation faster—performance comparison for basic linear algebra subprograms

  • Harbin Institute of Technology Shenzhen
  • City University of Hong Kong

Research output: Contribution to journalArticlepeer-review

Abstract

Matrix computing is the core component of machine learning and artificial intelligence. Fast matrix computations can facilitate many large-scale computational projects greatly. Basic linear algebra subprograms (BLAS) are proposed, which classify different matrices and provide a standardized interface. Currently, the most commonly used heterogeneous computing platforms are central processing unit (CPU) and graphics processing unit (GPU). At present, BLAS has been implemented on both CPU and GPU. However, due to the different characteristics of algorithms and hardware, a particular matrix method should be designed for a particular processor. It is important to choose the right processor for a particular matrix computation. This paper first briefly reviews the BLAS, and then introduces architecture and optimization methods of CPU and GPU. The effect of different subroutines in BLAS is studied through experiments. Finally, we discuss the reasons and the processor selection scheme of matrix computations.

Original languageEnglish
Pages (from-to)4353-4365
Number of pages13
JournalNeural Computing and Applications
Volume31
Issue number8
DOIs
StatePublished - 1 Aug 2019
Externally publishedYes

Keywords

  • Basic linear algebra subprograms
  • CPU
  • GPU
  • Matrix computation

Fingerprint

Dive into the research topics of 'CPU versus GPU: which can perform matrix computation faster—performance comparison for basic linear algebra subprograms'. Together they form a unique fingerprint.

Cite this