Skip to main navigation Skip to search Skip to main content

FPGA implementation for solving linear least square problem

  • Shaojun Wang*
  • , Qi Liu
  • , Xuejie Zhong
  • , Xiyuan Peng
  • *Corresponding author for this work
  • Harbin Institute of Technology

Research output: Contribution to journalArticlepeer-review

Abstract

Large calculation delay and poor parallelism greatly limit the solution efficiency of least square problem based on FPGA. We propose a novel approach of modified Cholesky factorization to solve this problem. With this approach, the least square problem is divided into matrix factorization part and triangle matrix solving part. The optimal parallelism is achieved by maximizing the amount of PEs (Processing Element) in each part. The calculation delay is decreased by avoiding the root operation and eliminating the division operation with modified Cholesky factorization. In triangle matrix solving part, the same PEs are used to solve both the upper triangle matrix and lower triangle matrix, which saves the FPGA resources. The experiments on Virtex XC5VFX130T FPGA with a 100 MHz clock show a speedup of 8× over a dual core CPU implementation in single-precision.

Original languageEnglish
Pages (from-to)701-707
Number of pages7
JournalYi Qi Yi Biao Xue Bao/Chinese Journal of Scientific Instrument
Volume33
Issue number3
StatePublished - Mar 2012

Keywords

  • FPGA
  • Least square problem
  • Modified Cholesky factorization

Fingerprint

Dive into the research topics of 'FPGA implementation for solving linear least square problem'. Together they form a unique fingerprint.

Cite this