Imperial College London > Talks@ee.imperial > CAS Talks > Optimising Memory Bandwidth Use for Matrix-Vector Multiplication in Iterative Methods

Optimising Memory Bandwidth Use for Matrix-Vector Multiplication in Iterative Methods

Add to your list(s) Download to your calendar using vCal

If you have a question about this talk, please contact George A Constantinides.

Accelerating iterative methods using FPG As has previously been studied in detail within this research group, with large performance improvements achieved over general purpose processors. In these works, this performance gain is largely a result of parallelisation of the matrix-vector multiplication; parallelism that mainly results from taking advantage of the nature of iterative methods which allow the use of on-chip memory buffers to increase the bandwidth. Unfortunately, the RAM available on FPG As has limited this high performance to only relatively small matrices. This work examines simple hardware changes that take advantage of symmetrical and banded matrix structure, as well as methods to optimise the RAM use in order to both increase the performance and retain this performance for larger order matrices.

This talk is part of the CAS Talks series.

Tell a friend about this talk:

This talk is included in these lists:

Note that ex-directory lists are not shown.

 

Changes to Talks@imperial | Privacy and Publicity