Imperial College London > Talks@ee.imperial > CAS Talks > HiPC Practice Talk: Balancing locality and concurrency: solving sparse triangular systems on GPUs'
Log inImperial users Other users No account?Information onFinding a talk Adding a talk Syndicating talks Who we are Everything else |
HiPC Practice Talk: Balancing locality and concurrency: solving sparse triangular systems on GPUs'Add to your list(s) Download to your calendar using vCal
If you have a question about this talk, please contact George A Constantinides. Many numerical optimisation problems rely on fast algorithms for solving sparse triangular systems of linear equations (STLs). To accelerate the solution of such equations, two types of approaches have been used: on GPUs, concurrency has been prioritised to the disadvantage of data locality, while on multi-core CPUs, data locality has been prioritised to the disadvantage of concurrency. In this paper, we discuss the interaction between data locality and concurrency in the solution of STLs on GPUs, and we present a new algorithm that balances both. We demonstrate empirically that, subject to there being enough concurrency available in the input matrix, our algorithm outperforms Nvidia’s concurrencyprioritising CUSPARSE algorithm for GPUs. Experimental results show a maximum speedup of 5.8-fold. Our solution algorithm, which we have implemented in OpenCL, requires a pre-processing phase that partitions the graph associated with the input matrix into sub-graphs, whose data can be stored in low-latency local memories. This preliminary analysis phase is expensive, but because it depends only on the input matrix, its cost can be amortised when solving for many different right-hand sides. This talk is part of the CAS Talks series. This talk is included in these lists:
Note that ex-directory lists are not shown. |
Other listsTalks Type the title of a new list here CAS TalksOther talksPerceptual Sound Field Recording, Reconstruction, and Synthesis Holographic Radar and the Detection of micro-drone targets Models and Capacity Bounds for Optical Fiber Channels (Information Theory Society Distinguished Lecture) |