Optimization Techniques and Performance Analyses of two Life Science Algorithms for Novel GPU Architectures
In this paper we evaluate two life science algorithms, namely Needleman-Wunsch sequence alignment and Direct Coulomb Summation, for GPUs. Whereas for Needleman-Wunsch it is difficult to get good performance numbers, Direct Coulomb Summation is particularly suitable for graphics cards. We present several optimization techniques, analyze the theoretical potential of the optimizations with respect to the algorithms, and measure the effect on execution times. We target the recent NVIDIA Fermi architecture to evaluate the performance impacts of novel hardware features like the cache subsystem on optimizing transformations. We compare the execution times of CUDA and OpenCL code versions for Fermi and predecessor models with parallel OpenMP versions executed on the main CPU.
Top- Dilch, David
- Mehofer, Eduard
Category |
Paper in Conference Proceedings or in Workshop Proceedings (Paper) |
Event Title |
20th Euromicro International Conference on Parallel, Distributed and Network-based Processing |
Divisions |
Scientific Computing |
Subjects |
Parallele Datenverarbeitung Rechnerarchitektur |
Event Location |
Garching, Germany |
Event Type |
Conference |
Event Dates |
15-17 February 2012 |
Publisher |
IEEE Computer Society |
Date |
February 2012 |
Export |