Optimization Techniques and Performance Analyses of two Life Science Algorithms for Novel GPU Architectures

Optimization Techniques and Performance Analyses of two Life Science Algorithms for Novel GPU Architectures

Abstract

In this paper we evaluate two life science algorithms, namely Needleman-Wunsch sequence alignment and Direct Coulomb Summation, for GPUs. Whereas for Needleman-Wunsch it is difficult to get good performance numbers, Direct Coulomb Summation is particularly suitable for graphics cards. We present several optimization techniques, analyze the theoretical potential of the optimizations with respect to the algorithms, and measure the effect on execution times. We target the recent NVIDIA Fermi architecture to evaluate the performance impacts of novel hardware features like the cache subsystem on optimizing transformations. We compare the execution times of CUDA and OpenCL code versions for Fermi and predecessor models with parallel OpenMP versions executed on the main CPU.

Grafik Top
Authors
  • Dilch, David
  • Mehofer, Eduard
Grafik Top
Shortfacts
Category
Paper in Conference Proceedings or in Workshop Proceedings (Paper)
Event Title
20th Euromicro International Conference on Parallel, Distributed and Network-based Processing
Divisions
Scientific Computing
Subjects
Parallele Datenverarbeitung
Rechnerarchitektur
Event Location
Garching, Germany
Event Type
Conference
Event Dates
15-17 February 2012
Publisher
IEEE Computer Society
Date
February 2012
Export
Grafik Top