CaKernel – A parallel application programming framework for heterogenous computing architectures

Blazewicz, Marek; Brandt, Steven R.; Kierzynka, Michal; Kurowski, Krzysztof; Ludwiczak, Bogdan; Tao, Jian; Weglarz, Jan

doi:10.3233/SPR-2011-0333

CaKernel – A parallel application programming framework for heterogenous computing architectures

Article type: Research Article

Affiliations: Poznań Supercomputing and Networking Center, Poznań, Poland | Center for Computation and Technology, Louisiana State University, Baton Rouge, LA, USA | Department of Computer Science, Louisiana State University, Baton Rouge, LA, USA | Institute of Computing Science, Poznań University of Technology, Poznań, Poland

Note: [] Corresponding author: M. Blazewicz, E-mail: marqs@man. poznan.pl.

Abstract: With the recent advent of new heterogeneous computing architectures there is still a lack of parallel problem solving environments that can help scientists to use easily and efficiently hybrid supercomputers. Many scientific simulations that use structured grids to solve partial differential equations in fact rely on stencil computations. Stencil computations have become crucial in solving many challenging problems in various domains, e.g., engineering or physics. Although many parallel stencil computing approaches have been proposed, in most cases they solve only particular problems. As a result, scientists are struggling when it comes to the subject of implementing a new stencil-based simulation, especially on high performance hybrid supercomputers. In response to the presented need we extend our previous work on a parallel programming framework for CUDA – CaCUDA that now supports OpenCL. We present CaKernel – a tool that simplifies the development of parallel scientific applications on hybrid systems. CaKernel is built on the highly scalable and portable Cactus framework. In the CaKernel framework, Cactus manages the inter-process communication via MPI while CaKernel manages the code running on Graphics Processing Units (GPUs) and interactions between them. As a non-trivial test case we have developed a 3D CFD code to demonstrate the performance and scalability of the automatically generated code.

Keywords: GPGPU programming, computational framework, HPC, finite difference, stencil computation, CUDA, OpenCL

DOI: 10.3233/SPR-2011-0333

Journal: Scientific Programming, vol. 19, no. 4, pp. 185-197, 2011

Published: 2011

Price: EUR 27.50

North America

IOS Press, Inc.
6751 Tepper Drive
Clifton, VA 20124
USA

Tel: +1 703 830 6300
Fax: +1 703 830 2300
[email protected]

For editorial issues, like the status of your submitted paper or proposals, write to [email protected]

Europe

IOS Press
Nieuwe Hemweg 6B
1013 BG Amsterdam
The Netherlands

Tel: +31 20 688 3355
Fax: +31 20 687 0091
[email protected]

For editorial issues, permissions, book requests, submissions and proceedings, contact the Amsterdam office [email protected]

Asia

Inspirees International (China Office)
Ciyunsi Beili 207(CapitaLand), Bld 1, 7-901
100025, Beijing
China

Free service line: 400 661 8717
Fax: +86 10 8446 7947
[email protected]

For editorial issues, like the status of your submitted paper or proposals, write to [email protected]

如果您在出版方面需要帮助或有任何建, 件至: [email protected]

Share this:

North America

Europe

Asia