alpaka - An Abstraction Library for Parallel Kernel Acceleration

Its aim is to provide performance portability across accelerators by abstracting the underlying levels of parallelism.

The library is platform-independent and supports the concurrent and cooperative use of multiple devices, including host CPUs (x86, ARM, RISC-V, and Power8+) and GPUs from different vendors (NVIDIA, AMD, and Intel). A variety of accelerator backends—CUDA, HIP, SYCL, OpenMP, and serial execution—are available and can be selected based on the target device. Only a single implementation of a user kernel is required, expressed as a function object with a standardized interface. This eliminates the need to write specialized CUDA, HIP, SYCL, OpenMP, or threading code. Moreover, multiple accelerator backends can be combined to target different vendor hardware within a single system and even within a single application.

Caution

The readthedocs pages are provided with best effort, but may contain outdated sections.

alpaka - How to Read This Document

Generally, follow the manual pages in-order to get started. Individual chapters are based on the information of the chapters before.

The online version of this document is versioned and shows by default the manual of the last stable version of alpaka. If you are looking for the latest development version, click here.

Note

Are you looking for our latest Doxygen docs for the API?

See https://alpaka-group.github.io/alpaka3/

Tutorial - Getting Started

Tutorial - Advanced

Advanced

Developer

Contribution

API Reference

alpaka - How to Read This Document

Indices and Tables