Introducing a parallel cache oblivious blocking approach for the lattice Boltzmann method (Q929255)

From MaRDI portal
scientific article
Language Label Description Also known as
English
Introducing a parallel cache oblivious blocking approach for the lattice Boltzmann method
scientific article

    Statements

    Introducing a parallel cache oblivious blocking approach for the lattice Boltzmann method (English)
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    16 June 2008
    0 references
    Summary: We propose a parallel cache oblivious spatial and temporal blocking algorithm for the lattice Boltzmann method in three spatial dimensions. The algorithm has originally been proposed by \textit{M. Frigo} et al. [ACM Trans. Algorithms 8, No. 1, Paper No. 4, 22 p. (2012; Zbl 1295.68236)] and divides the space-time domain of stencil-based methods in an optimal way, independently of any external parameters, e.g., cache size. In view of the increasing gap between processor speed and memory performance this approach offers a promising path to increase cache utilisation. We find that even a straightforward cache oblivious implementation can reduce memory traffic at least by a factor of two if compared to a highly optimised standard kernel and improves scalability for shared memory parallelisation. Due to the recursive structure of the algorithm we use an unconventional parallelisation scheme based on task queuing.
    0 references
    0 references
    0 references
    0 references
    0 references
    lattice Boltzmann
    0 references
    cache optimisation
    0 references
    cache oblivious
    0 references
    multi core
    0 references
    task queuing
    0 references
    shared memory parallelisation
    0 references