Multicore-optimized wavefront diamond blocking for optimizing stencil updates
DOI: 10.1137/140991133; zbMATH Open: 1331.68286; arXiv: 1410.3060; OpenAlex: W1506424797; MaRDI QID: Q5264147; FDO: Q5264147
Authors: Tahir Malas, Georg Hager, Hatem Ltaief, Holger Stengel, Gerhard Wellein, D. E. Keyes
Publication date: 20 July 2015
Published in: SIAM Journal on Scientific Computing
Full work available at URL: https://arxiv.org/abs/1410.3060
Recommendations
- Optimization and Performance Modeling of Stencil Computations on Modern Microprocessors
- Algorithm 942
- Modeling the performance of geometric multigrid stencils on multicore computer architectures
- Introducing a parallel cache oblivious blocking approach for the lattice Boltzmann method
- Locally recursive non-locally asynchronous algorithms for stencil computation
multicore, stencil computations, energy-efficient algorithms, diamond tiling, temporal blocking, wavefront parallelization
Parallel numerical computation (65Y05); Analysis of algorithms and problem complexity (68Q25); Performance evaluation, queueing, and scheduling in the context of computer systems (68M20); Parallel algorithms in computer science (68W10); Distributed algorithms (68W15); Distributed systems (68M14)
Cited In (6)
- Optimization and Performance Modeling of Stencil Computations on Modern Microprocessors
- Locally recursive non-locally asynchronous algorithms for stencil computation
- Designing a 3D parallel memory-aware lattice Boltzmann algorithm on manycore systems
- A new memory mapping mechanism for GPGPUs' stencil computation
- Accelerating stencil computation on GPGPU by novel mapping method between the global memory and the shared memory
- Accelerating solutions of one-dimensional unsteady PDEs with GPU-based swept time-space decomposition