Hyperbolic diffusion in flux reconstruction: optimisation through kernel fusion within tensor-product elements
From MaRDI portal
Publication:6159601
Abstract: Novel methods are presented in this initial study for the fusion of GPU kernels in the artificial compressibility method (ACM), using tensor product elements with constant Jacobians and flux reconstruction. This is made possible through the hyperbolisation of the diffusion terms, which eliminates the expensive algorithmic steps needed to form the viscous stresses. Two fusion approaches are presented, which offer differing levels of parallelism. This is found to be necessary for the change in workload as the order of accuracy of the elements is increased. Several further optimisations of these approaches are demonstrated, including a generation time memory manager which maximises resource usage. The fused kernels are able to achieve 3-4 times speedup, which compares favourably with a theoretical maximum speedup of 4. In three dimensional test cases, the generated fused kernels are found to reduce total runtime by , and, when compared to the standard ACM formulation, simulations demonstrate that a speedup of times can be achieved.
Cites work
- A calculation procedure for heat, mass and momentum transfer in three- dimensional parabolic flows
- A comparison of vortex and pseudo-spectral methods for the simulation of periodic vortical flows at high Reynolds numbers
- A first-order system approach for diffusion equation. II: Unification of advection and diffusion
- A high-order cross-platform incompressible Navier-Stokes solver via artificial compressibility with application to a turbulent jet
- A new class of high-order energy stable flux reconstruction schemes
- A numerical method for solving incompressible viscous flow problems
- Advanced computational fluid and aerodynamics
- Effective high-order energy stable flux reconstruction methods for first-order hyperbolic linear and nonlinear systems
- Fast Poisson solvers for spectral methods
- Inline vector compression for computational physics
- Locally adaptive pseudo-time stepping for high-order flux reconstruction
- MFEM: a modular finite element methods library
- Multivariate polynomial approximation in the hypercube
- PyFR: an open source framework for solving advection-diffusion type problems on streaming architectures using the flux reconstruction approach
- Small-scale structure of the Taylor–Green vortex
- Spatial eigensolution analysis of discontinuous Galerkin schemes with practical insights for under-resolved computations and implicit LES
- The \textsc{deal.II} finite element library: design, features, and insights
Cited in
(2)
This page was built for publication: Hyperbolic diffusion in flux reconstruction: optimisation through kernel fusion within tensor-product elements
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q6159601)