Multi-GPU implementation of a time-explicit finite volume solver using CUDA and a CUDA-aware version of OpenMPI with application to shallow water flows
From MaRDI portal
Publication:6156956
Abstract: This paper shows the development of a multi-GPU version of a time-explicit finite volume solver for the Shallow-Water Equations (SWE) on a multi-GPU architecture. MPI is combined with CUDA-Fortran in order to use as many GPUs as needed. The METIS library is leveraged to perform a domain decomposition on the 2D unstructured triangular meshes of interest. A CUDA-Aware OpenMPI version is adopted to speed up the messages between the MPI processes. A study of both speed-up and efficiency is conducted; first, for a classic dam-break flow in a canal, and then for two real domains with complex bathymetries: the Mille ^Iles river and the Montreal archipelago. In both cases, meshes with up to 13 million cells are used. Using 24 to 28 GPUs on these meshes leads to an efficiency of 80% and more. Finally, the multi-GPU version is compared to the pure MPI multi-CPU version, and it is concluded that in this particular case, about 100 CPU cores would be needed to achieve the same performance as one GPU.
Cites work
- scientific article; zbMATH DE number 6376220 (Why is no real title available?)
- scientific article; zbMATH DE number 3932666 (Why is no real title available?)
- A Fast and Stable Well-Balanced Scheme with Hydrostatic Reconstruction for Shallow Water Flows
- A GPU accelerated adjoint-based optimizer for inverse modeling of the two-dimensional shallow water equations
- A multi-GPU parallel algorithm in hypersonic flow computations
- A well-balanced positivity preserving ``second-order scheme for shallow water flows on unstructured meshes
- An edge-based stabilized finite element method for solving compressible flows: formulation and parallel implementation.
- Efficient GPU implementation of a two waves TVD-WAF method for the two-dimensional one layer shallow water system on structured meshes
- Efficient shallow water simulations on GPUs: implementation, visualization, verification, and validation
- High resolution schemes for hyperbolic conservation laws
- High-order finite-element seismic wave propagation modeling with MPI on a large GPU cluster
- Modeling of wetting-drying transitions in free surface flows over complex topographies
- Non-hydrostatic pressure shallow flows: GPU implementation using finite volume and finite difference scheme
- Non-intrusive reduced-order modeling using uncertainty-aware deep neural networks and proper orthogonal decomposition: application to flood modeling
- On Upstream Differencing and Godunov-Type Schemes for Hyperbolic Conservation Laws
- Riemann Solvers and Numerical Methods for Fluid Dynamics
- Shock-capturing methods for free-surface shallow flows
- Simulation and visualization of the Saint-Venant system using GPUs
- Towards a generalised GPU/CPU shallow-flow modelling tool
- Zero mass error using unsteady wetting–drying conditions in shallow flows over dry irregular topography
Cited in
(6)- Heterogeneous CPU-GPU parallelization for modeling supersonic reacting flows with detailed chemical kinetics
- Finite-Difference in Time-Domain Scalable Implementations on CUDA and OpenCL
- On the benefits of using GPUs to simulate shallow flows with finite volume schemes
- Parallel high-order resolution of the shallow-water equations on real large-scale meshes with complex bathymetries
- Fully parallel and pipelined sparse direct solver for large symmetric indefinite finite element problems
- Hybrid OpenMP/AVX acceleration of a split HLL finite volume method for the shallow water and Euler equations
This page was built for publication: Multi-GPU implementation of a time-explicit finite volume solver using CUDA and a CUDA-aware version of OpenMPI with application to shallow water flows
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q6156956)