PetFMM -- a dynamically load-balancing parallel fast multipole library
From MaRDI portal
Publication:3017992
Abstract: Fast algorithms for the computation of -body problems can be broadly classified into mesh-based interpolation methods, and hierarchical or multiresolution methods. To this last class belongs the well-known fast multipole method (FMM), which offers O(N) complexity. This paper presents an extensible parallel library for -body interactions utilizing the FMM algorithm, built on the framework of PETSc. A prominent feature of this library is that it is designed to be extensible, with a view to unifying efforts involving many algorithms based on the same principles as the FMM and enabling easy development of scientific application codes. The paper also details an exhaustive model for the computation of tree-based -body algorithms in parallel, including both work estimates and communications estimates. With this model, we are able to implement a method to provide automatic, a priori load balancing of the parallel execution, achieving optimal distribution of the computational work among processors and minimal inter-processor communications. Using a client application that performs the calculation of velocity induced by vortex particles, ample verification and testing of the library was performed. Strong scaling results are presented with close to a million particles in up to 64 processors, including both speedup and parallel efficiency. The library is currently able to achieve over 85% parallel efficiency for 64 processors. The software library is open source under the PETSc license; this guarantees the maximum impact to the scientific community and encourages peer-based collaboration for the extensions and applications.
Recommendations
- A parallel version of the fast multipole method
- Massively parallel implementation of a fast multipole method for distributed memory machines
- Fast multipole method for particle interactions: an open source parallel library component
- On a probabilistic approach to the Schrödinger equation with a time-dependent potential
- PVFMM: A parallel kernel independent FMM for particle and volume potentials
Cites work
- scientific article; zbMATH DE number 41859 (Why is no real title available?)
- A Fast Adaptive Multipole Algorithm for Particle Simulations
- A Generalized Fast Multipole Method for Nonoscillatory Kernels
- A fast algorithm for particle simulations
- A kernel-independent adaptive fast multipole algorithm in two and three dimensions
- A parallel implementation of the fast multipole method for Maxwell's equations
- A parallel version of the fast multipole method
- Advances in viscous vortex methods?meshless spatial adaption based on radial basis function interpolation
- An Implementation of the Fast Multipole Method without Multipoles
- Characterization of the accuracy of the fast multipole method in particle simulations
- Emergence and evolution of tripole vortices from net-circulation initial conditions
- Massively parallel implementation of a fast multipole method for distributed memory machines
- Provably Good Partitioning and Load Balancing Algorithms for Parallel Adaptive N-Body Simulation
- The black-box fast multipole method
Cited in
(17)- A distributed kernel summation framework for general‐dimension machine learning
- Fast and scalable evaluation of pairwise potentials
- RPYFMM: parallel adaptive fast multipole method for Rotne-Prager-Yamakawa tensor in biomolecular hydrodynamics simulations
- Computational software: simple FMM libraries for electrostatics, slow viscous flow, and frequency-domain wave propagation
- PVFMM: A parallel kernel independent FMM for particle and volume potentials
- PPM -- a highly efficient parallel particle-mesh library for the simulation of continuum systems
- On a probabilistic approach to the Schrödinger equation with a time-dependent potential
- Distributed and adaptive fast multipole method in three dimensions
- Fast multipole method for particle interactions: an open source parallel library component
- Biomolecular electrostatics using a fast multipole BEM on up to 512 GPUs and a billion unknowns
- Revision of DASHMM: dynamic adaptive system for hierarchical multipole methods
- PetFMM
- DASHMM: dynamic adaptive system for hierarchical multipole methods
- Optimizing the multipole-to-local operator in the fast multipole method for graphical processing units
- scientific article; zbMATH DE number 2080877 (Why is no real title available?)
- A parallel version of the fast multipole method
- A parallel fast multipole method for a space-time boundary element method for the heat equation
This page was built for publication: PetFMM -- a dynamically load-balancing parallel fast multipole library
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q3017992)