A high-performance, portable implementation of the MPI message passing interface standard
From MaRDI portal
Publication:671443
DOI10.1016/0167-8191(96)00024-5zbMath0875.68206OpenAlexW2081612620WikidataQ62380985 ScholiaQ62380985MaRDI QIDQ671443
Anthony Skjellum, William D. Gropp, Ewing L. Lusk, Nathan E. Doss
Publication date: 27 February 1997
Published in: Parallel Computing (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1016/0167-8191(96)00024-5
Related Items (only showing first 100 items - show all)
Nagging: A scalable fault-tolerant paradigm for distributed search ⋮ Multi-plume flow simulation of small bipropellant thrusters using parallel DSMC method ⋮ Evaluation of selected resource allocation and scheduling methods in heterogeneous many-core processors and graphics processing units ⋮ SABR/LIBOR market models: pricing and calibration for some interest rate derivatives ⋮ Nonparametric Bayesian learning of heterogeneous dynamic transcription factor networks ⋮ Parallel computation of aeroacoustics of industrially relevant complex-geometry aeroengine jets ⋮ Synchronous parallelization of particle swarm optimization with digital pheromones ⋮ Performance analysis of a parallel finite element solution to the direct numerical simulation of fluid turbulence on linux PC clusters ⋮ Explorative and dynamic visualization of data in virtual reality ⋮ Parallel adaptive simplical re-meshing for deforming domain CFD computations ⋮ A modelling approach to explore the critical environmental parameters influencing the growth and establishment of the invasive seaweed \textit{Undaria pinnatifida} in Europe ⋮ Interconnection network simulation using traces of MPI applications ⋮ Multidimensional integration through Markovian sampling under steered function morphing: a physical guise from statistical mechanics ⋮ Reordering of hybrid unstructured grids for an implicit Navier-Stokes solver based on OpenMP parallelization ⋮ A comparison between the surface compression method and an interface reconstruction method for the VOF approach ⋮ Optimized high-order derivative and dissipation operators satisfying summation by parts, and applications in three-dimensional multi-block evolutions ⋮ A variationally bounded scheme for delayed detached eddy simulation: application to vortex-induced vibration of offshore riser ⋮ Highly scalable DNS solver for turbulent bubble-laden channel flow ⋮ Model-based fault localization: finding behavioral outliers in large-scale computing systems ⋮ QCMPI: A parallel environment for quantum computing ⋮ Parallel solution of large-scale free surface viscoelastic flows via sparse approximate inverse preconditioning ⋮ An efficient parallel and fully implicit algorithm for the simulation of transient free-surface flows of multimode viscoelastic liquids ⋮ Numerical performance of parallel group explicit solvers for the solution of fourth order elliptic equations ⋮ Parallel framework for topology optimization using the method of moving asymptotes ⋮ Estimation of dynamical characteristics of a parallel program on a model ⋮ Corba and MPI code coupling ⋮ A balanced accumulation scheme for parallel PDE solvers ⋮ A massively parallel geometric multigrid solver on hierarchically distributed grids ⋮ \textit{UG} 4: a novel flexible software system for simulating PDE based models on high performance computers ⋮ Large-scale rigid body simulations ⋮ Formal specification of MPI 2.0: case study in specifying a practical concurrent programming API ⋮ Parallel PIPS-SBB: multi-level parallelism for stochastic mixed-integer programs ⋮ Domain decomposition methods using dual conversion for the total variation minimization with \(L^1\) fidelity term ⋮ PEBBL: an object-oriented framework for scalable parallel branch and bound ⋮ Parallel computing in the statistical system Jasp ⋮ Data race avoidance and replay scheme for developing and debugging parallel programs on distributed shared memory systems ⋮ Distributed disk-based algorithms for model checking very large Markov chains ⋮ On optimization of finite-difference time-domain (FDTD) computation on heterogeneous and GPU clusters ⋮ Performance evaluation of OpenMP-based algorithms for handling Kronecker descriptors ⋮ FEVS: a functional equivalence verification suite for high-performance scientific computing ⋮ Fault tolerant algorithms for heat transfer problems ⋮ Parallel computation of the eigenvalues of symmetric Toeplitz matrices through iterative methods ⋮ Large-scale parallel numerical integration ⋮ Implementation of ensemble-based simulated annealing with dynamic load balancing under MPI ⋮ An easily implemented task-based parallel scheme for the Fourier pseudospectral solver applied to 2D Navier-Stokes turbulence. ⋮ A computationally efficient, consistent bootstrap for inference with non-parametric DEA estimators ⋮ A distributed memory parallel element-by-element scheme for semiconductor device simulation ⋮ A parallel subdomain by subdomain implementation of the implicitly restarted Arnoldi/Lanczos method ⋮ Parallel integer relation detection: Techniques and applications ⋮ High performance computing for the level-set reconstruction algorithm ⋮ A program for sequential allocation of three Bernoulli populations ⋮ A computational periporomechanics model for localized failure in unsaturated porous media ⋮ \(p\)-multigrid with partial smoothing: an efficient preconditioner for discontinuous Galerkin discretizations with modal bases ⋮ Implementation and scalability analysis of balancing domain decomposition methods ⋮ An optimal multiprocessor combinatorial auction solver ⋮ Alternating criteria search: a parallel large neighborhood search algorithm for mixed integer programs ⋮ Multi-core CPUs, clusters, and grid computing: A tutorial ⋮ OpenMP + MPI parallel implementation of a numerical method for solving a kinetic equation ⋮ A moving mesh finite volume interface tracking method for surface tension dominated interfacial fluid flow ⋮ Scheduling of hard real-time multi-phase multi-thread (MPMT) periodic tasks ⋮ Reachability computation for polynomial dynamical systems ⋮ Scalable parallel implementation of CISAMR: a non-iterative mesh generation algorithm ⋮ High-precision numerical integration: progress and challenges ⋮ Parallel methods for optimality criteria-based topology optimization ⋮ Broadcasting on networks of workstations ⋮ A parallel/recursive algorithm ⋮ A denotational semantics of textually aligned SPMD programs ⋮ Modeling fractured and faulted regions: local grid refinement methods for implicit solvers ⋮ Dynamical geometry for multiscale dissipative particle dynamics ⋮ Multibillion-atom molecular dynamics simulation: design considerations for vector-parallel processing ⋮ Efficient first-principles calculations of the electronic structure of periodic systems ⋮ A superposition-based parallel discrete operator splitting method for incompressible flows ⋮ A taxonomy and survey of grid resource management systems for distributed computing ⋮ Instrumenting and tuningdataView?a networked application for navigating through large scientific datasets ⋮ Parallel calculation of accurate path lines in virtual environments through exploitation of multi-block CFD data set topology ⋮ Dynamic symbolic verification of MPI programs ⋮ Solving stable Sylvester equations via rational iterative schemes ⋮ Numerical strategies towards peta-scale simulations of nanoelectronics devices ⋮ Using multiple levels of parallelism to enhance the performance of domain decomposition solvers ⋮ Parallelization and optimization of Mfold on shared memory system ⋮ Efficient methods to organize the parallel execution of optimization algorithms ⋮ Modeling heat transfer in Bi\(_2\)Te\(_3\)-Sb\(_2\)Te\(_3\) nanostructures ⋮ A finite element formulation to solve a non-local constitutive model with stresses and strains due to slip gradients ⋮ HordeQBF: A Modular and Massively Parallel QBF Solver ⋮ User-friendly parallel computations with econometric examples ⋮ Efficient parallel algorithms for elastic-plastic finite element analysis ⋮ Evidential instance selection for \(K\)-nearest neighbor classification of big data ⋮ An approximation algorithm and dynamic programming for reduction in heterogeneous environments ⋮ On the Schwarz alternating method for oceanic models on parallel computers ⋮ A hierarchical partition model for adaptive finite element computation ⋮ Parallel lattice Boltzmann method with blocked partitioning ⋮ An empirical investigation on parallelization strategies for scatter search ⋮ Solution adaptive grid strategies based on point redistribution ⋮ A multipopulation cultural algorithm for the electrical generator scheduling problem ⋮ A parallel optimisation approach for the realisation problem in intensity modulated radiotherapy treatment planning ⋮ Parallel solvers for the depletion region identification in metal semiconductor field effect transistors ⋮ An efficient geometric method for incompressible hydrodynamics on the sphere ⋮ Parallel distributed kernel estimation ⋮ Solving 3D time-fractional diffusion equations by high-performance parallel computing ⋮ Scalable SAT solving in the cloud
Uses Software
This page was built for publication: A high-performance, portable implementation of the MPI message passing interface standard