Hiding Global Communication Latency in the GMRES Algorithm on Massively Parallel Machines

From MaRDI portal
Publication:4917164

DOI10.1137/12086563XzbMath1273.65050MaRDI QIDQ4917164

Karl Meerbergen, Pieter Ghysels, T. Ashby, Wim Vanroose

Publication date: 29 April 2013

Published in: SIAM Journal on Scientific Computing (Search for Journal in Brave)




Related Items (28)

Anatomically accurate high resolution modeling of human whole heart electromechanics: A strongly scalable algebraic multigrid solver method for nonlinear deformationDeflation-accelerated preconditioning of the Poisson-Neumann Schur problem on long domains with a high-order discontinuous element-based collocation methodA Communication Optimization Scheme for Basis Computation of Krylov Subspace Methods on Multi-GPUsMinimizing synchronizations in sparse iterative solvers for distributed supercomputersThe non-symmetric \(s\)-step Lanczos algorithm: derivation of efficient recurrences and synchronization-reducing variants of BiCG and QMRThe Numerical Stability Analysis of Pipelined Conjugate Gradient Methods: Historical Context and MethodologyRobust and efficient adjoint solver for complex flow conditionsAnalysis and practical use of flexible biCGStabThe Adaptive $s$-Step Conjugate Gradient MethodChaotic multigrid methods for the solution of elliptic equationsAlternating Anderson-Richardson method: an efficient alternative to preconditioned Krylov methods for large, sparse linear systemsCoarsest-level improvements in multigrid for lattice QCD on large-scale computersNovel Multilevel Preconditioners for the Systems Arising from Plane Wave Discretization of Helmholtz Equations with Large Wave NumbersInner product free iterative solution and elimination methods for linear systems of a three-by-three block matrix formGMRES algorithms over 35 yearsCommunication-hiding pipelined BiCGSafe methods for solving large linear systemsAnalyzing the Effect of Local Rounding Error Propagation on the Maximal Attainable Accuracy of the Pipelined Conjugate Gradient MethodScalable Asynchronous Domain Decomposition SolversDASHMM Accelerated Adaptive Fast Multipole Poisson-Boltzmann Solver on Distributed Memory ArchitectureDistributed fast boundary element methods for Helmholtz problemss-Step Enlarged Krylov Subspace Conjugate Gradient MethodsEnlarged Krylov Subspace Conjugate Gradient Methods for Reducing CommunicationPipelined, Flexible Krylov Subspace MethodsViennaCL---Linear Algebra Library for Multi- and Many-Core ArchitecturesOn the cost of iterative computationsAn adaptive \(s\)-step conjugate gradient algorithm with dynamic basis updating.Communication Avoiding ILU0 PreconditionerPipelined Iterative Solvers with Kernel Fusion for Graphics Processing Units


Uses Software



This page was built for publication: Hiding Global Communication Latency in the GMRES Algorithm on Massively Parallel Machines