Parallel solver for shifted systems in a hybrid CPU-GPU framework
From MaRDI portal
Publication:4580279
Abstract: This paper proposes a combination of a hybrid CPU--GPU and a pure GPU software implementation of a direct algorithm for solving shifted linear systems with large number of complex shifts and multiple right-hand sides. Such problems often appear e.g. in control theory when evaluating the transfer function, or as a part of an algorithm performing interpolatory model reduction, as well as when computing pseudospectra and structured pseudospectra, or solving large linear systems of ordinary differential equations. The proposed algorithm first jointly reduces the general full matrix and the full right-hand side matrix to the controller Hessenberg canonical form that facilitates efficient solution: is transformed to a so-called -Hessenberg form and is made upper-triangular. This is implemented as blocked highly parallel CPU--GPU hybrid algorithm; individual blocks are reduced by the CPU, and the necessary updates of the rest of the matrix are split among the cores of the CPU and the GPU. To enhance parallelization, the reduction and the updates are overlapped. In the next phase, the reduced -Hessenberg--triangular systems are solved entirely on the GPU, with shifts divided into batches. The benefits of such load distribution are demonstrated by numerical experiments. In particular, we show that our proposed implementation provides an excellent basis for efficient implementations of computational methods in systems and control theory, from evaluation of transfer function to the interpolatory model reduction.
Recommendations
- A mixed-precision algorithm for the solution of Lyapunov equations on hybrid CPU-GPU platforms
- Efficient generalized Hessenberg form and applications
- Efficient algorithm for simultaneous reduction to the \(m\)-Hessenberg-triangular-triangular form
- Performance models and workload distribution algorithms for optimizing a hybrid CPU-GPU multifrontal solver
- Accelerating the reduction to upper Hessenberg, tridiagonal, and bidiagonal forms through hybrid GPU-based computing
Cites work
- scientific article; zbMATH DE number 3841228 (Why is no real title available?)
- $\mathcal{H}_2$ Model Reduction for Large-Scale Linear Dynamical Systems
- A note on shifted Hessenberg systems and frequency response computation
- A survey of projection-based model reduction methods for parametric dynamical systems
- Accelerating the reduction to upper Hessenberg, tridiagonal, and bidiagonal forms through hybrid GPU-based computing
- An alternative Givens ordering
- BiCGStab(\(\ell\)) for families of shifted linear systems
- Calculation of Pseudospectra by the Arnoldi Iteration
- Computation of Pseudospectra by Continuation
- Computing $A^\alpha, \log(A)$, and Related Matrix Functions by Contour Integrals
- Computing the field of values and pseudospectra using the Lanczos method with continuation
- Efficient generalized Hessenberg form and applications
- Hessenberg and Hessenberg/triangular forms in linear system thcory†
- Interpolatory model reduction of large-scale dynamical systems
- Interpolatory projection methods for parameterized model reduction
- LAPACK Users' Guide
- Large-scale computation of pseudospectra using ARPACK and eigs
- Preconditioned multishift BiCG for \(\mathcal{H}_2\)-optimal model reduction
- Recycling BICG with an application to model reduction
- Recycling BiCGSTAB with an application to parametric model order reduction
- Recycling Krylov Subspaces for Sequences of Linear Systems
- Restarted GMRES for Shifted Linear Systems
- Restarted full orthogonalization method for shifted linear systems
- Spectral value sets: A graphical tool for robustness analysis
- Transfer functions and resolvent norm approximation of large matrices
- \(h_{2}\)-norm optimal model reduction for large scale discrete dynamical MIMO systems
Cited in
(3)
This page was built for publication: Parallel solver for shifted systems in a hybrid CPU-GPU framework
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q4580279)