| Publication | Date of Publication | Type |
|---|
| Heterogenous Acceleration for Linear Algebra in Multi-coprocessor Environments | 2022-12-09 | Paper |
| Mixed-Precision Orthogonalization Scheme and Adaptive Step Size for Improving the Stability and Performance of CA-GMRES on GPUs | 2022-12-09 | Paper |
A Set of Batched Basic Linear Algebra Subprograms and LAPACK Routines ACM Transactions on Mathematical Software | 2022-02-01 | Paper |
Mixed-precision iterative refinement using tensor cores on GPUs to accelerate solution of linear systems Proceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences | 2021-10-29 | Paper |
The singular value decomposition: anatomy of optimizing an algorithm for extreme scale SIAM Review | 2018-11-12 | Paper |
Stability and Performance of Various Singular Value QR Implementations on Multicore CPU with a GPU ACM Transactions on Mathematical Software | 2018-07-20 | Paper |
| High-performance matrix-matrix multiplications of very small matrices | 2018-01-11 | Paper |
Linear algebra software for large-scale accelerated multicore computing Acta Numerica | 2016-07-08 | Paper |
Accelerating numerical dense linear algebra calculations with GPUs Numerical Computations with GPUs | 2015-07-03 | Paper |
Mixed-Precision Cholesky QR Factorization and Its Case Studies on Multicore CPU with Multiple GPUs SIAM Journal on Scientific Computing | 2015-06-10 | Paper |
Accelerating Linear System Solutions Using Randomization Techniques ACM Transactions on Mathematical Software | 2014-09-12 | Paper |
Divide and conquer on hybrid GPU-accelerated multicore systems SIAM Journal on Scientific Computing | 2012-08-23 | Paper |
A Scalable High Performant Cholesky Factorization for Multicore with GPU Accelerators Lecture Notes in Computer Science | 2011-03-08 | Paper |
Accelerating GPU kernels for dense linear algebra Lecture Notes in Computer Science | 2011-03-08 | Paper |
Accelerating the reduction to upper Hessenberg, tridiagonal, and bidiagonal forms through hybrid GPU-based computing Parallel Computing | 2010-11-26 | Paper |
Accelerating scientific computations with mixed precision algorithms Computer Physics Communications | 2010-10-28 | Paper |
Towards dense linear algebra for hybrid GPU accelerated manycore systems Parallel Computing | 2010-09-02 | Paper |
Using Mixed Precision for Sparse Matrix Computations to Enhance the Performance while Achieving 64-bit Accuracy ACM Transactions on Mathematical Software | 2008-12-21 | Paper |
State-of-the-art eigensolvers for electronic structure calculations of large scale nano-systems Journal of Computational Physics | 2008-07-29 | Paper |
The use of bulk states to accelerate the band edge state calculation of a semiconductor quantum dot Journal of Computational Physics | 2007-05-23 | Paper |
Computational Science – ICCS 2005 Lecture Notes in Computer Science | 2005-11-30 | Paper |
Explicit and Averaging A Posteriori Error Estimates for Adaptive Finite Volume Methods SIAM Journal on Numerical Analysis | 2005-10-28 | Paper |
| scientific article; zbMATH DE number 1894292 (Why is no real title available?) | 2003-06-23 | Paper |
A posteriori error estimates for finite volume element approximations of convection-diffusion-reaction equations Computational Geosciences | 2003-04-03 | Paper |
Interior penalty discontinuous approximations of elliptic problems Computational Methods in Applied Mathematics | 2002-03-19 | Paper |
Interior penalty discontinuous approximations of elliptic problems Computational Methods in Applied Mathematics | 2002-03-19 | Paper |
A hybrid Hermitian general eigenvalue solver (available as arXiv preprint) | N/A | Paper |