| Publication | Date of Publication | Type |
|---|
| Algebraic Temporal Blocking for Sparse Iterative Solvers on Multi-Core CPUs | 2023-09-05 | Paper |
| Level-based Blocking for Sparse Matrices: Sparse Matrix-Power-Vector Multiplication | 2022-05-03 | Paper |
PHIST: a pipelined, hybrid-parallel iterative solver toolkit ACM Transactions on Mathematical Software | 2022-03-29 | Paper |
Benefits from using mixed precision computations in the ELPA-AEO and ESSEX-II eigensolver projects Japan Journal of Industrial and Applied Mathematics | 2019-08-15 | Paper |
Benefits from using mixed precision computations in the ELPA-AEO and ESSEX-II eigensolver projects Japan Journal of Industrial and Applied Mathematics | 2019-08-15 | Paper |
Direct numerical simulation of turbulent flow over dimples -- code optimization for NEC SX-8 plus flow results High Performance Computing in Science and Engineering `07 | 2018-06-05 | Paper |
High-performance implementation of Chebyshev filter diagonalization for interior eigenvalue computations Journal of Computational Physics | 2017-12-13 | Paper |
Increasing the performance of the Jacobi-Davidson method by blocking SIAM Journal on Scientific Computing | 2015-12-18 | Paper |
Comparison of different propagation steps for lattice Boltzmann methods Computers & Mathematics with Applications | 2015-09-03 | Paper |
Multicore-optimized wavefront diamond blocking for optimizing stencil updates SIAM Journal on Scientific Computing | 2015-07-20 | Paper |
A unified sparse matrix data format for efficient general sparse matrix-vector multiplication on modern processors with wide SIMD units SIAM Journal on Scientific Computing | 2015-01-23 | Paper |
Have the vectors the continuing ability to parry the attack of the killer micros? High Performance Computing on Vector Systems | 2015-01-09 | Paper |
Domain decomposition and locality optimization for large-scale lattice Boltzmann simulations Computers and Fluids | 2014-04-17 | Paper |
Performance analysis and optimization strategies for a D3Q19 lattice Boltzmann kernel on nVIDIA GPUs using CUDA Advances in Engineering Software | 2011-07-13 | Paper |
On the single processor performance of simple lattice Boltzmann kernels Computers and Fluids | 2009-12-07 | Paper |
| RZBENCH: performance evaluation of current HPC architectures using low-level and application benchmarks | 2009-02-09 | Paper |
Introducing a parallel cache oblivious blocking approach for the lattice Boltzmann method Progress in Computational Fluid Dynamics | 2008-06-16 | Paper |
| scientific article; zbMATH DE number 2152998 (Why is no real title available?) | 2005-04-05 | Paper |
Parallelization strategies for density matrix renormalization group algorithms on shared-memory systems. Journal of Computational Physics | 2004-03-14 | Paper |
One-Dimensional Electron-Phonon Systems: Mott- Versus Peierls-Insulators High Performance Computing in Science and Engineering, Munich 2002 | 2003-09-26 | Paper |
Pseudo-Vectorization and RISC Optimization Techniques for the Hitachi SR8000 Architecture High Performance Computing in Science and Engineering, Munich 2002 | 2003-09-26 | Paper |
| scientific article; zbMATH DE number 1953307 (Why is no real title available?) | 2003-07-27 | Paper |