PAPI
From MaRDI portal
Software:18089
swMATH: 5951 · MaRDI QID: Q18089 · FDO: Q18089
Author name not available
Cited In (30)
- Instrumentation database system for performance analysis of parallel scientific applications
- Scalarization using loop alignment and loop skewing
- Title not available
- Capturing and analyzing the execution control flow of OpenMP applications
- Engineering a combinatorial Laplacian solver: lessons learned
- Fine-grained multithreading for the multifrontal \(QR\) factorization of sparse matrices
- Instruction-throughput regulation in computer processors with data-center applications
- Performance Comparison and Workload Analysis of Mesh Untangling and Smoothing Algorithms
- Implementation and evaluation of global and partitioned scheduling in a real-time OS
- Towards an accurate performance modeling of parallel sparse factorization
- Performance comparison of HPX versus traditional parallelization strategies for the discontinuous Galerkin method
- Improving Resource-Unaware SAT Solvers
- PySPH: A Python-based Framework for Smoothed Particle Hydrodynamics
- Detecting secondary bottlenecks in parallel quantum chemistry applications using MPI
- A distributed and incremental SVD algorithm for agglomerative data analysis on large networks
- Parallel simulations of dynamic fracture using extrinsic cohesive elements
- Code modernization strategies to 3-D stencil-based applications on Intel Xeon Phi: KNC and KNL
- mARGOt: A Dynamic Autotuning Framework for Self-Aware Approximate Computing
- Performance modeling of serial and parallel implementations of the fractional Adams-Bashforth-Moulton method
- When cache blocking of sparse matrix vector multiply works and why
- ParVec: vectorizing the PARSEC benchmark suite
- Data page layouts for relational databases on deep memory hierarchies
- An efficient time-step-based self-adaptive algorithm for predictor-corrector methods of Runge-Kutta type
- An Optimized Sparse Approximate Matrix Multiply for Matrices with Decay
- High order finite volume methods on wavelet-adapted grids with local time-stepping on multicore architectures for the simulation of shock-bubble interactions
- Optimized code generation for finite element local assembly using symbolic manipulation
- New fast divide-and-conquer algorithms for the symmetric tridiagonal eigenvalue problem
- Algorithm 942
- Enhancing speed and scalability of the ParFlow simulation code
- SCALEA: a performance analysis tool for parallel programs