StarPU
From MaRDI portal
Software:26122
swMATH14216MaRDI QIDQ26122FDOQ26122
Author name not available (Why is that?)
Cited In (29)
- A robust and scalable multi-level domain decomposition preconditioner for multi-core architecture with large number of cores
- Optimization of a discontinuous Galerkin solver with OpenCL and StarPU
- Leveraging access mode declarations in a model for memory consistency in heterogeneous systems
- Parallel \textit{QR} factorization of block-tridiagonal matrices
- High-order implicit palindromic discontinuous Galerkin method for kinetic-relaxation approximation
- Evaluation of selected resource allocation and scheduling methods in heterogeneous many-core processors and graphics processing units
- Experimenting task-based runtimes on a legacy computational fluid dynamics code with unstructured meshes
- Two-level parallelization of a fluid mechanics algorithm exploiting hardware heterogeneity
- A family of scheduling algorithms for hybrid parallel platforms
- Task-based parallelization of an implicit kinetic scheme
- A parallel fast multipole method for a space-time boundary element method for the heat equation
- Sparse direct solution on parallel computers
- Towards massively parallel computations in algebraic geometry
- A new sparse \(LDL^T\) solver using a posteriori threshold pivoting
- Highly scalable multiplication for distributed sparse multivariate polynomials on many-core systems
- Efficient and scalable algorithms for smoothed particle hydrodynamics on hybrid shared/distributed-memory architectures
- Performance comparison of HPX versus traditional parallelization strategies for the discontinuous Galerkin method
- Performance models and workload distribution algorithms for optimizing a hybrid CPU-GPU multifrontal solver
- Satisfiability modulo theory (SMT) formulation for optimal scheduling of task graphs with communication delay
- Semiautomatic task graph construction for \(\mathcal{H}\)-matrix arithmetic
- Supporting adaptive and irregular parallelism for non-linear numerical optimization
- A task-driven implementation of a simple numerical solver for hyperbolic conservation laws
- Implementing Multifrontal Sparse Solvers for Multicore Architectures with Sequential Task Flow Runtime Systems
- Dynamic autotuning of adaptive fast multipole methods on hybrid multicore CPU and GPU systems
- A scalable RBF-FD method for atmospheric flow
- An efficient multicore implementation of a novel HSS-structured multifrontal solver using randomized sampling
- Superglue: a shared memory framework using data versioning for dependency-aware task-based parallelization
- Experiments with sparse Cholesky using a sequential task-flow implementation
- A framework for cost based optimization of hybrid CPU/GPU query plans in database systems
This page was built for software: StarPU