Cited in
- GScan
- An efficient way to assemble finite element matrices in vector languages
- nvGRAPH
- Parallel computation of alpha complexes for biomolecules
- Serial and parallel approaches for image segmentation by numerical minimization of a second-order functional
- Exploiting batch processing on streaming architectures to solve 2D elliptic finite element problems: a hybridized discontinuous Galerkin (HDG) case study
- Fast \(k\)-selection algorithms for graphics processing units
- GraphBLAST: A High-Performance Linear Algebra-based Graph Framework on the GPU
- AMDGPU.jl
- DiffEqGPU.jl
- GPUArrays.jl
- GPUCompiler.jl
- Metal.jl
- oneAPI.jl
- Solving ordinary differential equations on GPUs
- An efficient implementation of parallel simulated annealing algorithm in GPUs
- Parallel Chen-Han (PCH) algorithm for discrete geodesics
- Derivative-free optimization and neural networks for robust regression
- Algorithmic patterns for \(\mathcal{H}\)-matrices on many-core processors
- H2Opus
- Finite element integration on GPGPUs
- Experiments-based parameter identification on the GPU for cooperative systems
- Analysis of a splitting approach for the parallel solution of linear systems on GPU cards
- GPU-Accelerated Discontinuous Galerkin Methods on Polytopic Meshes
- A GPU-based hyperbolic SVD algorithm
- Establishing mesh topology in multi-material cells: enabling technology for robust and accurate multi-material simulations
- Nested data-parallelism on the GPU
- A language for hierarchical data parallel design-space exploration on GPUs
- Developing extensible lattice-Boltzmann simulators for general-purpose graphics-processing units
- CHEXVIS
- A Monte Carlo volumetric-ray-casting estimator for global fluence tallies on GPUs
- A fast GPU-accelerated mixed-precision strategy for fully nonlinear water wave computations
- A massively parallel GPU-accelerated model for analysis of fully nonlinear free surface waves
- GScan: a parallel Graham scan algorithm for calculating two-dimensional convex hulls on graphic processing units
- H2Opus: a distributed-memory multi-GPU software package for non-local operators
- SoAx
- Optimizing sparse matrix-matrix multiplication for the GPU
- Leveraging parallel computing in multibody dynamics
- Acceleration strategies for explicit finite element analysis of metal powder-based additive manufacturing processes using graphical processing units
- Advanced parallelization strategies using hybrid MPI-CUDA octree DSMC method for modeling flow through porous media
- Enabling a high throughput real time data pipeline for a large radio telescope array with GPUs
- ViennaCL-linear algebra library for multi- and many-core architectures
- A numerical study of the effect of particle properties on the radial distribution of suspensions in pipe flow
- Parallel collision detection of ellipsoids with applications in large scale multibody dynamics
- A GPU-based preconditioned Newton-Krylov solver for flexible multibody dynamics
- A new sparse matrix vector multiplication graphics processing unit algorithm designed for finite element problems
- Imalytics Preclinical
- A scalable parallel method for large collision detection problems
- Computing of high breakdown regression estimators without sorting on graphics processing units
- Memory-Efficient Sparse Matrix-Matrix Multiplication by Row Merging on Many-Core Architectures
- A Framework for Error-Bounded Approximate Computing, with an Application to Dot Products
- A GPU-CUDA based direct simulation Monte Carlo algorithm for real gas flows
- Productivity, performance, and portability for computational fluid dynamics applications
- A GPU accelerated aggregation algebraic multigrid method
- A conservative discontinuous Galerkin scheme for the 2D incompressible Navier-Stokes equations
- A modular lattice Boltzmann solver for GPU computing
- Multi-mass solvers for lattice QCD on GPUs
- Semi-automatic porting of a large-scale Fortran CFD code to GPUs
- Toward a GPU-aware comparison of explicit and implicit CFD simulations on structured meshes
- DeWall
- CUDA
- FParser
- OceanWave3D
- ViennaCL
- OpenCL
- PSPIKE
- CUSP
- VexCL
- SMMP
- Bolstad2
- 3D Alpha Shapes
- OpenACC
- cuRAND
- ESOM-MAP
- GAGA
- FMMTL
- MC21A
- Paralution
- GPU Quicksort
- MMMFEM
- Bullet
- ChronoEngine
- GAMPACK
- TheLMA
- ggks
- Nikola
- SpGEMM
- SkePU
- Algorithm 548
- NESL
- UPC++
- ASKIT
- OptFEM
- GPU accelerated greedy algorithms for compressed sensing
- KENO
- RUMD
- AHMED
- SaPGPU
- DAC
- FPTuner
This page was built for software: Thrust