CUDA
From MaRDI portal
Software:15791
swMATH3258MaRDI QIDQ15791FDOQ15791
Author name not available (Why is that?)
Cited In (only showing first 100 items - show all)
- Algorithm 1002: Graph coloring based parallel push-relabel algorithm for the maximum flow problem
- SODECL: an open-source library for calculating multiple orbits of a system of stochastic differential equations in parallel
- Fast extraction of neuron morphologies from large-scale SBFSEM image stacks
- EMPIRE-PIC: a performance portable unstructured particle-in-cell code
- Simulations of GA melting based on multiple-relaxation time lattice Boltzmann method performed with CUDA in Python
- Title not available (Why is that?)
- Large colloids in cholesteric liquid crystals
- AMGCL: an efficient, flexible, and extensible algebraic multigrid implementation
- Towards a complete FEM-based simulation toolkit on GPUs: unstructured grid finite element geometric multigrid solvers with strong smoothers based on sparse approximate inverses
- Multi-GPU implementation of the lattice Boltzmann method
- CUDA programs for solving the time-dependent dipolar Gross-Pitaevskii equation in an anisotropic trap
- GeoMFree \(^{\operatorname{3D}}\): a package of meshfree local radial point interpolation method (RPIM) for geomechanics
- MemShield: GPU-assisted software memory encryption
- Virtuaschlieren: a hybrid GPU/CPU-based schlieren simulator for ideal and non-ideal compressible-fluid flows
- PeriPy -- a high performance peridynamics package
- Homotopy continuation method for solving systems of nonlinear and polynomial equations
- Evaluation of selected resource allocation and scheduling methods in heterogeneous many-core processors and graphics processing units
- ShearLab 3D: faithful digital shearlet transforms based on compactly supported shearlets
- ViennaCL-linear algebra library for multi- and many-core architectures
- Chrono: An Open Source Multi-physics Dynamics Engine
- Dense Arithmetic over Finite Fields with the CUMODP Library
- Massively parallel approximate Gaussian process regression
- Program package MPGOS: challenges and solutions during the integration of a large number of independent ODE systems using GPUs
- Partitioned hybrid learning of Bayesian network structures
- Parallel metaheuristics: recent advances and new trends
- Memory-Efficient Sparse Matrix-Matrix Multiplication by Row Merging on Many-Core Architectures
- Multi-GPU numerical simulation of electromagnetic waves
- Tapping the supercomputer under your desk: solving dynamic equilibrium models with graphics processors
- GPU accelerated simulations of bluff body flows using vortex particle methods
- Nodal discontinuous Galerkin methods on graphics processors
- Logical and set calculations in the framework of geometrical informatics paradigm
- On software implementation of Kuznyechik on Intel CPUs
- Theoretical and numerical analysis of approaches to evaluation of statistical error of the DSMC method
- A parallel implementation of an \(O^\ast(n^4)\) volume algorithm
- Parallel optimization of 3D cardiac electrophysiological model using GPU
- MDI-GPU: accelerating integrative modelling for genomic-scale data using GP-GPU computing
- Efficient serial and parallel coordinate descent methods for huge-scale truss topology design
- \(\mathcal H\)-LU factorization on many-core systems
- CUMODP
- Lattice study of infrared behaviour in \(\mathrm{SU}(3)\) gauge theory with twelve massless flavours
- Model checking of biological systems
- Fast multipole methods on graphics processors
- Sailfish: a flexible multi-GPU implementation of the lattice Boltzmann method
- Krylov subspace methods for the Dirac equation
- DualSPHysics: Open-source parallel CFD solver based on smoothed particle hydrodynamics (SPH)
- PyFR: an open source framework for solving advection-diffusion type problems on streaming architectures using the flux reconstruction approach
- Interleaving and Lock-Step Semantics for Analysis and Verification of GPU Kernels
- Fast <it>k</it>-selection algorithms for graphics processing units
- PuReMD-GPU: A reactive molecular dynamics simulation package for GPUs
- \texttt{CUDAEASY} -- a GPU accelerated cosmological lattice program
- Static and dynamic SABR stochastic volatility models: calibration and option pricing using GPUs
- GPUSVM: a comprehensive CUDA based support vector machine package
- HONEI: A collection of libraries for numerical computations targeting multiple processor architectures
- Numerical simulations of elastic wave propagation using graphical processing units -- comparative study of high-performance computing capabilities
- Solving the examination timetabling problem in GPUs
- An efficient implementation of parallel simulated annealing algorithm in GPUs
- Conic optimization via operator splitting and homogeneous self-dual embedding
- MALBEC: a new CUDA-C ray-tracer in general relativity
- Simulations of complex and microscopic models of cardiac electrophysiology powered by multi-GPU platforms
- Multivalued geodesic ray-tracing for computing brain connections using diffusion tensor imaging
- GPU boosted CNN simulator library for graphical flow-based programmability
- TeraFLOP computing on a desktop PC with GPUs for 3D CFD
- New Hermitian and skew-Hermitian splitting methods for non-Hermitian positive-definite linear systems
- High-order finite-element seismic wave propagation modeling with MPI on a large GPU cluster
- A directed genetic algorithm for global optimization
- Quantitative photoacoustic tomography
- Title not available (Why is that?)
- Gaalop -- high performance parallel computing based on conformal geometric algebra
- Automatic differentiation through the use of hyper-dual numbers for second derivatives
- Speed records for NTRU
- Variants of Mersenne Twister suitable for graphic processors
- A multi-platform scaling study for an openmp parallelization of a discontinuous Galerkin ocean model
- GPU driven finite difference WENO scheme for real time solution of the shallow water equations
- libtropicon: a scalable library for computing intersection points of generic tropical hyper-surfaces
- AQUAgpusph, a new free 3D SPH solver accelerated with OpenCL
- GPU based detection of topological changes in Voronoi diagrams
- gpuSPHASE -- a shared memory caching implementation for 2D SPH using CUDA
- GPU accelerated intensities MPI (GAIN-MPI): a new method of computing Einstein-\(A\) coefficients
- Independent sampling for Bayesian normal conditional autoregressive models with OpenCL acceleration
- EigenCFA, accelerating flow analysis with GPUs
- Verification of concurrent systems with VerCors
- Unveiling WARIS code, a parallel and multi-purpose FDM framework
- GPU-PLWAH: GPU-based implementation of the PLWAH algorithm for compressing bitmaps
- Introducing CURRENNT: the Munich open-source CUDA recurrent neural network toolkit
- AmgX: a library for GPU accelerated algebraic multigrid and preconditioned iterative methods
- GPU-accelerated sparse matrix-matrix multiplication by iterative row merging
- CAMPARY: CUDA multiple precision arithmetic library and applications
- Cucheb: a GPU implementation of the filtered Lanczos procedure
- GPU accelerated population annealing algorithm
- NRMC -- a GPU code for $N$-reverse Monte Carlo modeling of fluids in confined media
- Simulating FRSN P systems with real numbers in P-Lingua on sequential and CUDA platforms
- Algorithm 944: Talbot Suite: parallel implementations of Talbot's method for the numerical inversion of Laplace transforms
- Developing extensible lattice-Boltzmann simulators for general-purpose graphics-processing units
- Performance evaluation of a two-dimensional lattice Boltzmann solver using CUDA and PGAS UPC based parallelisation
- Design and implementation of adaptive SpMV library for multicore and many-core architecture
- Parallel meshing, discretization, and computation of flow in massive discrete fracture networks
- The eigenvalues slicing library (EVSL): algorithms, implementation, and software
- A Monte Carlo volumetric-ray-casting estimator for global fluence tallies on GPUs
- Direct simulation of pore-scale two-phase visco-capillary flow on large digital rock images using a phase-field lattice Boltzmann method on general-purpose graphics processing units
- Global Memory Access Modelling for Efficient Implementation of the Lattice Boltzmann Method on Graphics Processing Units
This page was built for software: CUDA