CUDA
From MaRDI portal
Software:15791
swMATH: 3258; MaRDI QID: Q15791; FDO: Q15791
Author name not available
Cited In (only showing first 100 items)
- Simulations of GA melting based on multiple-relaxation time lattice Boltzmann method performed with CUDA in Python
- Title not available
- Efficient Serial and Parallel Coordinate Descent Methods for Huge-Scale Truss Topology Design
- Large colloids in cholesteric liquid crystals
- AMGCL: an efficient, flexible, and extensible algebraic multigrid implementation
- Towards a complete FEM-based simulation toolkit on GPUs: unstructured grid finite element geometric multigrid solvers with strong smoothers based on sparse approximate inverses
- Multi-GPU implementation of the lattice Boltzmann method
- CUDA programs for solving the time-dependent dipolar Gross-Pitaevskii equation in an anisotropic trap
- GeoMFree \(^{\operatorname{3D}}\): a package of meshfree local radial point interpolation method (RPIM) for geomechanics
- MemShield: GPU-assisted software memory encryption
- Virtuaschlieren: a hybrid GPU/CPU-based schlieren simulator for ideal and non-ideal compressible-fluid flows
- PeriPy -- a high performance peridynamics package
- Homotopy continuation method for solving systems of nonlinear and polynomial equations
- Evaluation of selected resource allocation and scheduling methods in heterogeneous many-core processors and graphics processing units
- ShearLab 3D: faithful digital shearlet transforms based on compactly supported shearlets
- ViennaCL -- linear algebra library for multi- and many-core architectures
- Model Checking of Biological Systems
- Chrono: An Open Source Multi-physics Dynamics Engine
- CAMPARY: Cuda Multiple Precision Arithmetic Library and Applications
- Automatic Differentiation Through the Use of Hyper-Dual Numbers for Second Derivatives
- Dense Arithmetic over Finite Fields with the CUMODP Library
- Gaalop—High Performance Parallel Computing Based on Conformal Geometric Algebra
- Simulating FRSN P Systems with Real Numbers in P-Lingua on sequential and CUDA platforms
- Program package MPGOS: challenges and solutions during the integration of a large number of independent ODE systems using GPUs
- Variants of Mersenne Twister Suitable for Graphic Processors
- Partitioned hybrid learning of Bayesian network structures
- Parallel metaheuristics: recent advances and new trends
- Memory-Efficient Sparse Matrix-Matrix Multiplication by Row Merging on Many-Core Architectures
- AmgX: A Library for GPU Accelerated Algebraic Multigrid and Preconditioned Iterative Methods
- GPU-Accelerated Sparse Matrix-Matrix Multiplication by Iterative Row Merging
- Multi-GPU numerical simulation of electromagnetic waves
- Tapping the supercomputer under your desk: solving dynamic equilibrium models with graphics processors
- GPU accelerated simulations of bluff body flows using vortex particle methods
- Nodal discontinuous Galerkin methods on graphics processors
- Logical and set calculations in the framework of geometrical informatics paradigm
- On software implementation of Kuznyechik on Intel CPUs
- Theoretical and numerical analysis of approaches to evaluation of statistical error of the DSMC method
- A parallel implementation of an \(O^\ast(n^4)\) volume algorithm
- Parallel optimization of 3D cardiac electrophysiological model using GPU
- MDI-GPU: accelerating integrative modelling for genomic-scale data using GP-GPU computing
- \(\mathcal H\)-LU factorization on many-core systems
- CUMODP
- Lattice study of infrared behaviour in \(\mathrm{SU}(3)\) gauge theory with twelve massless flavours
- Massively Parallel Approximate Gaussian Process Regression
- Fast multipole methods on graphics processors
- Performance Evaluation of a Two-Dimensional Lattice Boltzmann Solver Using CUDA and PGAS UPC Based Parallelisation
- Developing Extensible Lattice-Boltzmann Simulators for General-Purpose Graphics-Processing Units
- Design and Implementation of Adaptive SpMV Library for Multicore and Many-Core Architecture
- Title not available
- Title not available
- EigenCFA
- Verification of Concurrent Systems with VerCors
- Speed Records for NTRU
- Unveiling WARIS Code, a Parallel and Multi-purpose FDM Framework
- Sailfish: a flexible multi-GPU implementation of the lattice Boltzmann method
- Krylov subspace methods for the Dirac equation
- DualSPHysics: Open-source parallel CFD solver based on smoothed particle hydrodynamics (SPH)
- PyFR: an open source framework for solving advection-diffusion type problems on streaming architectures using the flux reconstruction approach
- Interleaving and Lock-Step Semantics for Analysis and Verification of GPU Kernels
- Parallel Meshing, Discretization, and Computation of Flow in Massive Discrete Fracture Networks
- The Eigenvalues Slicing Library (EVSL): Algorithms, Implementation, and Software
- Fast \(k\)-selection algorithms for graphics processing units
- Algorithm 944
- PuReMD-GPU: A reactive molecular dynamics simulation package for GPUs
- \texttt{CUDAEASY} -- a GPU accelerated cosmological lattice program
- Title not available
- Static and dynamic SABR stochastic volatility models: calibration and option pricing using GPUs
- Algorithm 1002
- SODECL
- EMPIRE-PIC: A Performance Portable Unstructured Particle-in-Cell Code
- GPUSVM: a comprehensive CUDA based support vector machine package
- HONEI: A collection of libraries for numerical computations targeting multiple processor architectures
- Numerical simulations of elastic wave propagation using graphical processing units -- comparative study of high-performance computing capabilities
- Solving the examination timetabling problem in GPUs
- An efficient implementation of parallel simulated annealing algorithm in GPUs
- Conic optimization via operator splitting and homogeneous self-dual embedding
- Quantitative Photoacoustic Tomography
- MALBEC: a new CUDA-C ray-tracer in general relativity
- Simulations of complex and microscopic models of cardiac electrophysiology powered by multi-GPU platforms
- Multivalued geodesic ray-tracing for computing brain connections using diffusion tensor imaging
- GPU boosted CNN simulator library for graphical flow-based programmability
- TeraFLOP computing on a desktop PC with GPUs for 3D CFD
- New Hermitian and skew-Hermitian splitting methods for non-Hermitian positive-definite linear systems
- High-order finite-element seismic wave propagation modeling with MPI on a large GPU cluster
- A directed genetic algorithm for global optimization
- Title not available
- A multi-platform scaling study for an OpenMP parallelization of a discontinuous Galerkin ocean model
- GPU driven finite difference WENO scheme for real time solution of the shallow water equations
- libtropicon: a scalable library for computing intersection points of generic tropical hyper-surfaces
- AQUAgpusph, a new free 3D SPH solver accelerated with OpenCL
- GPU based detection of topological changes in Voronoi diagrams
- gpuSPHASE -- a shared memory caching implementation for 2D SPH using CUDA
- GPU accelerated intensities MPI (GAIN-MPI): a new method of computing Einstein-\(A\) coefficients
- Independent sampling for Bayesian normal conditional autoregressive models with OpenCL acceleration
- Cucheb: a GPU implementation of the filtered Lanczos procedure
- GPU accelerated population annealing algorithm
- NRMC -- a GPU code for \(N\)-reverse Monte Carlo modeling of fluids in confined media
- A Monte Carlo volumetric-ray-casting estimator for global fluence tallies on GPUs
- Direct simulation of pore-scale two-phase visco-capillary flow on large digital rock images using a phase-field lattice Boltzmann method on general-purpose graphics processing units
- Global Memory Access Modelling for Efficient Implementation of the Lattice Boltzmann Method on Graphics Processing Units