MKL
From MaRDI portal
Software:19038
swMATH: 6975, MaRDI QID: Q19038, FDO: Q19038
Author name not available
Cited In (first 100 items)
- Multi-core CPUs, clusters, and grid computing: A tutorial
- Forward stable eigenvalue decomposition of rank-one modifications of diagonal matrices
- Generation of large finite-element matrices on multiple graphics processors
- Tests with FALKSOL. A massively parallel multi-level domain decomposing direct solver
- Numerical Solution of 3D Exterior Unsteady Wave Propagation Problems Using Boundary Operators
- HexGen and Hex2Spline: polycube-based hexahedral mesh generation and spline modeling for isogeometric analysis applications in LS-DYNA
- Efficient alternating least squares algorithms for low multilinear rank approximation of tensors
- JuSFEM: a Julia-based open-source package of parallel smoothed finite element method (S-FEM) for elastic problems
- PTEBEM for wave drift forces based on hydrodynamic pressure integration
- Scientific computations on multi-core systems using different programming frameworks
- Towards an efficient use of the BLAS library for multilinear tensor contractions
- Numerical benchmarking of fluid-rigid body interactions
- Load balance and parallel I/O: optimising COSA for large simulations
- FFT, FMM, or multigrid? A comparative study of state-of-the-art Poisson solvers for uniform and nonuniform grids in the unit cube
- BLIS: a framework for rapidly instantiating BLAS functionality
- Aerodynamic force evaluation for ice shedding phenomenon using vortex in cell scheme, penalisation and level set approaches
- A high order discontinuous Galerkin-Fourier incompressible 3D Navier-Stokes solver with rotating sliding meshes
- An approach for large-scale gyroscopic eigenvalue problems with application to high-frequency response of rolling tires
- Using Nesterov's method to accelerate multibody dynamics with friction and contact
- An efficient hybrid tridiagonal divide-and-conquer algorithm on distributed memory architectures
- Vector and multithread computation of silencer performance prediction on a dual-processor PC workstation
- Robust viscous-inviscid interaction scheme for application on unstructured meshes
- Improved convex and concave relaxations of composite bilinear forms
- Numerical methods and parallel algorithms for computation of periodic responses of plates
- A decomposition method with minimum communication amount for parallelization of multi-dimensional FFTs
- Performance models and workload distribution algorithms for optimizing a hybrid CPU-GPU multifrontal solver
- The Singular Value Decomposition: Anatomy of Optimizing an Algorithm for Extreme Scale
- Accelerated dimension-independent adaptive metropolis
- Two-stage least squares and indirect least squares algorithms for simultaneous equations models
- Design and Implementation of Adaptive SpMV Library for Multicore and Many-Core Architecture
- Regularized symmetric positive definite matrix factorizations for linear systems arising from RBF interpolation and differentiation
- Mathematical modelling of flagellated microswimmers
- PARFES: A method for solving finite element linear equations on multi-core computers
- Low-rank method for fast solution of generalized Smoluchowski equations
- Efficient algorithm for simultaneous reduction to the \(m\)-Hessenberg-triangular-triangular form
- Efficient algebraic multigrid for migration-diffusion-convection-reaction systems arising in electrochemical simulations
- PySPH: A Python-based Framework for Smoothed Particle Hydrodynamics
- Accurate eigenvalue decomposition of real symmetric arrowhead matrices and applications
- A parallel algorithm for solving a partial eigenvalue problem for block-diagonal bordered matrices
- Discrete Helmholtz-Hodge decomposition on polyhedral meshes using compatible discrete operators
- Calculating Floquet states of large quantum systems: a parallelization strategy and its cluster implementation
- An efficient blocking M2L translation for low-frequency fast multipole method in three dimensions
- Title not available
- Algorithm 977
- Algorithm 978
- SparseX
- OSQP: An Operator Splitting Solver for Quadratic Programs
- BiqBin: Moving Boundaries for NP-hard Problems by HPC
- AMPS: Real‐time mesh cutting with augmented matrices for surgical simulations
- A Hybrid High-Order Method for Flow Simulations in Discrete Fracture Networks
- Implementing Multifrontal Sparse Solvers for Multicore Architectures with Sequential Task Flow Runtime Systems
- A real-valued block conjugate gradient type method for solving complex symmetric linear systems with multiple right-hand sides
- SPARC: accurate and efficient finite-difference formulation and parallel implementation of density functional theory: extended systems
- An improved strain gradient plasticity formulation with energetic interfaces: theory and a fully implicit finite element formulation
- A toolkit for efficient numerical applications in Java
- Multi-preconditioned Domain Decomposition Methods in the Krylov Subspaces
- A general solution strategy of modified power method for higher mode solutions
- Application of the sequential matrix diagonalization algorithm to high-dimensional functional MRI data
- Multiscale modelling of damage and failure in two-dimensional metallic foams
- An improved stiff-ODE solving framework for reacting flow simulations with detailed chemistry in OpenFOAM
- Highly accurate numerical solution of Hartree-Fock equation with pseudospectral method for closed-shell atoms
- A fast and scalable bottom-left-fill algorithm to solve nesting problems using a semi-discrete representation
- An Accelerated Divide-and-Conquer Algorithm for the Bidiagonal SVD Problem
- Parallelization of the inverse fast multipole method with an application to boundary element method
- Full waveform inversion through double-sweeping solver
- High-performance bidiagonal reduction using tile algorithms on homogeneous multicore architectures
- A resistive magnetohydrodynamics solver using modern C++ and the Boost library
- Nonlinear dynamics of slender structures: a new object-oriented framework
- Algorithms for Efficient Reproducible Floating Point Summation
- Machine learning algorithms for three-dimensional mean-curvature computation in the level-set method
- A \(\mu\)-mode BLAS approach for multidimensional tensor-structured problems
- Surface smoothing procedures in computational contact mechanics
- A Multithreaded Recursive and Nonrecursive Parallel Sparse Direct Solver
- The BLAS API of BLASFEO
- Employing AVX vectorization to improve the performance of random number generators
- Iterative representing set selection for nested cross approximation
- Performance of the Low-Rank TT-SVD for Large Dense Tensors on Modern MultiCore CPUs
- Implementing High-performance Complex Matrix Multiplication via the 3m and 4m Methods
- Combined co-rotational beam/shell elements for fluid-structure interaction analysis of insect-like flapping wing
- LPSE: a 3-D wave-based model of cross-beam energy transfer in laser-irradiated plasmas
- AMPS: An Augmented Matrix Formulation for Principal Submatrix Updates with Application to Power Grids
- Solving Dense Interval Linear Systems with Verified Computing on Multicore Architectures
- Iterative solver for systems of linear equations with a sparse stiffness matrix on clusters
- A computational investigation of a model of single-crystal gradient thermoplasticity that accounts for the stored energy of cold work and thermal annealing
- Implementing High-Performance Complex Matrix Multiplication via the 1M Method
- Adaptive FETI-DP and BDDC methods with a generalized transformation of basis for heterogeneous problems
- A High Performance QDWH-SVD Solver Using Hardware Accelerators
- Subdivision-Based Nonlinear Multiscale Cloth Simulation
- vibro-Lanczos, a symmetric Lanczos solver for vibro-acoustic simulations
- Towards physics-oriented smoothing in algebraic multigrid for systems of partial differential equations arising in multi-ion transport and reaction models
- Efficient algorithm for proper orthogonal decomposition of block-structured adaptively refined numerical simulations
- Scalable Linear Solvers Based on Enlarged Krylov Subspaces with Dynamic Reduction of Search Directions
- Analytical Modeling Is Enough for High-Performance BLIS
- An immersed \(CR\)-\(P_0\) element for Stokes interface problems and the optimal convergence analysis
- Algorithm 1026: Concurrent Alternating Least Squares for Multiple Simultaneous Canonical Polyadic Decompositions
- Mathematical substantiation of pulsed electromagnetic soundings for new problems of petroleum geophysics
- A parallel computing method using blocked format with optimal partitioning for SpMV on GPU
- Quantum circuits synthesis using Householder transformations
- On BLAS Level-3 Implementations of Common Solvers for (Quasi-) Triangular Generalized Lyapunov Equations
- Stress-aware large-scale mesh editing using a domain-decomposed multigrid solver