MKL
From MaRDI portal
Software:19038
swMATH: 6975; MaRDI QID: Q19038; FDO: Q19038
Author name not available
Cited In (only the first 100 items are shown)
- Multi-core CPUs, clusters, and grid computing: A tutorial
- Forward stable eigenvalue decomposition of rank-one modifications of diagonal matrices
- BiqBin: moving boundaries for NP-hard problems by HPC
- AMPS: real-time mesh cutting with augmented matrices for surgical simulations
- A hybrid high-order method for flow simulations in discrete fracture networks
- Generation of large finite-element matrices on multiple graphics processors
- Tests with FALKSOL. A massively parallel multi-level domain decomposing direct solver
- Multi-preconditioned domain decomposition methods in the Krylov subspaces
- HexGen and Hex2Spline: polycube-based hexahedral mesh generation and spline modeling for isogeometric analysis applications in LS-DYNA
- Efficient alternating least squares algorithms for low multilinear rank approximation of tensors
- JuSFEM: a Julia-based open-source package of parallel smoothed finite element method (S-FEM) for elastic problems
- PTEBEM for wave drift forces based on hydrodynamic pressure integration
- Scientific computations on multi-core systems using different programming frameworks
- Towards an efficient use of the BLAS library for multilinear tensor contractions
- Numerical benchmarking of fluid-rigid body interactions
- Load balance and parallel I/O: optimising COSA for large simulations
- FFT, FMM, or multigrid? A comparative study of state-of-the-art Poisson solvers for uniform and nonuniform grids in the unit cube
- BLIS: a framework for rapidly instantiating BLAS functionality
- Aerodynamic force evaluation for ice shedding phenomenon using vortex in cell scheme, penalisation and level set approaches
- A high order discontinuous Galerkin-Fourier incompressible 3D Navier-Stokes solver with rotating sliding meshes
- An approach for large-scale gyroscopic eigenvalue problems with application to high-frequency response of rolling tires
- Using Nesterov's method to accelerate multibody dynamics with friction and contact
- An efficient hybrid tridiagonal divide-and-conquer algorithm on distributed memory architectures
- Algorithms for efficient reproducible floating point summation
- Vector and multithread computation of silencer performance prediction on a dual-processor PC workstation
- Robust viscous-inviscid interaction scheme for application on unstructured meshes
- Improved convex and concave relaxations of composite bilinear forms
- Numerical solution of 3D exterior unsteady wave propagation problems using boundary operators
- Numerical methods and parallel algorithms for computation of periodic responses of plates
- A decomposition method with minimum communication amount for parallelization of multi-dimensional FFTs
- Performance models and workload distribution algorithms for optimizing a hybrid CPU-GPU multifrontal solver
- Accelerated dimension-independent adaptive metropolis
- Two-stage least squares and indirect least squares algorithms for simultaneous equations models
- Regularized symmetric positive definite matrix factorizations for linear systems arising from RBF interpolation and differentiation
- Mathematical modelling of flagellated microswimmers
- PARFES: A method for solving finite element linear equations on multi-core computers
- Low-rank method for fast solution of generalized Smoluchowski equations
- Efficient algorithm for simultaneous reduction to the \(m\)-Hessenberg-triangular-triangular form
- Efficient algebraic multigrid for migration-diffusion-convection-reaction systems arising in electrochemical simulations
- PySPH: A Python-based Framework for Smoothed Particle Hydrodynamics
- An accelerated divide-and-conquer algorithm for the bidiagonal SVD problem
- Accurate eigenvalue decomposition of real symmetric arrowhead matrices and applications
- A parallel algorithm for solving a partial eigenvalue problem for block-diagonal bordered matrices
- Discrete Helmholtz-Hodge decomposition on polyhedral meshes using compatible discrete operators
- Calculating Floquet states of large quantum systems: a parallelization strategy and its cluster implementation
- An efficient blocking M2L translation for low-frequency fast multipole method in three dimensions
- Title not available
- OSQP: an operator splitting solver for quadratic programs
- Implementing Multifrontal Sparse Solvers for Multicore Architectures with Sequential Task Flow Runtime Systems
- A real-valued block conjugate gradient type method for solving complex symmetric linear systems with multiple right-hand sides
- SPARC: accurate and efficient finite-difference formulation and parallel implementation of density functional theory: extended systems
- An improved strain gradient plasticity formulation with energetic interfaces: theory and a fully implicit finite element formulation
- A toolkit for efficient numerical applications in Java
- A general solution strategy of modified power method for higher mode solutions
- Application of the sequential matrix diagonalization algorithm to high-dimensional functional MRI data
- Multiscale modelling of damage and failure in two-dimensional metallic foams
- An improved stiff-ODE solving framework for reacting flow simulations with detailed chemistry in OpenFOAM
- Highly accurate numerical solution of Hartree-Fock equation with pseudospectral method for closed-shell atoms
- A fast and scalable bottom-left-fill algorithm to solve nesting problems using a semi-discrete representation
- Parallelization of the inverse fast multipole method with an application to boundary element method
- Full waveform inversion through double-sweeping solver
- High-performance bidiagonal reduction using tile algorithms on homogeneous multicore architectures
- A resistive magnetohydrodynamics solver using modern C++ and the Boost library
- Nonlinear dynamics of slender structures: a new object-oriented framework
- The singular value decomposition: anatomy of optimizing an algorithm for extreme scale
- Algorithm 977: A QR-preconditioned QR SVD method for computing the SVD with high accuracy
- Algorithm 978: Safe scaling in the level 1 BLAS
- SparseX: a library for high-performance sparse matrix-vector multiplication on multicore platforms
- Design and implementation of adaptive SpMV library for multicore and many-core architecture
- Machine learning algorithms for three-dimensional mean-curvature computation in the level-set method
- A \(\mu\)-mode BLAS approach for multidimensional tensor-structured problems
- Surface smoothing procedures in computational contact mechanics
- A multishift, multipole rational QZ method with aggressive early deflation
- Acceleration of three-dimensional tokamak magnetohydrodynamical code with graphics processing unit and OpenACC heterogeneous parallel programming
- Employing AVX vectorization to improve the performance of random number generators
- Implementing High-performance Complex Matrix Multiplication via the 3m and 4m Methods
- Combined co-rotational beam/shell elements for fluid-structure interaction analysis of insect-like flapping wing
- LPSE: a 3-D wave-based model of cross-beam energy transfer in laser-irradiated plasmas
- Toward a high performance tile divide and conquer algorithm for the dense symmetric eigenvalue problem
- A computational investigation of a model of single-crystal gradient thermoplasticity that accounts for the stored energy of cold work and thermal annealing
- Solving random ordinary differential equations on GPU clusters using multiple levels of parallelism
- Adaptive FETI-DP and BDDC methods with a generalized transformation of basis for heterogeneous problems
- vibro-Lanczos, a symmetric Lanczos solver for vibro-acoustic simulations
- Partitioning and reordering for spike-based distributed-memory parallel Gauss-Seidel
- Iterative representing set selection for nested cross approximation
- Efficient algorithm for proper orthogonal decomposition of block-structured adaptively refined numerical simulations
- On the usage of tetrahedral background cells in nodal integration of RPIM for 3D elasto-static problems
- An immersed \(CR\)-\(P_0\) element for Stokes interface problems and the optimal convergence analysis
- Algorithm 1026: Concurrent Alternating Least Squares for Multiple Simultaneous Canonical Polyadic Decompositions
- Design of a high-performance GEMM-like tensor-tensor multiplication
- Mathematical substantiation of pulsed electromagnetic soundings for new problems of petroleum geophysics
- An ellipsoidal bounding scheme for the quasi-clique number of a graph
- The BLAS API of BLASFEO: optimizing performance for small matrices
- A parallel computing method using blocked format with optimal partitioning for SpMV on GPU
- Quantum circuits synthesis using Householder transformations
- On BLAS Level-3 Implementations of Common Solvers for (Quasi-) Triangular Generalized Lyapunov Equations
- Stress-aware large-scale mesh editing using a domain-decomposed multigrid solver
- Improved hyper-reduction approach for the forced vibration analysis of rotating components
- A dissection solver with kernel detection for symmetric finite element matrices on shared memory computers
- Iterative solver for systems of linear equations with a sparse stiffness matrix for clusters