ATLAS
From MaRDI portal
Software:12829
Related Items (first 100 shown)
A Block Orthogonalization Procedure with Constant Synchronization Requirements ⋮ Unnamed Item ⋮ HARNESS fault tolerant MPI design, usage and performance issues ⋮ Randomized numerical linear algebra: Foundations and algorithms ⋮ A framework for high-performance matrix multiplication based on hierarchical abstractions, algorithms and optimized low-level kernels ⋮ Emmerald: a fast matrix–matrix multiply using Intel's SSE instructions ⋮ NetBuild: transparent cross‐platform access to computational software libraries ⋮ Towards performance evaluation of high-performance computing on multiple Java platforms ⋮ Unnamed Item ⋮ FLAME ⋮ Unnamed Item ⋮ Unnamed Item ⋮ Unnamed Item ⋮ Unnamed Item ⋮ The Singular Value Decomposition: Anatomy of Optimizing an Algorithm for Extreme Scale ⋮ Euro-Par 2004 Parallel Processing ⋮ Unnamed Item ⋮ Unnamed Item ⋮ Unnamed Item ⋮ Unnamed Item ⋮ Unnamed Item ⋮ Unnamed Item ⋮ Unnamed Item ⋮ Unnamed Item ⋮ Unnamed Item ⋮ Unnamed Item ⋮ Unnamed Item ⋮ Unnamed Item ⋮ Implementing High-Performance Complex Matrix Multiplication via the 1M Method ⋮ Anatomy of high-performance matrix multiplication ⋮ Cache efficient bidiagonalization using BLAS 2.5 operators ⋮ Unnamed Item ⋮ A Fast Direct Solver for Structured Linear Systems by Recursive Skeletonization ⋮ Unnamed Item ⋮ Computational Science - ICCS 2004 ⋮ Computational Science - ICCS 2004 ⋮ Computational Science - ICCS 2004 ⋮ Languages and Compilers for Parallel Computing ⋮ Languages and Compilers for Parallel Computing ⋮ Recursive Blocked Algorithms and Hybrid Data Structures for Dense Matrix Library Software ⋮ Code Optimizations for Complex Microprocessors Applied to CFD Software ⋮ IMF: An Incomplete Multifrontal $LU$-Factorization for Element-Structured Sparse Linear Systems ⋮ Unnamed Item ⋮ FFPACK ⋮ Improving the arithmetic intensity of multigrid with the help of polynomial smoothers ⋮ Nonnegative Diagonals and High Performance on Low-Profile Matrices from Householder QR ⋮ Combinatorial Optimization 
of Matrix-Vector Multiplication in Finite Element Assembly ⋮ Cache-Oblivious Sparse Matrix–Vector Multiplication by Using Sparse Matrix Partitioning Methods ⋮ SIMPLE AND EFFECTIVE C++ MATRIX–VECTOR LIBRARY FOR NONPROFESSIONALS IN COMPUTER SCIENCE ⋮ Adaptive use of iterative methods in predictor-corrector interior point methods for linear programming ⋮ Unnamed Item ⋮ Unnamed Item ⋮ The Lifted Newton Method and Its Application in Optimization ⋮ Towards a fast parallel sparse symmetric matrix-vector multiplication ⋮ Accelerating a particle-in-cell simulation using a hybrid counting sort ⋮ REVISITING MATRIX PRODUCT ON MASTER-WORKER PLATFORMS ⋮ Unsymmetric Ordering Using A Constrained Markowitz Scheme ⋮ Optimizing Local Performance in HPF ⋮ Geometric Optimization of the Evaluation of Finite Element Matrices ⋮ Unnamed Item ⋮ A recursive formulation of Cholesky factorization of a matrix in packed storage ⋮ Formal derivation of algorithms ⋮ The design and implementation of a new out-of-core sparse cholesky factorization method ⋮ A column pre-ordering strategy for the unsymmetric-pattern multifrontal method ⋮ The science of deriving dense linear algebra algorithms ⋮ A fully portable high performance minimal storage hybrid format Cholesky algorithm ⋮ Design, implementation and testing of extended and mixed precision BLAS ⋮ A Parallel Divide and Conquer Algorithm for the Symmetric Eigenvalue Problem on Distributed Memory Architectures ⋮ Improving memory performance of sorting algorithms ⋮ Parallel Processing and Applied Mathematics ⋮ Large-Scale Scientific Computing ⋮ Large-Scale Scientific Computing ⋮ Optimizing the Evaluation of Finite Element Matrices ⋮ Logic program specialisation through partial deduction: Control issues ⋮ Unnamed Item ⋮ Algorithm 1005 ⋮ Deterministic unimodularity certification ⋮ EdgePack: A Parallel Vertex and Node Reordering Package for Optimizing Edge-Based Computations in Unstructured Grids ⋮ Parallel Processing of Matrix Multiplication in a CPU and 
GPU Heterogeneous Environment ⋮ computers vs. the Human Race ⋮ Unnamed Item ⋮ High-Performance Evaluation of Finite Element Variational Forms via Commuting Diagrams and Duality ⋮ Analytical Modeling Is Enough for High-Performance BLIS ⋮ Distribution of a class of divide and conquer recurrences arising from the computation of the Walsh-Hadamard transform ⋮ Towards an efficient use of the BLAS library for multilinear tensor contractions ⋮ Unnamed Item ⋮ Optimizing locality and scalability of embedded Runge-Kutta solvers using block-based pipelining ⋮ Communication lower bounds for distributed-memory matrix multiplication ⋮ Iterative algorithms to approximate canonical Gabor windows: Computational aspects ⋮ ATLAS: a real-space finite-difference implementation of orbital-free density functional theory ⋮ Computing Globally Optimal Solutions for Single-Row Layout Problems Using Semidefinite Programming and Cutting Planes ⋮ A Cache-Oblivious Sparse Matrix–Vector Multiplication Scheme Based on the Hilbert Curve ⋮ OpenMX 2.0: extended structural equation and statistical modeling ⋮ Unnamed Item ⋮ The shifted number system for fast linear algebra on integer matrices ⋮ Towards an accurate performance modeling of parallel sparse factorization ⋮ When cache blocking of sparse matrix vector multiply works and why ⋮ Analysis of a sparse hypermatrix Cholesky with fixed-sized blocking ⋮ On the accuracy of finite-difference solutions for nonlinear water waves ⋮ Scaling LAPACK panel operations using parallel cache assignment