swMATH: 903 | MaRDI QID: Q13647 | FDO: Q13647
Author name not available
Official website: http://www.spiral.net/
Cited In (76)
- SpMV
- Case Studies in Model Manipulation for Scientific Computing
- Automatic generation of fast algorithms for matrix–vector multiplication
- Simultaneous conversions with the residue number system using linear algebra
- Generating symmetric DFTs and equivariant FFT algorithms
- Linnea
- A Rewriting System for the Vectorization of Signal Transforms
- Automatic parallel library generation for general-size modular FFT algorithms
- Multi-stage programming with functors and monads: eliminating abstraction overhead from generic code
- High performance implementation of the TFT
- Multi-learner based recursive supervised training
- BLIS: a framework for rapidly instantiating BLAS functionality
- An optimizing compiler for parallel chemistry simulations
- Computing one billion roots using the tangent Graeffe method
- Automatic derivation and implementation of signal processing algorithms
- DFTI---a new interface for Fast Fourier Transform libraries
- Applying Automated Memory Analysis to Improve Iterative Algorithms
- Geometric Optimization of the Evaluation of Finite Element Matrices
- 10.1162/jmlr.2003.3.4-5.887
- Recent progress and applications in group FFTs
- Automatic derivation and implementation of fast convolution algorithms
- Symmetry-based matrix factorization
- Exploiting parallelism in matrix-computation kernels for symmetric multiprocessor systems: matrix-multiplication and matrix-addition algorithm optimizations by software pipelining and threads allocation
- TSFC: A Structure-Preserving Form Compiler
- tangent Graeffe
- In search of a program generator to implement generic transformations for high-performance computing
- DFTI
- FLAME
- Resilient distributed field estimation
- QUAFF
- rSQP++
- PHiPAC
- PAPI
- modpn
- OSKI
- Languages and Compilers for Parallel Computing
- Generating C. System description
- Analysa
- BER MetaOCaml
- DynTile
- PLuTo
- cl1ck
- Algorithm 679
- AREP
- crs
- Camlp4
- Unified embedded parallel finite element computations via software-based Fréchet differentiation
- Algorithm 784
- Strymonas
- Petabricks
- Terra
- CTF
- dijitso
- CIRFE
- MatchPy
- SAGE
- pOSKI
- SMATER
- Adaptive Winograd's matrix multiplications
- mARGOt: A Dynamic Autotuning Framework for Self-Aware Approximate Computing
- Reliable generation of high-performance matrix algebra
- MulticoreBSP
- Title not available
- Daubechies wavelets for high performance electronic structure calculations: the BigDFT project
- An efficient time-step-based self-adaptive algorithm for predictor-corrector methods of Runge-Kutta type
- A program generator for Intel AES-NI instructions
- DxTer
- Knowledge-based automatic generation of partitioned matrix expressions
- Functional and dynamic programming in the design of parallel prefix networks
- Formal semantics applied to the implementation of a skeleton-based parallel programming library
- The "Seven Dwarfs" of symbolic computation
- Decomposing monomial representations of solvable groups
- Automated FEM discretizations for the Stokes equation
- Distribution of a class of divide and conquer recurrences arising from the computation of the Walsh-Hadamard transform
- Design and implementation of adaptive SpMV library for multicore and many-core architecture
- Fast arithmetic for triangular sets: from theory to practice
This page was built for software: SPIRAL