The following pages link to NAS Parallel Benchmarks (Q20852):
Displaying 50 items.
- Performance modeling of hybrid MPI/OpenMP scientific applications on large-scale multicore supercomputers (Q394738) (← links)
- Algorithm-system scalability of heterogeneous computing (Q436897) (← links)
- A two-stage hardware scheduler combining greedy and optimal scheduling (Q436901) (← links)
- Performance evaluation of mixed-mode OpenMP/MPI implementations (Q601003) (← links)
- Model-based fault localization: finding behavioral outliers in large-scale computing systems (Q601135) (← links)
- A detailed analysis of communication load balance on BlueGene supercomputer (Q603224) (← links)
- Failure-aware resource management for high-availability computing clusters with distributed virtual machines (Q666083) (← links)
- Adaptive execution techniques of parallel programs for multiprocessors (Q666102) (← links)
- A session key caching and prefetching scheme for secure communication in cluster systems (Q666163) (← links)
- Experience in using SIMD and MIMD parallelism for computational fluid dynamics (Q685974) (← links)
- Capturing and analyzing the execution control flow of OpenMP applications (Q839494) (← links)
- MPI correctness checking for OpenMP/MPI applications (Q839508) (← links)
- Interconnection network simulation using traces of MPI applications (Q842846) (← links)
- A speculative and adaptive MPI rendezvous protocol over RDMA-enabled interconnects (Q842850) (← links)
- LogGPO: an accurate communication model for performance prediction of MPI programs (Q848368) (← links)
- Improved upper bounds for online malleable job scheduling (Q892840) (← links)
- Supporting openmp on cell (Q934940) (← links)
- Performance evaluation of a multi-zone application in different openmp approaches (Q934945) (← links)
- Performance advantage of reconfigurable cache design on multicore processor systems (Q934956) (← links)
- Parallel simulation of electron-solid interactions for electron microscopy modeling (Q973416) (← links)
- Online scheduling of malleable parallel jobs with setup times on two identical machines (Q976487) (← links)
- Design and performance of a scheduling framework for resizable parallel applications (Q991070) (← links)
- Parallelization and optimization of Mfold on shared memory system (Q991149) (← links)
- Design and implementation of an agent home scheme strategy for prefetch-based DSM systems (Q1040747) (← links)
- Using cost to control instrumentation overhead (Q1128728) (← links)
- Parallel benchmarks of turbulence in complex geometries (Q1370689) (← links)
- A parallel finite element method for the analysis of crystalline solids (Q1372785) (← links)
- A parallelized ENO procedure for direct numerical simulation of compressible turbulence (Q1375899) (← links)
- Algorithms for the parallel alternating direction access machine (Q1575740) (← links)
- An object-oriented parallel programming language for distributed-memory parallel computing platforms (Q1651016) (← links)
- Code modernization strategies to 3-D stencil-based applications on intel Xeon Phi: KNC and KNL (Q1667693) (← links)
- Topology-aware strategy for MPI-IO operations in clusters (Q1722874) (← links)
- Unstructured adaptive meshes: Bad for your memory? (Q1765343) (← links)
- Reducing division latency with reciprocal caches (Q1916989) (← links)
- Online malleable job scheduling for \(m\leq 3\) (Q1944031) (← links)
- Parallelization of a multiblock flow code: An engineering implementation (Q1973759) (← links)
- Self-similarity of parallel machines (Q2431431) (← links)
- SAC -- a functional array language for efficient multi-threaded execution (Q2431843) (← links)
- A proposal for error handling in OpenMP (Q2457961) (← links)
- Direct and inverse problems of high-viscosity fluid dynamics (Q2458075) (← links)
- High-scalability parallelization of a molecular modeling application: Performance and productivity comparison between OpenMP and MPI implementations (Q2461678) (← links)
- Performance characteristics of the multi-zone NAS parallel benchmarks (Q2497743) (← links)
- Parallel 3D mortar element method for adaptive nonconforming meshes (Q2502060) (← links)
- Data optimizations for constraint automata (Q2974782) (← links)
- (Q3124739) (← links)
- Bsp2omp: A Compiler For Translating Bsp Programs To Openmp (Q3399235) (← links)
- Efficient communication using message prediction for clusters of multiprocessors (Q4790851) (← links)
- HPF/JA: extensions of High Performance Fortran for accelerating real‐world applications (Q4790852) (← links)
- VPP Fortran and the design of HPF/JA extensions (Q4790854) (← links)
- Implementation and evaluation of HPF/SX V2 (Q4790858) (← links)