The NAS Parallel Benchmarks (NPB) are a small set of programs designed to help evaluate the performance of parallel supercomputers. The benchmarks are derived from computational fluid dynamics (CFD) applications and consist of five kernels and three pseudo-applications in the original "pencil-and-paper" specification (NPB 1). The benchmark suite has been extended to include new benchmarks for unstructured adaptive meshes, parallel I/O, multi-zone applications, and computational grids. Problem sizes in NPB are predefined and indicated as different classes. Reference implementations of NPB are available in commonly-used programming models like MPI and OpenMP (NPB 2 and NPB 3).
- Copperhead
- Chapel
- DXML
- DTrace
- Reducing division latency with reciprocal caches
- MPICH
- Improved upper bounds for online malleable job scheduling
- SAC -- a functional array language for efficient multi-threaded execution
- PARALLEL CFD BENCHMARKS ON CRAY COMPUTERS
- Online scheduling of malleable parallel jobs with setup times on two identical machines
- Code modernization strategies to 3-D stencil-based applications on intel Xeon Phi: KNC and KNL
- OpenSHMEM
- Data optimizations for constraint automata
- scientific article; zbMATH DE number 926698 (Why is no real title available?)
- Parallel 3D mortar element method for adaptive nonconforming meshes
- Dynamic data prefetching in home-based software DSMs
- Implementation and evaluation of a communication intensive application on the EARTH multithreaded system
- An unsteady incompressible Navier-Stokes solver for large eddy simulation of turbulent flows
- Design and implementation of an agent home scheme strategy for prefetch-based DSM systems
- Direct and inverse problems of high-viscosity fluid dynamics
- scientific article; zbMATH DE number 926688 (Why is no real title available?)
- Porting and performance evaluation of irregular codes using OpenMP
- Performance characteristics of the multi-zone NAS parallel benchmarks
- Online malleable job scheduling for \(m\leq 3\)
- scientific article; zbMATH DE number 991438 (Why is no real title available?)
- An object-oriented parallel programming language for distributed-memory parallel computing platforms
- Techniques for compiling and implementing all NAS parallel benchmarks in HPF
- VPP Fortran and the design of HPF/JA extensions
- Performance evaluation of a multi-zone application in different openmp approaches
- Performance advantage of reconfigurable cache design on multicore processor systems
- Supporting openmp on cell
- Parallel benchmarks of turbulence in complex geometries
- VXDL: virtual resources and interconnection networks description language
- Comments on PVPs, MPPs, NOWS, and future computer architectures
- Adaptive execution techniques of parallel programs for multiprocessors
- Failure-aware resource management for high-availability computing clusters with distributed virtual machines
- A session key caching and prefetching scheme for secure communication in cluster systems
- Deadlock detection in MPI programs
- scientific article; zbMATH DE number 2087789 (Why is no real title available?)
- CACHING IN WITH MULTIGRID ALGORITHMS: PROBLEMS IN TWO DIMENSIONS
- Performance modeling of hybrid MPI/OpenMP scientific applications on large-scale multicore supercomputers
- Comments on PVPs, MPPs, NOWS, and future computer architectures
- Self-similarity of parallel machines
- Experience in using SIMD and MIMD parallelism for computational fluid dynamics
- Performance evaluation of mixed-mode OpenMP/MPI implementations
- Topology-aware strategy for MPI-IO operations in clusters
- A parallel finite element method for the analysis of crystalline solids
- Advanced optimization strategies in the Rice dHPF compiler
- Efficient communication using message prediction for clusters of multiprocessors
- Implementation and evaluation of HPF/SX V2
- Redistribution strategies for portable parallel FFT: a case study
- Parallel simulation of electron-solid interactions for electron microscopy modeling
- Parallel iterative solvers for unstructured grids using a directive/MPI hybrid programming model for the GeoFEM platform on SMP cluster architectures
- Capturing and analyzing the execution control flow of OpenMP applications
- MPI correctness checking for OpenMP/MPI applications
- scientific article; zbMATH DE number 1287887 (Why is no real title available?)
- Design and performance of a scheduling framework for resizable parallel applications
- A speculative and adaptive MPI rendezvous protocol over RDMA-enabled interconnects
- Interconnection network simulation using traces of MPI applications
- LogGPO: an accurate communication model for performance prediction of MPI programs
- Circular-arc graph coloring: On chords and circuits in the meeting graph
- Algorithms for the parallel alternating direction access machine
- Bsp2omp: A Compiler For Translating Bsp Programs To Openmp
- A proposal for error handling in OpenMP
- Parallelization and optimization of Mfold on shared memory system
- Implementation of parallel plasma particle-in-cell codes on PC cluster
- A two-stage hardware scheduler combining greedy and optimal scheduling
- Algorithm-system scalability of heterogeneous computing
- Parallelization of a multiblock flow code: An engineering implementation
- Using cost to control instrumentation overhead
- Model-based fault localization: finding behavioral outliers in large-scale computing systems
- Unstructured adaptive meshes: Bad for your memory?
- Computational fluid dynamics applications on parallel-vector computers: Computations of stirred vessel flows
- A parallelized ENO procedure for direct numerical simulation of compressible turbulence
- A detailed analysis of communication load balance on BlueGene supercomputer
- HPF/JA: extensions of High Performance Fortran for accelerating real‐world applications
- MPI-CHECK: a tool for checking Fortran 90 MPI programs
- High-scalability parallelization of a molecular modeling application: Performance and productivity comparison between OpenMP and MPI implementations
- Elkhound
- HPF/JA
- MPI-CHECK
- Paje
- QUAFF
- CableS
- ParoC++
- BSP2OMP
- FLEXSIM
- EARTH--MANNA
- OVERFLOW-MLP
- VXDL
- BSPlib
- HPCC
- TPVM
- CoArray
- PVM
- eSkel
- ARMCI
- LAM-MPI
- KAAPI
- PAPI
This page was built for software: NAS Parallel Benchmarks