NAS Parallel Benchmarks
The NAS Parallel Benchmarks (NPB) are a small set of programs designed to help evaluate the performance of parallel supercomputers. The benchmarks are derived from computational fluid dynamics (CFD) applications and consist of five kernels and three pseudo-applications in the original "pencil-and-paper" specification (NPB 1). The benchmark suite has been extended to include new benchmarks for unstructured adaptive meshes, parallel I/O, multi-zone applications, and computational grids. Problem sizes in NPB are predefined and indicated as different classes. Reference implementations of NPB are available in commonly-used programming models like MPI and OpenMP (NPB 2 and NPB 3).
- Performance modeling of hybrid MPI/OpenMP scientific applications on large-scale multicore supercomputers
- Comments on PVPs, MPPs, NOWS, and future computer architectures
- Self-similarity of parallel machines
- Experience in using SIMD and MIMD parallelism for computational fluid dynamics
- Performance evaluation of mixed-mode OpenMP/MPI implementations
- Topology-aware strategy for MPI-IO operations in clusters
- A parallel finite element method for the analysis of crystalline solids
- Advanced optimization strategies in the Rice dHPF compiler
- Efficient communication using message prediction for clusters of multiprocessors
- Implementation and evaluation of HPF/SX V2
- Redistribution strategies for portable parallel FFT: a case study
- Parallel simulation of electron-solid interactions for electron microscopy modeling
- Parallel iterative solvers for unstructured grids using a directive/MPI hybrid programming model for the GeoFEM platform on SMP cluster architectures
- Capturing and analyzing the execution control flow of OpenMP applications
- MPI correctness checking for OpenMP/MPI applications
- scientific article; zbMATH DE number 1287887 (Why is no real title available?)
- Design and performance of a scheduling framework for resizable parallel applications
- A speculative and adaptive MPI rendezvous protocol over RDMA-enabled interconnects
- Interconnection network simulation using traces of MPI applications
- LogGPO: an accurate communication model for performance prediction of MPI programs
- Circular-arc graph coloring: On chords and circuits in the meeting graph
- Algorithms for the parallel alternating direction access machine
- Bsp2omp: A Compiler For Translating Bsp Programs To Openmp
- A proposal for error handling in OpenMP
- Parallelization and optimization of Mfold on shared memory system
- Implementation of parallel plasma particle-in-cell codes on PC cluster
- A two-stage hardware scheduler combining greedy and optimal scheduling
- Algorithm-system scalability of heterogeneous computing
- Parallelization of a multiblock flow code: An engineering implementation
- Using cost to control instrumentation overhead
- Model-based fault localization: finding behavioral outliers in large-scale computing systems
- Unstructured adaptive meshes: Bad for your memory?
- Computational fluid dynamics applications on parallel-vector computers: Computations of stirred vessel flows
- A parallelized ENO procedure for direct numerical simulation of compressible turbulence
- A detailed analysis of communication load balance on BlueGene supercomputer
- HPF/JA: extensions of High Performance Fortran for accelerating real‐world applications
- MPI-CHECK: a tool for checking Fortran 90 MPI programs
- High-scalability parallelization of a molecular modeling application: Performance and productivity comparison between OpenMP and MPI implementations
- Elkhound
- HPF/JA
- MPI-CHECK
- Paje
- QUAFF
- CableS
- ParoC++
- BSP2OMP
- FLEXSIM
- EARTH--MANNA
- OVERFLOW-MLP
- VXDL
- BSPlib
- HPCC
- TPVM
- CoArray
- PVM
- eSkel
- ARMCI
- LAM-MPI
- KAAPI
- PAPI
- CPMD
- MPI/MPICH
- MPI
- ParaProf
- SKaMPI
- Ibis
- HPCTOOLKIT
- KOJAK
- Jumpshot
- MARMOT
- mpiP
- Nimrod/G
- ReSHAPE
- VAMPIR
- SPLASH-2
- NetLogger
- SIMGRID
- Scalasca
- Intel MPI Benchmarks
- hwloc
- LogGOPSim
- MOCCA
- Sweep3d
- Sequoia Benchmark
- Sphinx
- UJMP
- Sisal
- PARAVER
- ParADE
- Omega
- ickp
- MPJ Express
- PCG
- OProfile
- JCuda
- SICOSYS
- Kendo
- MCT
- iperf
- ADAPT
This page was built for software: NAS Parallel Benchmarks