swMATH 8853 | MaRDI QID Q20852 | FDO Q20852
Official website: http://www.nas.nasa.gov/publications/npb.html
The NAS Parallel Benchmarks (NPB) are a small set of programs designed to help evaluate the performance of parallel supercomputers. The benchmarks are derived from computational fluid dynamics (CFD) applications and consist of five kernels and three pseudo-applications in the original "pencil-and-paper" specification (NPB 1). The suite has since been extended with benchmarks for unstructured adaptive meshes, parallel I/O, multi-zone applications, and computational grids. Problem sizes are predefined and grouped into classes. Reference implementations are available in commonly used programming models such as MPI and OpenMP (NPB 2 and NPB 3).
Cited In (first 100 items)
- Performance modeling of hybrid MPI/OpenMP scientific applications on large-scale multicore supercomputers
- Experience in using SIMD and MIMD parallelism for computational fluid dynamics
- Performance evaluation of mixed-mode OpenMP/MPI implementations
- OpenSHMEM
- A parallel finite element method for the analysis of crystalline solids
- Parallel iterative solvers for unstructured grids using a directive/MPI hybrid programming model for the GeoFEM platform on SMP cluster architectures
- Capturing and analyzing the execution control flow of OpenMP applications
- MPI correctness checking for OpenMP/MPI applications
- Design and performance of a scheduling framework for resizable parallel applications
- A speculative and adaptive MPI rendezvous protocol over RDMA-enabled interconnects
- Interconnection network simulation using traces of MPI applications
- LogGPO: an accurate communication model for performance prediction of MPI programs
- BSP2OMP: a compiler for translating BSP programs to OpenMP
- Implementation of parallel plasma particle-in-cell codes on PC cluster
- A two-stage hardware scheduler combining greedy and optimal scheduling
- Algorithm-system scalability of heterogeneous computing
- Computational fluid dynamics applications on parallel-vector computers: Computations of stirred vessel flows
- Model-based fault localization: finding behavioral outliers in large-scale computing systems
- A detailed analysis of communication load balance on BlueGene supercomputer
- HPF/JA: extensions of High Performance Fortran for accelerating real‐world applications
- MPI-CHECK: a tool for checking Fortran 90 MPI programs
- High-scalability parallelization of a molecular modeling application: Performance and productivity comparison between OpenMP and MPI implementations
- Elkhound
- HPF/JA
- QUAFF
- CableS
- eSkel
- ARMCI
- LAM-MPI
- KAAPI
- PAPI
- CPMD
- MPI/MPICH
- MPI
- ParaProf
- SAC -- a functional array language for efficient multi-threaded execution
- SKaMPI
- Ibis
- HPCTOOLKIT
- KOJAK
- Jumpshot
- MARMOT
- mpiP
- Nimrod/G
- ReSHAPE
- VAMPIR
- SPLASH-2
- NetLogger
- SIMGRID
- Scalasca
- Intel MPI Benchmarks
- hwloc
- LogGOPSim
- MOCCA
- Sweep3d
- Sequoia Benchmark
- Sphinx
- UJMP
- Sisal
- PARAVER
- ParADE
- Omega
- ickp
- MPJ Express
- PCG
- OProfile
- JCuda
- SICOSYS
- Kendo
- MCT
- iperf
- ADAPT
- Copperhead
- Chapel
- DXML
- DTrace
- An unsteady incompressible Navier-Stokes solver for large eddy simulation of turbulent flows
- Direct and inverse problems of high-viscosity fluid dynamics
- Performance characteristics of the multi-zone NAS parallel benchmarks
- VXDL: virtual resources and interconnection networks description language
- Deadlock detection in MPI programs
- Adaptive execution techniques of parallel programs for multiprocessors
- Failure-aware resource management for high-availability computing clusters with distributed virtual machines
- A session key caching and prefetching scheme for secure communication in cluster systems
- Caching in with multigrid algorithms: problems in two dimensions
- Comments on PVPs, MPPs, NOWS, and future computer architectures
- Self-similarity of parallel machines
- Topology-aware strategy for MPI-IO operations in clusters
- Advanced optimization strategies in the Rice dHPF compiler
- Efficient communication using message prediction for clusters of multiprocessors
- Implementation and evaluation of HPF/SX V2
- Redistribution strategies for portable parallel FFT: a case study
- Title not available
- Parallel simulation of electron-solid interactions for electron microscopy modeling
- Circular-arc graph coloring: On chords and circuits in the meeting graph
- Algorithms for the parallel alternating direction access machine
- A proposal for error handling in OpenMP
- Parallelization and optimization of Mfold on shared memory system
- Parallelization of a multiblock flow code: An engineering implementation
- Using cost to control instrumentation overhead