NAS Parallel Benchmarks

MaRDI QIDQ20852swMATHFDO

Authors NASA

Official website http://www.nas.nasa.gov/publications/npb.html

The NAS Parallel Benchmarks (NPB) are a small set of programs designed to help evaluate the performance of parallel supercomputers. The benchmarks are derived from computational fluid dynamics (CFD) applications and consist of five kernels and three pseudo-applications in the original "pencil-and-paper" specification (NPB 1). The benchmark suite has been extended to include new benchmarks for unstructured adaptive meshes, parallel I/O, multi-zone applications, and computational grids. Problem sizes in NPB are predefined and indicated as different classes. Reference implementations of NPB are available in commonly-used programming models like MPI and OpenMP (NPB 2 and NPB 3).

Cited in

(only showing first 100 items - show all)

Copperhead
Chapel
DXML
DTrace
Reducing division latency with reciprocal caches
MPICH
Improved upper bounds for online malleable job scheduling
SAC -- a functional array language for efficient multi-threaded execution
PARALLEL CFD BENCHMARKS ON CRAY COMPUTERS
Online scheduling of malleable parallel jobs with setup times on two identical machines
Code modernization strategies to 3-D stencil-based applications on intel Xeon Phi: KNC and KNL
OpenSHMEM
Data optimizations for constraint automata
scientific article; zbMATH DE number 926698 (Why is no real title available?)
Parallel 3D mortar element method for adaptive nonconforming meshes
Dynamic data prefetching in home-based software DSMs
Implementation and evaluation of a communication intensive application on the EARTH multithreaded system
An unsteady incompressible Navier-Stokes solver for large eddy simulation of turbulent flows
Design and implementation of an agent home scheme strategy for prefetch-based DSM systems
Direct and inverse problems of high-viscosity fluid dynamics
scientific article; zbMATH DE number 926688 (Why is no real title available?)
Porting and performance evaluation of irregular codes using OpenMP
Performance characteristics of the multi-zone NAS parallel benchmarks
Online malleable job scheduling for \(m\leq 3\)
scientific article; zbMATH DE number 991438 (Why is no real title available?)
An object-oriented parallel programming language for distributed-memory parallel computing platforms
Techniques for compiling and implementing all NAS parallel benchmarks in HPF
VPP Fortran and the design of HPF/JA extensions
Performance evaluation of a multi-zone application in different openmp approaches
Performance advantage of reconfigurable cache design on multicore processor systems
Supporting openmp on cell
Parallel benchmarks of turbulence in complex geometries
VXDL: virtual resources and interconnection networks description language
Comments on PVPs, MPPs, NOWS, and future computer architectures
Adaptive execution techniques of parallel programs for multiprocessors
Failure-aware resource management for high-availability computing clusters with distributed virtual machines
A session key caching and prefetching scheme for secure communication in cluster systems
Deadlock detection in MPI programs
scientific article; zbMATH DE number 2087789 (Why is no real title available?)
CACHING IN WITH MULTIGRID ALGORITHMS: PROBLEMS IN TWO DIMENSIONS
Performance modeling of hybrid MPI/OpenMP scientific applications on large-scale multicore supercomputers
Comments on PVPs, MPPs, NOWS, and future computer architectures
Self-similarity of parallel machines
Experience in using SIMD and MIMD parallelism for computational fluid dynamics
Performance evaluation of mixed-mode OpenMP/MPI implementations
Topology-aware strategy for MPI-IO operations in clusters
A parallel finite element method for the analysis of crystalline solids
Advanced optimization strategies in the Rice dHPF compiler
Efficient communication using message prediction for clusters of multiprocessors
Implementation and evaluation of HPF/SX V2
Redistribution strategies for portable parallel FFT: a case study
Parallel simulation of electron-solid interactions for electron microscopy modeling
Parallel iterative solvers for unstructured grids using a directive/MPI hybrid programming model for the GeoFEM platform on SMP cluster architectures
Capturing and analyzing the execution control flow of OpenMP applications
MPI correctness checking for OpenMP/MPI applications
scientific article; zbMATH DE number 1287887 (Why is no real title available?)
Design and performance of a scheduling framework for resizable parallel applications
A speculative and adaptive MPI rendezvous protocol over RDMA-enabled interconnects
Interconnection network simulation using traces of MPI applications
LogGPO: an accurate communication model for performance prediction of MPI programs
Circular-arc graph coloring: On chords and circuits in the meeting graph
Algorithms for the parallel alternating direction access machine
Bsp2omp: A Compiler For Translating Bsp Programs To Openmp
A proposal for error handling in OpenMP
Parallelization and optimization of Mfold on shared memory system
Implementation of parallel plasma particle-in-cell codes on PC cluster
A two-stage hardware scheduler combining greedy and optimal scheduling
Algorithm-system scalability of heterogeneous computing
Parallelization of a multiblock flow code: An engineering implementation
Using cost to control instrumentation overhead
Model-based fault localization: finding behavioral outliers in large-scale computing systems
Unstructured adaptive meshes: Bad for your memory?
Computational fluid dynamics applications on parallel-vector computers: Computations of stirred vessel flows
A parallelized ENO procedure for direct numerical simulation of compressible turbulence
A detailed analysis of communication load balance on BlueGene supercomputer
HPF/JA: extensions of High Performance Fortran for accelerating real‐world applications
MPI-CHECK: a tool for checking Fortran 90 MPI programs
High-scalability parallelization of a molecular modeling application: Performance and productivity comparison between OpenMP and MPI implementations
Elkhound
HPF/JA
MPI-CHECK
Paje
QUAFF
CableS
ParoC++
BSP2OMP
FLEXSIM
EARTH--MANNA
OVERFLOW-MLP
VXDL
BSPlib
HPCC
TPVM
CoArray
PVM
eSkel
ARMCI
LAM-MPI
KAAPI
PAPI

This page was built for software: NAS Parallel Benchmarks