M100 dataset 3: from 21-07 to 21-09

From MaRDI portal
Dataset:6724928



DOI10.5281/zenodo.7589320Zenodo7589320MaRDI QIDQ6724928FDOQ6724928

Dataset published at Zenodo repository.

Luca Benini, Mohsen Seyedkazemi Ardebili, Daniela Galetti, Francesco Barchi, Andrea Bartolini, Andrea Borghesi, Martin Molan, Mirko Cestari, Massimiliano Guarrasi, Carmine di Santi, Alessio Mauri, Francesco Beneventi

Publication date: 31 January 2023

Copyright license: Creative Commons Attribution 4.0 International



This entry is a part of a larger data set collected from the most recent Tier-0 supercomputer hosted at CINECA (Marconi100, https://www.hpc.cineca.it/hardware/marconi100). The data covers the entirety of the system, ranging from the computing nodes (980+ computing nodes) internal information such as core loads, temperatures, frequencies, memory write/read operations, CPU power consumption, fan speed, GPU usage details, etc., to the system-wide information, including the liquid cooling infrastructure, the air conditioning system, the power supply units, workload manager statistics, and job-related information, system status alerts, and weather forecast. It comprises hundreds of metrics measured on each computing node, in addition to hundreds of other metrics gathered from sensors monitored along all system components. The whole data set is stored as a collection of Zenodo entries; this particular entry corresponds to the period: 21-07, 21-09. The dataset is stored as a partitioned Parquet dataset, with this partitioning hierarchy: year_month (YY-MM), plugin, metric. The data is distributed as tarball files, each corresponding to one month of data (first-level partitioning, year_month). The collected data is generated by a monitoring infrastructure working on unstructured data (to improve efficiency and scalability); however, this data has been organized in a structured manner to facilitate its fruition. The simplest way to understand how the access the data is to refer to the companion software modules released together with the dataset itself, which can be found at: https://gitlab.com/ecs-lab/exadata.







This page was built for dataset: M100 dataset 3: from 21-07 to 21-09