M100 dataset 5: from 22-01 to 22-02 (Q6724948)
From MaRDI portal
Dataset published at Zenodo repository.
| Language | Label | Description | Also known as |
|---|---|---|---|
| default for all languages | No label defined | | |
| English | M100 dataset 5: from 22-01 to 22-02 | Dataset published at Zenodo repository. | |
Statements
This entry is part of a larger data set collected from the most recent Tier-0 supercomputer hosted at CINECA (Marconi100, https://www.hpc.cineca.it/hardware/marconi100). The data covers the entire system. It ranges from internal information on the computing nodes (980+ nodes), such as core loads, temperatures, frequencies, memory read/write operations, CPU power consumption, fan speed, and GPU usage details, to system-wide information, including the liquid cooling infrastructure, the air conditioning system, the power supply units, workload manager statistics, job-related information, system status alerts, and weather forecasts. It comprises hundreds of metrics measured on each computing node, plus hundreds of other metrics gathered from sensors that monitor all system components. The whole data set is stored as a collection of Zenodo entries; this particular entry covers the period from 22-01 to 22-02 (YY-MM). The data is stored as a partitioned Parquet dataset with the following partitioning hierarchy: year_month (YY-MM), plugin, metric. It is distributed as tarball files, each corresponding to one month of data (the first-level partition, year_month). The data is generated by a monitoring infrastructure that works on unstructured data (to improve efficiency and scalability); however, it has been organized in a structured manner to make it easier to use. The simplest way to understand how to access the data is to refer to the companion software modules released together with the dataset, which can be found at: https://gitlab.com/ecs-lab/exadata.
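The partitioning hierarchy described above (year_month, plugin, metric) can be illustrated with a minimal Python sketch. It assumes that an extracted tarball uses Hive-style `key=value` directory names, which the description does not confirm, and the plugin and metric names in the usage note are hypothetical placeholders. The helper names (`partition_path`, `parse_partition`, `load_metric`) are invented for illustration and are not part of the companion exadata modules, which remain the authoritative way to access the data.

```python
from pathlib import PurePosixPath

# Partition levels in the order given by the dataset description.
PARTITION_KEYS = ("year_month", "plugin", "metric")


def partition_path(root, year_month, plugin, metric):
    """Build the directory of one partition.

    Assumption: Hive-style key=value directory names under `root`.
    """
    values = (year_month, plugin, metric)
    parts = [f"{key}={value}" for key, value in zip(PARTITION_KEYS, values)]
    return PurePosixPath(root).joinpath(*parts)


def parse_partition(path):
    """Recover the partition values from the key=value components of a path."""
    found = {}
    for part in PurePosixPath(path).parts:
        key, sep, value = part.partition("=")
        if sep and key in PARTITION_KEYS:
            found[key] = value
    return found


def load_metric(root, year_month, plugin, metric):
    """Read every Parquet file of one partition into a single DataFrame.

    Requires pandas and a Parquet engine such as pyarrow.
    """
    import pandas as pd  # imported lazily so the path helpers work without it

    return pd.read_parquet(str(partition_path(root, year_month, plugin, metric)))
```

For example, `partition_path("data", "22-01", "some_plugin", "some_metric")` yields `data/year_month=22-01/plugin=some_plugin/metric=some_metric`. Reading a single partition this way avoids loading a whole month of data when only one metric is needed.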
31 January 2023
1.0.0