Data archive for paper "Copula-based synthetic data augmentation for machine-learning emulators"
DOI10.5281/zenodo.5150327Zenodo5150327MaRDI QIDQ6692584FDOQ6692584
Dataset published at Zenodo repository.
Publication date: 31 July 2021
Overview This is the data archive for paper Copula-based synthetic data augmentation for machine-learning emulators. It contains the papers data archive with model outputs (see results folder) and the Singularity image for (optionally) re-running experiments. For the Python tool used to generate synthetic data, please refer to Synthia. Requirements Singularity = 3 Portable Batch System (PBS) job scheduler* Todays high-performance computer (e.g. ~ 32 CPUs @ 2 500 MHz with 64 GB of RAM ) *Although PBS in not a strict requirement, it is required to run all helper scripts as included in this repository. Please note that depending on your specific system settings and resource availability, you may need to modify PBS parameters at the top of submit scripts stored in the hpc directory (e.g. #PBS -lwalltime=72:00:00). Usage To reproduce the results from the experiments described in the paper, first fit all copula models to the reduced NWP-SAF dataset with: qsub hpc/fit.sh then, to generate synthetic data, run all machine learning model configurations, and compute the relevant statistics use: qsub hpc/stats.sh qsub hpc/ml_control.sh qsub hpc/ml_synth.sh Finally, to plot all artifacts included in the paper use: qsub hpc/plot.sh Licence Code released under MIT license. Data from the reduced NWP-SAF dataset released under CC BY 4.0.
This page was built for dataset: Data archive for paper "Copula-based synthetic data augmentation for machine-learning emulators"