Today's cat is tomorrow's dog: accounting for time-based changes in the labels of ML vulnerability detection approaches (Replication Package Part 4: Poppler Dataset)

From MaRDI portal



DOI: 10.5281/zenodo.14852110, Zenodo: 14852110, MaRDI QID: Q6717904, FDO: Q6717904

Dataset published at Zenodo repository.

Ranindya Paramitha, Yuan Feng, Fabio Massacci

Publication date: 11 February 2025

Copyright license: Creative Commons Attribution 4.0 International



Replication package for "Today's cat is tomorrow's dog: accounting for time-based changes in the labels of ML vulnerability detection approaches", Part 4 (POPPLER dataset).

This repository includes:

- Code: the code to replicate parts of this study:
  a. 1_generate_datasets implements our methodology to generate the datasets.
  b. 2_run_models runs the ML models during the evaluation.
  c. 3_result_replication generates the charts presented in the paper from the ML evaluation results.
- Datasets: contains 2 folders:
  a. original datasets: 1 from NVD Vuldeepecker and 3 extracted from BigVul.
  b. POPPLER datasets: train, validation, and test sets for each time of observation, extracted using our methodology from the BigVul dataset for the poppler project.
- Pretrained-models: the models that we generated during our evaluation (3 test results for each time point in the timeline [2009-2018]).
- Results: the results of our evaluation; the folder ALL contains the overall results, and the other folders contain the results by model.

Please refer to the following repositories for the other datasets and pre-trained models:

- Part 1 NVD Vuldeepecker: https://doi.org/10.5281/zenodo.8207883
- Part 2 LINUX: https://doi.org/10.5281/zenodo.10960662
- Part 3 OPENSSL: https://doi.org/10.5281/zenodo.10966117






