Variant Calling Files supporting leptospira analysis

From MaRDI portal
(Redirected from Dataset:6699521)



DOI10.5281/zenodo.7584745Zenodo7584745MaRDI QIDQ6699521FDOQ6699521

Dataset published at Zenodo repository.

Thomas Cokelaer

Publication date: 30 January 2023

Copyright license: Creative Commons Attribution 4.0 International



This data set contains the results of a variant calling analysis performed on 20 samples named X_SX for sample 1 to 16 and named B3288_2 B3288_8 B3288_10 and B3288_16 for samples named 17 to 20. All variant calling analysis were performed with Sequana variant calling pipeline (https://github.com/sequana/variant_calling) but only the final VCF files are provided in this dataset. One VCF file per sample and per reference genome used. There were 273 core genomes used; Therefore we have here 273 time 20 VCF files. In the summary directory one can also find the CSV files for SNPs and INDELs that were extracted from the VCF files using several filtering (depth of 10, strand balance 0.2, min frequency =0.5). More information and notebooks generating and using those files can be found here:https://github.com/biomics-pasteur-fr/manuscript_capture_leptospira/ A tagged version of this github repository is provided as version 1 : manuscript_capture_leptospira-1.tar.gz







This page was built for dataset: Variant Calling Files supporting leptospira analysis