Data: Guiding the design of SARS-CoV-2 genomic surveillance by estimating the resolution of outbreak detection

From MaRDI portal
(Redirected from Dataset:6690001)



DOI10.5281/zenodo.6860154Zenodo6860154MaRDI QIDQ6690001FDOQ6690001

Dataset published at Zenodo repository.

Jen Kok, Mailie Gall, Vitali Sintchenko, Grace Blackwell, Dominic E. Dwyer, Alexander P. Drew, Rebecca J. Rockett, Elena Martinez, Carl Suster, Alicia Arnott, Jenny Draper, Sharon C.-A. Chen

Publication date: 22 July 2022

Copyright license: Creative Commons Attribution 4.0 International



Contains data necessary to reproduce the quantitative results related to a SARS-CoV-2 outbreak in NSW, Australia in the associated paper. icpmr_delta_gisaid.csv Tabular data containing the GISAID accession numbers, dates of collection and submission for all sequences used in the NSW outbreak analysis. The epi set is available on GISAID as EPI_SET_220919ef. The wgs_cluster column contains identifiers of genomic clusters defined at ICPMR, NSW Health Pathology. A value of Other means that the sequence either did not belong to a cluster or was part of a cluster that contained fewer than 30 sequences in the study period, and sequences with this value should not be considered to form a single cluster. icpmr_delta_gisaid.dists.tsv.gz Compressed pairwise SNP distance matrix in the format output by snp-dists. The script that creates this file from sequence data is available in the linked code archive.







This page was built for dataset: Data: Guiding the design of SARS-CoV-2 genomic surveillance by estimating the resolution of outbreak detection