Imputation panel for low-pass whole genome sequencing (GLIMPSE2 format)
DOI10.5281/zenodo.14171544Zenodo14171544MaRDI QIDQ6681610FDOQ6681610
Dataset published at Zenodo repository.
Publication date: 15 November 2024
Copyright license: Creative Commons Attribution 4.0 International
This dataset includes autosomal genotypes from the 1000 Genomes +HGDP project (10.1101/2023.01.23.525248 ) as well as X chromosome genotypes from the NY Genome Center (as of yet, a comparable dataset that includes HGDP is not available for the X; see 10.1016/j.cell.2022.08.004). The genotypes were down-sampled so as to be appropriate for low-pass imputation; uncertain phase calls were removed (any PP tags), and individuals deemed to be outliers or relatives (based on autosomal data, as per the first citation) were also removed. Similarly, singleton polymorphisms were also excluded. Hemizygous genotypes on the X were converted into (quasi) diploid genotypes. These data were then converted into a binary imputation panel format using glimpse v2 (https://odelaneau.github.io/GLIMPSE/; using the static binaries provided). The "chunk" size was doubled from the defaults (which considers a minimum number of snps, genetic length and physical length) so as to be more performant.
This page was built for dataset: Imputation panel for low-pass whole genome sequencing (GLIMPSE2 format)