DeepCNA: an explainable deep learning method for cancer diagnosis and cancer-specific patterns of copy number aberrations
DOI10.5281/zenodo.14892622Zenodo14892622MaRDI QIDQ6693374FDOQ6693374
Dataset published at Zenodo repository.
Publication date: 19 February 2025
Copyright license: Creative Commons Attribution 4.0 International
Data associated with the manuscript: https://www.biorxiv.org/content/10.1101/2024.03.08.584160v1 A deep learning approach classified samples according to cancer primary site, based only on binned copy number aberations. Guided integrated gradients was then used to calculate an attribute score for each bin, representing how important that genomic location was in the cancer classification. Output of the analysis: Binned copy numbers for 6707 samples from the Genomics England Limited data set (channel 0 = total copy number, channel 1 = minor allele copy number) Attribute scores for how important each genomic bin was for the cancer classification (channel 0 and channel 1) Metadata linking rows in the binned data above to sample information Input data for the analysis DATA_CN.tgz contains each sample's copy number profile as an .npy file DATA.tgz contains each sample's normalised copy number profile as an .npy file. This is the input to the neural network training in the DeepCNA repository
This page was built for dataset: DeepCNA: an explainable deep learning method for cancer diagnosis and cancer-specific patterns of copy number aberrations