Data from: Genome-scale annotation of protein binding sites via language model and geometric deep learning

From MaRDI portal
Dataset:6710241



DOI10.5281/zenodo.10845362Zenodo10845362MaRDI QIDQ6710241FDOQ6710241

Dataset published at Zenodo repository.

Qianmu Yuan, Yuedong Yang

Publication date: 20 March 2024

Copyright license: MIT license



The dataset contains the training and test sets of protein binding sites with DNA, RNA, peptide, protein, ATP, HEM, Zn2+, Ca2+, Mg2+ and Mn2+. Each protein is associated with 3 lines indicating the protein name (PDB accession code and chain), sequence and residue labels (0 for non-binding and 1 for binding), respectively. The ESMFold-predicted structures are also provided.







This page was built for dataset: Data from: Genome-scale annotation of protein binding sites via language model and geometric deep learning