Protein Function Embeddings: First Beta Release of Datasets
DOI10.5281/zenodo.7793384Zenodo7793384MaRDI QIDQ6693412FDOQ6693412
Dataset published at Zenodo repository.
Leyla Jael Castro, Dietrich Rebholz-Schuhmann, Rohitha Ravinder
Publication date: 2 April 2023
Copyright license: Creative Commons Attribution 4.0 International
This release corresponds to the datasets generated from athesis work that explores how information for protein functions can be exploited through embeddings so that the produced information can be used to improve protein function annotations. The underlying hypothesis here is that any pair of proteins with high sequence similarity will also share a similar biological function which would be reflected by the corresponding protein embeddings. The comparison and evaluation of this is done using two text-driven embedding approaches: Word2doc2Vec and Hybrid-Word2doc2Vec.
This page was built for dataset: Protein Function Embeddings: First Beta Release of Datasets