Protein Function Embeddings: First Beta Release of Datasets

From MaRDI portal
Dataset:6693412



DOI10.5281/zenodo.7793384Zenodo7793384MaRDI QIDQ6693412FDOQ6693412

Dataset published at Zenodo repository.

Leyla Jael Castro, Dietrich Rebholz-Schuhmann, Rohitha Ravinder

Publication date: 2 April 2023

Copyright license: Creative Commons Attribution 4.0 International



This release corresponds to the datasets generated from athesis work that explores how information for protein functions can be exploited through embeddings so that the produced information can be used to improve protein function annotations. The underlying hypothesis here is that any pair of proteins with high sequence similarity will also share a similar biological function which would be reflected by the corresponding protein embeddings. The comparison and evaluation of this is done using two text-driven embedding approaches: Word2doc2Vec and Hybrid-Word2doc2Vec.







This page was built for dataset: Protein Function Embeddings: First Beta Release of Datasets