Datasets, alphabets and models from paper 'Reverse Engineering Molecules from Fingerprints through Deterministic Enumeration and Generative Models.

From MaRDI portal
(Redirected from Dataset:6711273)




Files utilized and produced within the molecule-signature project: alphabets.zip: Alphabets of molecule signatures. datasets.zip: Datasets from MetaNetX, eMolecules, and DrugBank used to build alphabets and train the generative models. models.zip: PyTorch/Lightning models and SentencePiece tokenization models for decoding SMILES from ECFP. See embedded README.md files and the publication for in depth details.











This page was built for dataset: Datasets, alphabets and models from paper 'Reverse Engineering Molecules from Fingerprints through Deterministic Enumeration and Generative Models.