physiochemical_protein (Q6037796)

From MaRDI portal
OpenML dataset with id 44963
Language Label Description Also known as
English
physiochemical_protein
OpenML dataset with id 44963

    Statements

    0 references
    0 references
    **Data Description**\N\NThis is a data set of Physicochemical Properties of Protein Tertiary Structure. The data set is taken from CASP 5-9. There are 45730 decoys and size varying from 0 to 21 armstrong.\N\NThe goal of the dataset is to predict the size of the residue for a tertiary protein structure (a 3d protein structure). Once linked in the protein chain, an individual amino acid is called a residue. The target feature is root mean square error of the residue.\N\N**Attribute Description**\N\N1. *RMSD* - size of the residue\N2. *F1* - total surface area\N3. *F2* - non polar exposed area\N4. *F3* - fractional area of exposed non polar residue\N5. *F4* - fractional area of exposed non polar part of residue\N6. *F5* - molecular mass weighted exposed area\N7. *F6* - average deviation from standard exposed area of residue\N8. *F7* - Euclidian distance\N9. *F8* - secondary structure penalty\N10. *F9* - Spacial Distribution constraints (N,K Value)
    0 references
    22 December 2022
    0 references
    RMSD
    0 references
    a7e9bb5d3d78ac0c5aad3edcb26404b0
    0 references
    0
    0 references
    0
    0 references
    10
    0 references
    45,730
    0 references
    0
    0 references
    10
    0 references
    0 references