codrna (Q6033107)

From MaRDI portal
OpenML dataset with id 351
Language Label Description Also known as
English
codrna
OpenML dataset with id 351

    Statements

    0 references
    **Author**: Andrew V Uzilov","Joshua M Keegan","David H Mathews. \N**Source**: [original](http://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets) - \N**Please cite**: [AVU06a]\NAndrew V Uzilov, Joshua M Keegan, and David H Mathews. \NDetection of non-coding RNAs on the basis of predicted secondary structure formation free energy change. \NBMC Bioinformatics, 7(173), 2006. \N\NThis is the cod-rna dataset, retrieved 2014-11-14 from the libSVM site. Additional to the preprocessing done there (see LibSVM site for details), this dataset was created as follows: \N-join test, train and rest datasets \N-normalize each file columnwise according to the following rules: \N-If a column only contains one value (constant feature), it will set to zero and thus removed by sparsity. \N-If a column contains two values (binary feature), the value occuring more often will be set to zero, the other to one. \N-If a column contains more than two values (multinary/real feature), the column is divided by its std deviation. \N\NNOTE: please keep in mind that cod-rna has many duplicated data points, within each file (train,test,rest) and also accross these files. these duplicated points have not been removed!
    0 references
    Andrew V Uzilov
    0 references
    Joshua M Keegan
    0 references
    David H Mathews.
    0 references
    2006
    0 references
    29 August 2014
    0 references
    Y
    0 references
    0 references
    https://bmcbioinformatics.biomedcentral.com/articles/10.1186/1471-2105-7-173
    0 references
    35122feb0ca0caf066073d28c3de40f8
    0 references
    1
    0 references
    2
    0 references
    9
    0 references
    488,565
    0 references
    0
    0 references
    0 references

    Identifiers

    0 references