AP_Colon_Lung (Q6033845)

From MaRDI portal
OpenML dataset with id 1126
Language Label Description Also known as
English
AP_Colon_Lung
OpenML dataset with id 1126

    Statements

    0 references
    0 references
    **Author**: \N**Source**: Unknown - Date unknown \N**Please cite**: \N\NGEMLeR provides a collection of gene expression datasets that can be used for benchmarking gene expression oriented machine learning algorithms. They can be used for estimation of different quality metrics (e.g. accuracy, precision, area under ROC curve, etc.) for classification, feature selection or clustering algorithms.\N\NThis repository was inspired by an increasing need in machine learning / bioinformatics communities for a collection of microarray classification problems that could be used by different researches. This way many different classification or feature selection techniques can finally be compared to eachother on the same set of problems.\N\NOrigin of data\N\NEach gene expression sample in GEMLeR repository comes from a large publicly available expO (Expression Project For Oncology) repository by International Genomics Consortium.\N\NThe goal of expO and its consortium supporters is to procure tissue samples under standard conditions and perform gene expression analyses on a clinically annotated set of deidentified tumor samples. The tumor data is updated with clinical outcomes and is released into the public domain without intellectual property restriction. The availability of this information translates into direct benefits for patients, researchers and pharma alike.\N\NSource: expO website\NAlthough there are various other sources of gene expression data available, a decision to use data from expO repository was made because of:\N- consistency of tissue samples processing procedure\N- same microarray platform used for all samples\N- availability of additional information for combined genotype-phenotype studies\N- availability of a large number of samples for different tumor types\N\NIn case of publishing material based on GEMLeR datasets, then, please note the assistance you received by using this repository. This will help others to obtain the same datasets and replicate your experiments. Please cite as follows when referring to this repository:\N\NStiglic, G., & Kokol, P. (2010). Stability of Ranked Gene Lists in Large Microarray Analysis Studies. Journal of biomedicine biotechnology, 2010, 616358.\N\NYou are also welcome to acknowledge the contribution of expO (Expression Project For Oncology) and International Genomics Consortium for providing their gene expression samples to the public.
    0 references
    7 October 2014
    0 references
    Tissue
    0 references
    0 references
    0 references
    35f668fdc0644afc7bec0e7ad6f961d6
    0 references
    1
    0 references
    2
    0 references
    10,936
    0 references
    412
    0 references
    0
    0 references
    10,935
    0 references
    0 references