A robust rerank approach for feature selection and its application to pooling-based GWA studies (Q382672): Difference between revisions

From MaRDI portal
Property / describes a project that uses: Haploview
Property / MaRDI profile type: MaRDI publication profile
Property / full work available at URL: https://doi.org/10.1155/2013/860673
Property / OpenAlex ID: W2170147419
Property / cites work: 10.1162/153244303322753616
Property / cites work: Significance analysis of microarrays applied to the ionizing radiation response
Property / cites work: Tight Clustering: A Resampling‐Based Approach for Identifying Stable and Tight Patterns in Data
Property / cites work: Theory & Methods: Special Invited Paper: Dimension Reduction and Visualization in Discriminant Analysis (with discussion)

Latest revision as of 01:42, 7 July 2024

scientific article
Language: English
Label: A robust rerank approach for feature selection and its application to pooling-based GWA studies
Description: scientific article

    Statements

    A robust rerank approach for feature selection and its application to pooling-based GWA studies (English)
    21 November 2013
    Summary: Large-\(p\)-small-\(n\) datasets are commonly encountered in modern biomedical studies. When the goal is to detect differences between two groups, conventional methods often fail because variance estimates in the \(t\)-test are unstable and AUC (area under the receiver operating characteristic curve) estimates contain a high proportion of tied values. The significance analysis of microarrays (SAM) may also be unsatisfactory, since its performance is sensitive to a tuning parameter whose selection is not straightforward. In this work, we propose a robust rerank approach to overcome the above-mentioned difficulties. In particular, we obtain a rank-based statistic for each feature based on the concept of ``rank-over-variable''. Techniques of ``random subset'' and ``rerank'' are then iteratively applied to rank features, and the leading features are selected for further study. The proposed rerank approach is especially applicable to large-\(p\)-small-\(n\) datasets. Moreover, it is insensitive to the selection of tuning parameters, which is an appealing property for practical implementation. Simulation studies and real data analysis of pooling-based genome-wide association (GWA) studies demonstrate the usefulness of our method.
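The summary names three ingredients (a rank-based statistic built from ranks taken over variables, random feature subsets, and repeated reranking) without giving their exact definitions. The following is only a minimal sketch of how such a loop could be put together, not the authors' algorithm. Everything in it is an assumption made for illustration: the function names `rank_within_sample` and `rerank_scores`, the choice of statistic (absolute difference in mean within-sample rank between the two groups), the subset fraction, and the averaging of normalized positions.

```python
import numpy as np


def rank_within_sample(X):
    """Rank each sample's values across features (1 = smallest).

    Assumed reading of "rank-over-variable"; ties are broken by column order.
    """
    order = np.argsort(X, axis=1, kind="stable")
    return np.argsort(order, axis=1, kind="stable") + 1


def rerank_scores(X, y, n_iter=200, subset_frac=0.5, seed=0):
    """Average normalized position of each feature over random subsets.

    X: (n_samples, n_features) array; y: 0/1 group labels.
    Returns scores in [0, 1]; larger means more consistently top-ranked.
    """
    rng = np.random.default_rng(seed)
    n_features = X.shape[1]
    k = max(2, int(subset_frac * n_features))
    total = np.zeros(n_features)
    count = np.zeros(n_features)
    for _ in range(n_iter):
        # Hypothetical "random subset" step: draw k features without replacement.
        idx = rng.choice(n_features, size=k, replace=False)
        # Hypothetical "rerank" step: recompute ranks inside the subset only.
        ranks = rank_within_sample(X[:, idx])
        stat = np.abs(ranks[y == 1].mean(axis=0) - ranks[y == 0].mean(axis=0))
        # Position of each feature by statistic (0 = weakest, k - 1 = strongest).
        position = np.argsort(np.argsort(stat, kind="stable"), kind="stable")
        total[idx] += position / (k - 1)
        count[idx] += 1
    return total / np.maximum(count, 1)


if __name__ == "__main__":
    # Synthetic large-p-small-n data: 20 samples, 500 features, 5 shifted.
    rng = np.random.default_rng(1)
    y = np.repeat([0, 1], 10)
    X = rng.normal(size=(20, 500))
    X[y == 1, :5] += 3.0
    scores = rerank_scores(X, y)
    print("leading features:", np.argsort(scores)[::-1][:10])
```

Because each statistic is computed from ranks rather than raw values, no per-feature variance estimate is needed, which is the property the summary emphasizes for small \(n\). How closely this matches the published procedure cannot be determined from the summary alone.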

    Identifiers