A robust rerank approach for feature selection and its application to pooling-based GWA studies (Q382672): Difference between revisions

Summary: Large-\(p\)-small-\(n\) datasets are commonly encountered in modern biomedical studies. When the goal is to detect differences between two groups, conventional methods often fail: variance estimates in the \(t\)-test are unstable, and AUC (area under the receiver operating characteristic curve) estimates contain a high proportion of tied values. The significance analysis of microarrays (SAM) may also be unsatisfactory, since its performance is sensitive to its tuning parameter, whose selection is not straightforward. In this work, we propose a robust rerank approach to overcome these difficulties. In particular, we obtain a rank-based statistic for each feature based on the concept of "rank-over-variable". The techniques of "random subset" and "rerank" are then applied iteratively to rank the features, and the leading features are selected for further study. The proposed rerank approach is especially applicable to large-\(p\)-small-\(n\) datasets. Moreover, it is insensitive to the choice of tuning parameters, which is an appealing property for practical implementation. Simulation studies and a real data analysis of pooling-based genome-wide association (GWA) studies demonstrate the usefulness of our method.
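
The summary names three ingredients ("rank-over-variable", "random subset", "rerank") without giving formulas. The Python sketch below is one illustrative reading of those ideas, not the authors' algorithm. The function names, the mean-rank-difference statistic, and the parameters `subset_size`, `n_iter`, and `k` are all assumptions made for illustration.

```python
import numpy as np

def rank_over_variables(X):
    """Rank each sample's values across features (row-wise), from 0 to m-1.

    Assumption: this is how "rank-over-variable" is read here; ties are
    broken by column order.
    """
    return np.argsort(np.argsort(X, axis=1), axis=1)

def rerank_scores(X, y, subset_size=50, n_iter=200, seed=0):
    """Average a rank-based two-group statistic over random feature subsets.

    X: (n_samples, n_features) array; y: array of 0/1 group labels.
    The statistic (absolute difference in mean within-subset rank between
    the two groups) is a hypothetical stand-in for the paper's statistic.
    """
    rng = np.random.default_rng(seed)
    n, p = X.shape
    subset_size = min(subset_size, p)
    total = np.zeros(p)
    count = np.zeros(p)
    for _ in range(n_iter):
        # "Random subset": draw features without replacement.
        idx = rng.choice(p, size=subset_size, replace=False)
        # "Rerank": recompute ranks within the drawn subset only.
        R = rank_over_variables(X[:, idx])
        diff = np.abs(R[y == 1].mean(axis=0) - R[y == 0].mean(axis=0))
        total[idx] += diff
        count[idx] += 1
    # Features never drawn keep a score of 0.
    return total / np.maximum(count, 1)

def select_features(X, y, k=10, **kwargs):
    """Return the indices of the k features with the highest averaged score."""
    scores = rerank_scores(X, y, **kwargs)
    return np.argsort(scores)[::-1][:k]
```

As a usage example, `select_features(X, y, k=10)` with an `(n_samples, n_features)` array `X` and a 0/1 label array `y` returns ten feature indices ordered by decreasing score. Because only ranks enter the statistic, no variance estimate is needed, which is the property the summary emphasizes for small \(n\).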

0 references

describes a project that uses

PLINK

0 references

Haploview

0 references

MaRDI profile type

MaRDI publication profile

0 references

full work available at URL

https://doi.org/10.1155/2013/860673

0 references

cites work

10.1162/153244303322753616

0 references

Significance analysis of microarrays applied to the ionizing radiation response

0 references

Tight Clustering: A Resampling‐Based Approach for Identifying Stable and Tight Patterns in Data

0 references

Theory & Methods: Special Invited Paper: Dimension Reduction and Visualization in Discriminant Analysis (with discussion)

0 references

Identifiers

zbMATH Open document ID

1275.62076

0 references

DOI

10.1155/2013/860673

0 references

Mathematics Subject Classification ID

0 references

0 references

0 references

0 references

0 references

Sitelinks

Mathematics (1 entry)

mardi Publication:382672

@@ Property / Wikidata QID @@
+Q36801958
@@ Property / Wikidata QID: Q36801958 / rank @@
+Normal rank
@@ Property / describes a project that uses @@
+PLINK
@@ Property / describes a project that uses: PLINK / rank @@
+Normal rank
@@ Property / describes a project that uses @@
+Haploview
@@ Property / describes a project that uses: Haploview / rank @@
+Normal rank
@@ Property / MaRDI profile type @@
+MaRDI publication profile
@@ Property / MaRDI profile type: MaRDI publication profile / rank @@
+Normal rank
@@ Property / full work available at URL @@
+https://doi.org/10.1155/2013/860673
+Normal rank
@@ Property / OpenAlex ID @@
+W2170147419
@@ Property / OpenAlex ID: W2170147419 / rank @@
+Normal rank
@@ Property / cites work @@
+10.1162/153244303322753616
@@ Property / cites work: 10.1162/153244303322753616 / rank @@
+Normal rank
@@ Property / cites work @@
+Significance analysis of microarrays applied to the ionizing radiation response
+Normal rank
@@ Property / cites work @@
+Tight Clustering: A Resampling‐Based Approach for Identifying Stable and Tight Patterns in Data
+Normal rank
@@ Property / cites work @@
+Theory & Methods: Special Invited Paper: Dimension Reduction and Visualization in Discriminant Analysis (with discussion)
+Normal rank