Statistical significance for genomewide studies

Publication:5460791

DOI: 10.1073/pnas.1530509100; zbMath: 1130.62385; OpenAlex: W1964895626; Wikidata: Q24681264; Scholia: Q24681264; MaRDI QID: Q5460791

John D. Storey, Robert Tibshirani

Publication date: 19 July 2005

Published in: Proceedings of the National Academy of Sciences

Full work available at URL: https://doi.org/10.1073/pnas.1530509100



Related Items

DEA model considering outputs with stochastic noise and a heavy-tailed (stable) distribution, A Guide to Teaching Data Science, Has It Really Been Demonstrated That Most Genomic Research Findings Are False?, Multiple Competition-Based FDR Control and Its Application to Peptide Detection, Shrunken p‐Values for Assessing Differential Expression with Applications to Genomic Data Analysis, Weighted step-down confidence procedures with applications to gene expression data, The Sparse MLE for Ultrahigh-Dimensional Feature Screening, A statistical methodology to select covariates in high-dimensional data under dependence. Application to the classification of genetic profiles in oncology, A modified two-sample t-test based on permutation method for large-scale data, Sequence Robust Association Test for Familial Data, Adaptive local false discovery rate procedures for highly spiky data and their application RNA sequencing data of yeast SET4 deletion mutants, Efficient testing and effect size estimation for set‐based genetic association inference via semiparametric multilevel mixture modeling, A nonparametric mixture approach to density and null proportion estimation in large‐scale multiple comparison problems, Screening-assisted dynamic multiple testing with false discovery rate control, Unified Tests for Nonparametric Functions in RKHS With Kernel Selection and Regularization, Large-Scale Inference of Multivariate Regression for Heavy-Tailed and Asymmetric Data, Efficient screening of predictive biomarkers for individual treatment selection, Exceedance control of the false discovery proportion via high precision inversion method of berk-Jones statistics, Change-detection-assisted multiple testing for spatiotemporal data, Statistical proof? 
The problem of irreproducibility, Confidence and discoveries with \(e\)-values, A central limit theorem for the Benjamini-Hochberg false discovery proportion under a factor model, Sequential tests of multiple hypotheses controlling false discovery and nondiscovery rates, F-distribution calibrated empirical likelihood ratio tests for multiple hypothesis testing, Evaluation of false discovery rate and power via sample size in microarray studies, Discovering the False Discovery Rate, Estimating equation-based causality analysis with application to microarray time series data, An overview of recent developments in genomics and associated statistical methods, Multiple Testing in a Two-Stage Adaptive Design With Combination Tests Controlling FDR, Discovering Findings That Replicate From a Primary Study of High Dimension to a Follow-Up Study, False Discovery Rate Estimation for Frequentist Pharmacovigilance Signal Detection Methods, Linear Score Tests for Variance Components in Linear Mixed Models and Applications to Genetic Association Studies, Improved Estimation of the Noncentrality Parameter Distribution from a Large Number of t‐Statistics, with Applications to False Discovery Rate Estimation in Microarray Data Analysis, Minimum profile Hellinger distance estimation for a semiparametric mixture model, Linear Mixed Model Selection for False Discovery Rate Control in Microarray Data Analysis, Statistical significance of combinatorial regulations, False discovery rate estimation for large‐scale homogeneous discrete p‐values, A Note on the Adaptive Control of False Discovery Rates, Assessing Differential Gene Expression with Small Sample Sizes in Oligonucleotide Arrays Using a Mean‐Variance Model, A Simple Diagnostic Plot Connecting Robust Estimation, Outlier Detection, and False Discovery Rates, A Bayesian False Discovery Rate for Multiple Testing, A review of modern multiple hypothesis testing, with particular attention to the false discovery proportion, Statistical and 
Knowledge Supported Visualization of Multivariate Data, Nonparametric Bayesian Estimation of Positive False Discovery Rates, Exploring the Information in p‐Values for the Analysis and Planning of Multiple‐Test Experiments, Controlling the False Discovery Rate for Feature Selection in High‐resolution NMR Spectra, Significant motifs in time series, Functional Modelling of Microarray Time Series, Statistical Methods for Expression Quantitative Trait Loci (eQTL) Mapping, Analyzing Designed Experiments with Multiple Responses, Control of the FWER in Multiple Testing Under Dependence, Application of Biostatistics and Bioinformatics Tools to Identify Putative Transcription Factor-Gene Regulatory Network of Ankylosing Spondylitis and Sarcoidosis, Identifying Genes Associated with a Quantitative Trait or Quantitative Trait Locus via Selective Transcriptional Profiling, A Statistical Procedure for Detecting Highly Correlated Genes with a Pre-Specified Candidate Gene in Microarray Analysis, Bayesian Structure Learning in Multilayered Genomic Networks, On False Discovery and Non‐discovery Proportions of the Dynamic Adaptive Procedure, Sure Independence Screening for Ultrahigh Dimensional Feature Space, Computational Biology: Toward Deciphering Gene Regulatory Information in Mammalian Genomes, Discussion of: Treelets -- an adaptive multi-scale basis for sparse unordered data, Finite skew-mixture models for estimation of positive false discovery rates, Null-free false discovery rate control using decoy permutations, Detection of test speededness using change-point analysis, M-regression, false discovery rates and outlier detection with application to genetic association studies, Optimal false discovery rate control for large scale multiple testing with auxiliary information, Estimating Effect Sizes of Differentially Expressed Genes for Power and Sample-Size Assessments in Microarray Experiments, A two-component nonparametric mixture model with stochastic dominance, Estimating 
the Proportion of True Null Hypotheses in Nonparametric Exponential Mixture Model with Appication to the Leukemia Gene Expression Data, Bayesian analysis of RNA-Seq data using a family of negative binomial models, Comparing five statistical methods of differential methylation identification using bisulfite sequencing data, Comparability of gene expression in human blood, immune and carcinoma cells, An Omnibus Consistent Adaptive Percentile Modified Wilcoxon Rank Sum Test with Applications in Gene Expression Studies, Estimating the proportion of true null hypotheses in multiple testing problems, Estimation of the proportion of true null hypotheses in high-dimensional data under dependence, A parametric model to estimate the proportion from true null using a distribution for \(p\)-values, Asymptotically independent U-statistics in high-dimensional testing, Optimal detection of weak positive latent dependence between two sequences of multiple tests, On Efficient Estimators of the Proportion of True Null Hypotheses in a Multiple Testing Setup, Modifying SAMseq to account for asymmetry in the distribution of effect sizes when identifying differentially expressed genes, Statistical inferences based on outliers for gene expression analysis, Nonparametric estimation of genewise variance for microarray data, On improving some adaptive BH procedures controlling the FDR under dependence, BOPA: A Bayesian hierarchical model for outlier expression detection, Investigations into refinements of Storey's method of multiple hypothesis testing minimising the FDR, and its application to test binomial data, A novel pairwise comparison method for in silico discovery of statistically significant cis-regulatory elements in eukaryotic promoter regions: application to \textit{Arabidopsis}, Hierarchical inference for genome-wide association studies: a view on methodology with software, A graph Laplacian prior for Bayesian variable selection and grouping, A selective overview of feature 
screening for ultrahigh-dimensional data, Gamma-based clustering via ordered means with application to gene-expression analysis, Optimal significance analysis of microarray data in a class of tests whose null statistic can be constructed, An inexact interior point method for \(L_{1}\)-regularized sparse covariance selection, More Powerful and Reliable Second-Level Statistical Randomness Tests for NIST SP 800-22, Bayesian regularization via graph Laplacian, A new method to detect periodically correlated structure, Dynamic adaptive multiple tests with finite sample FDR control, A general framework for multiple testing dependence, Statistical learning and selective inference, A Bernstein-type estimator for decreasing density with application to \(p\)-value adjustments, Control of the false discovery proportion for independently tested null hypotheses, Statistical and computational challenges in whole genome prediction and genome-wide association analyses for plant and animal breeding, Power, FDR and conservativeness of BB-SGoF method, Genome-wide significance levels and weighted hypothesis testing, Structures and assumptions: strategies to harness gene \(\times\) gene and gene \(\times\) environment interactions in GWAS, Nonparametric density estimation for symmetric distributions by contaminated data, Simultaneous critical values for \(t\)-tests in very high dimensions, A Bayesian model averaging approach for observational gene expression studies, Frequentist properties of Bayesian multiplicity control for multiple testing of normal means, Deriving and comparing the distribution for the number of false positives in single step methods to control \(k\)-FWER, Mutual fund performance: false discoveries, bias, and power, Floating prioritized subset analysis: A powerful method to detect differentially expressed genes, Sample size growth with an increasing number of comparisons, Applying shrinkage variance estimators to the TOST test in high dimensional settings, 
Equitability, interval estimation, and statistical power, Bayesian sparse graphical models for classification with application to protein expression data, Estimating the number of genes that are differentially expressed in both of two independent experiments, Estimation procedures for the false discovery rate: a systematic comparison for microarray data, Estimation and testing of gene expression heterosis, Symmetric directional false discovery rate control, Genomic outlier profile analysis: mixture models, null hypotheses, and nonparametric estimation, Efficient p-value estimation in massively parallel testing problems, Identifying temporally differentially expressed genes through functional principal components analysis, Sample size calculations for controlling the distribution of false discovery proportion in microarray experiments, Bayesian Hierarchical Modeling and Selection of Differentially Expressed Genes for the EST Data, Testing Periodicity in Short Series and Application to Gene Expression Data, On estimating the proportion of true null hypotheses for false discovery rate controlling procedures in exploratory DNA microarray studies, Simultaneous control of false positives and false negatives in multiple hypotheses testing, Identifying differentially expressed genes in unreplicated multiple-treatment microarray timecourse experiments, A permutation test motivated by microarray data analysis, A sufficient criterion for control of some generalized error rates in multiple testing, Internal validation inferences of significant genomic features in genome-wide screening, Combining quantitative trait loci analyses and microarray data: an empirical likelihood approach, The beta-binomial distribution for estimating the number of false rejections in microarray gene expression studies, Partition clustering of high dimensional low sample size data based on \(p\)-values, Joint adaptive mean-variance regularization and variance stabilization of high dimensional data, 
Biomarker discovery: classification using pooled samples, Latent rank change detection for analysis of splice-junction microarrays with nonlinear effects, Liquid chromatography mass spectrometry-based proteomics: biological and technological aspects, Asymptotics of Bonferroni for dependent normal test statistics, Robust joint analysis with data fusion in two-stage quantitative trait genome-wide association studies, False discovery rate envelopes, A note on estimating the false discovery rate under mixture model, Bayesian testing of many hypotheses \(\times\) many genes: a study of sleep apnea, A statistical method for estimating the proportion of differentially expressed genes, A cross-validation based estimation of the proportion of true null hypotheses, Distributions associated with simultaneous multiple hypothesis testing, Replicability analysis for genome-wide association studies, Gini correlation for feature screening, Optimal two-stage genome-wide association designs based on false discovery rate, Efficient computer experiment-based optimization through variable selection, HmmSeq: a hidden Markov model for detecting differentially expressed genes from RNA-seq data, A clarifying comparison of methods for controlling the false discovery rate, A new multiple testing method in the dependent case, On a generalized false discovery rate, A semi-parametric approach for mixture models: application to local false discovery rate estimation, Calibration of compositional measurements, A powerful test for ordinal trait genetic association analysis, Tournament screening cum EBIC for feature selection with high-dimensional feature spaces, Multi-subgroup gene screening using semi-parametric hierarchical mixture models and the optimal discovery procedure: Application to a randomized clinical trial in multiple myeloma, Flexible estimation of a semiparametric two-component mixture model with one parametric component, An Optimal Test with Maximum Average Power While
Controlling FDR with Application to RNA‐Seq Data



Cites Work