Valid post-selection inference


DOI: 10.1214/12-AOS1077; zbMATH Open: 1267.62080; arXiv: 1306.1059; OpenAlex: W2009462809; MaRDI QID: Q355109; FDO: Q355109


Authors: Andreas Buja, Linda Zhao, Richard Berk, Lawrence Brown, Kai Zhang


Publication date: 24 July 2013

Published in: The Annals of Statistics

Abstract: It is common practice in statistical data analysis to perform data-driven variable selection and derive statistical inference from the resulting model. Such inference enjoys none of the guarantees that classical statistical theory provides for tests and confidence intervals when the model has been chosen a priori. We propose to produce valid "post-selection inference" by reducing the problem to one of simultaneous inference and hence suitably widening conventional confidence and retention intervals. Simultaneity is required for all linear functions that arise as coefficient estimates in all submodels. By purchasing "simultaneity insurance" for all possible submodels, the resulting post-selection inference is rendered universally valid under all possible model selection procedures. This inference is therefore generally conservative for particular selection procedures, but it is always less conservative than full Scheffé protection. Importantly, it does not depend on the truth of the selected submodel, and hence it produces valid inference even in wrong models. We describe the structure of the simultaneous inference problem and give some asymptotic results.
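The reduction to simultaneous inference described in the abstract can be illustrated with a small Monte Carlo sketch. It is not the authors' algorithm or software. It assumes a design matrix of full column rank, a known error variance (so the t-ratios are standard normal), and a small number of predictors so that all submodels can be enumerated. The function names `posi_directions` and `posi_constant` are hypothetical. For every submodel M and every predictor j in M, the sketch forms the unit vector obtained by adjusting column j for the other columns in M, then estimates the constant K such that the maximum absolute t-ratio over all these directions stays below K with probability 1 - alpha.

```python
import itertools
import numpy as np


def posi_directions(X):
    """Unit vectors l_{j.M} for all nonempty submodels M and all j in M.

    Assumes X (n x p) has full column rank. Directions are expressed in an
    orthonormal basis of the column space of X, so each has length p.
    """
    _, p = X.shape
    # X = Q R with orthonormal Q; column j of R holds the coordinates of X_j.
    _, R = np.linalg.qr(X)
    dirs = []
    for size in range(1, p + 1):
        for M in itertools.combinations(range(p), size):
            for j in M:
                others = [k for k in M if k != j]
                v = R[:, j].copy()
                if others:
                    # Residual of X_j after regressing on the other columns in M.
                    A = R[:, others]
                    coef, *_ = np.linalg.lstsq(A, v, rcond=None)
                    v = v - A @ coef
                dirs.append(v / np.linalg.norm(v))
    return np.array(dirs)


def posi_constant(X, alpha=0.05, n_sim=20000, seed=0):
    """Monte Carlo estimate of K (known-variance case, an assumption)."""
    L = posi_directions(X)
    rng = np.random.default_rng(seed)
    # Projection of standard normal errors onto the column space of X.
    Z = rng.standard_normal((n_sim, L.shape[1]))
    max_abs_t = np.abs(Z @ L.T).max(axis=1)
    return float(np.quantile(max_abs_t, 1 - alpha))


if __name__ == "__main__":
    rng = np.random.default_rng(1)
    X = rng.standard_normal((30, 4))  # illustrative design, not real data
    print("estimated K:", posi_constant(X))
```

Under these assumptions the estimate should fall between the usual two-sided normal quantile (about 1.96 at alpha = 0.05) and the known-variance Scheffé constant, the square root of the 0.95 quantile of a chi-squared distribution with p degrees of freedom (about 3.08 for p = 4). The enumeration produces p * 2^(p-1) directions, so this brute-force approach is only practical for small p.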


Full work available at URL: https://arxiv.org/abs/1306.1059




