On-line predictive linear regression

From MaRDI portal
Publication:1018652

DOI10.1214/08-AOS622zbMATH Open1160.62065arXivmath/0511522OpenAlexW3104560798WikidataQ62046709 ScholiaQ62046709MaRDI QIDQ1018652FDOQ1018652

Alexander Gammerman, Ilia Nouretdinov, Vladimir Vovk

Publication date: 20 May 2009

Published in: The Annals of Statistics (Search for Journal in Brave)

Abstract: We consider the on-line predictive version of the standard problem of linear regression; the goal is to predict each consecutive response given the corresponding explanatory variables and all the previous observations. We are mainly interested in prediction intervals rather than point predictions. The standard treatment of prediction intervals in linear regression analysis has two drawbacks: (1) the classical prediction intervals guarantee that the probability of error is equal to the nominal significance level epsilon, but this property per se does not imply that the long-run frequency of error is close to epsilon; (2) it is not suitable for prediction of complex systems as it assumes that the number of observations exceeds the number of parameters. We state a general result showing that in the on-line protocol the frequency of error for the classical prediction intervals does equal the nominal significance level, up to statistical fluctuations. We also describe alternative regression models in which informative prediction intervals can be found before the number of observations exceeds the number of parameters. One of these models, which only assumes that the observations are independent and identically distributed, is popular in machine learning but greatly underused in the statistical theory of regression.


Full work available at URL: https://arxiv.org/abs/0906.3123




Recommendations




Cites Work


Cited In (17)





This page was built for publication: On-line predictive linear regression

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q1018652)