In-season prediction of batting averages: a field test of empirical Bayes and Bayes methodol\-ogies
From MaRDI portal
Publication:2482971
Abstract: Batting average is one of the principle performance measures for an individual baseball player. It is natural to statistically model this as a binomial-variable proportion, with a given (observed) number of qualifying attempts (called ``at-bats), an observed number of successes (``hits) distributed according to the binomial distribution, and with a true (but unknown) value of that represents the player's latent ability. This is a common data structure in many statistical applications; and so the methodological study here has implications for such a range of applications. We look at batting records for each Major League player over the course of a single season (2005). The primary focus is on using only the batting records from an earlier part of the season (e.g., the first 3 months) in order to estimate the batter's latent ability, , and consequently, also to predict their batting-average performance for the remainder of the season. Since we are using a season that has already concluded, we can then validate our estimation performance by comparing the estimated values to the actual values for the remainder of the season. The prediction methods to be investigated are motivated from empirical Bayes and hierarchical Bayes interpretations. A newly proposed nonparametric empirical Bayes procedure performs particularly well in the basic analysis of the full data set, though less well with analyses involving more homogeneous subsets of the data. In those more homogeneous situations better performance is obtained from appropriate versions of more familiar methods. In all situations the poorest performing choice is the na"{{i}}ve predictor which directly uses the current average to predict the future average.
Recommendations
- Estimation with Selected Binomial Information or Do You Really Believe That Dave Winfield is Batting .471?
- scientific article; zbMATH DE number 1031815
- scientific article; zbMATH DE number 775737
- Bayes and Empirical Bayes Procedures for Comparing Parameters
- Model-based estimation of baseball batting metrics
Cites work
- scientific article; zbMATH DE number 1634843 (Why is no real title available?)
- scientific article; zbMATH DE number 3124366 (Why is no real title available?)
- scientific article; zbMATH DE number 3557007 (Why is no real title available?)
- scientific article; zbMATH DE number 1220667 (Why is no real title available?)
- scientific article; zbMATH DE number 720689 (Why is no real title available?)
- scientific article; zbMATH DE number 3205757 (Why is no real title available?)
- scientific article; zbMATH DE number 3068128 (Why is no real title available?)
- A test for the Poisson distribution
- Admissible Estimators, Recurrent Diffusions, and Insoluble Boundary Value Problems
- Estimation of the mean of a multivariate normal distribution
- Estimation with Incompletely Specified Loss Functions (the Case of Several Location Parameters)
- Estimation with quadratic loss.
- Estimators for Gaussian models having a block-wise structure
- Nonparametric empirical Bayes and compound decision approaches to estimation of a high-dimensional vector of normal means
- Proper Bayes Minimax Estimators of the Multivariate Normal Mean
- Robbins, empirical Bayes and microarrays
- Statistical decision theory and Bayesian analysis. 2nd ed
- THE TRANSFORMATION OF POISSON, BINOMIAL AND NEGATIVE-BINOMIAL DATA
- Tables of the Freeman-Tukey transformations for the binomial and Poisson distributions
- The Efficiency of Logistic Regression Compared to Normal Discriminant Analysis
- The General Canonical Correlation Distribution
- The root-unroot algorithm for density estimation as implemented via wavelet block thresholding
- Transformations Related to the Angular and the Square Root
Cited in
(31)- Estimation with Selected Binomial Information or Do You Really Believe That Dave Winfield is Batting .471?
- A survey of nonparametric mixing density estimation via the predictive recursion algorithm
- Interval estimation for messy observational data
- On general maximum likelihood empirical Bayes estimation of heteroscedastic IID normal means
- Nuclear penalized multinomial regression with an application to predicting at bat outcomes in baseball
- Optimal shrinkage estimation of mean parameters in family of distributions with quadratic variance
- SURE estimates under dependence and heteroscedasticity
- Nonparametric hierarchical Bayes analysis of binomial data via Bernstein polynomial priors
- Statistical theory powering data science
- A general framework for empirical Bayes estimation in discrete linear exponential family
- Estimating the mean and variance of a high-dimensional normal distribution using a mixture prior
- Bayesball: A Bayesian hierarchical model for evaluating fielding in major league baseball
- scientific article; zbMATH DE number 775737 (Why is no real title available?)
- An empirical Bayes mixture method for effect size and false discovery rate estimation
- On generalized geometric distributions and improved estimation of batting average in cricket
- Credit portfolios, credibility theory, and dynamic empirical Bayes
- Approximate repeated-measures shrinkage
- Group-linear empirical Bayes estimates for a heteroscedastic normal mean
- Competing process hazard function models for player ratings in ice hockey
- Model-based estimation of baseball batting metrics
- Online estimation of individual-level effects using streaming shrinkage factors
- Modelling the Evolution of Distributions: An Application to Major League Baseball
- Semi-parametric Bayesian inference for multi-season baseball data
- Hierarchical Bayesian modeling of hitting performance in baseball
- Empirical estimates for heteroscedastic hierarchical dynamic normal models
- Nonparametric empirical Bayes and compound decision approaches to estimation of a high-dimensional vector of normal means
- Asymptotically optimal nonparametric empirical Bayes via predictive recursion
- A nonparametric empirical Bayes approach to adaptive minimax estimation
- Approximate nonparametric maximum likelihood for mixture models: a convex optimization approach to fitting arbitrary multivariate mixing distributions
- Adaptive sparse estimation with side information
- On SURE estimates in hierarchical models assuming heteroscedasticity for both levels of a two-level normal hierarchical model
This page was built for publication: In-season prediction of batting averages: a field test of empirical Bayes and Bayes methodol\-ogies
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q2482971)