The Essential Histogram
From MaRDI portal
Abstract: The histogram is widely used as a simple, exploratory display of data, but it is usually not clear how to choose the number and size of bins. We construct a confidence set of distribution functions that optimally address the two main tasks of the histogram: estimating probabilities and detecting features such as increases and modes in the distribution. We define the essential histogram as the histogram in the confidence set with the fewest bins. Thus the essential histogram is the simplest visualization of the data that optimally achieves the main tasks of the histogram. The only assumption we make is that the data are independent and identically distributed. We provide a fast algorithm for the essential histogram, and illustrate our methodology with examples. An R-package is available on CRAN.
Recommendations
Cited in
(12)- Confidence Bands for a Log-Concave Density
- Equal-bin-width histogram versus equal-bin-count histogram
- Combining regular and irregular histograms by penalized likelihood
- Fountains and histograms
- Dynamic quantile function models
- Histogram Contextualization
- Two dimensional histogram analysis using the Helmholtz principle
- A note on the e-a histogram
- A comparison of automatic histogram constructions
- Fast and fully-automated histograms for large-scale data sets
- The median of a set of histogram data
- essHist
This page was built for publication: The Essential Histogram
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q145406)