Optimal clustering on the real line (Q1116592)
From MaRDI portal
scientific article
Language | Label | Description | Also known as |
---|---|---|---|
English | Optimal clustering on the real line |
scientific article |
Statements
Optimal clustering on the real line (English)
0 references
1988
0 references
A method is proposed for assessing the number of groups (clusters) in a random sample (presented by order statistics) drawn from a continuous population distributed on (a,b)\(\subseteq (-\infty,\infty)\). A sample quantile function is defined to be a piecewise linear interpolation of these order statistics. The method is based on the calculation of an asymptotic nonparametric confidence interval for the fractional reduction of the within-group error due to \((g+1)\)-grouping over g-grouping. The confidence interval theory (as well as the central limit theory) is developed for both universally optimal and bounded locally optimal groupings. The derivation makes use of a Donsker-type theorem for the quantile process that is also proven. The consistency of the estimators used in the proofs of the theorem is shown.
0 references
M-estimator
0 references
quantization
0 references
piecewise linear interpolation of order statistics
0 references
universally optimal groupings
0 references
number of groups
0 references
continuous population
0 references
sample quantile function
0 references
asymptotic nonparametric confidence interval
0 references
within-group error
0 references
central limit theory
0 references
bounded locally optimal groupings
0 references
Donsker-type theorem
0 references
quantile process
0 references
consistency
0 references