Optimal clustering on the real line (Q1116592)

From MaRDI portal
scientific article
Language Label Description Also known as
English
Optimal clustering on the real line
scientific article

    Statements

    Optimal clustering on the real line (English)
    0 references
    0 references
    1988
    0 references
    A method is proposed for assessing the number of groups (clusters) in a random sample (presented by order statistics) drawn from a continuous population distributed on (a,b)\(\subseteq (-\infty,\infty)\). A sample quantile function is defined to be a piecewise linear interpolation of these order statistics. The method is based on the calculation of an asymptotic nonparametric confidence interval for the fractional reduction of the within-group error due to \((g+1)\)-grouping over g-grouping. The confidence interval theory (as well as the central limit theory) is developed for both universally optimal and bounded locally optimal groupings. The derivation makes use of a Donsker-type theorem for the quantile process that is also proven. The consistency of the estimators used in the proofs of the theorem is shown.
    0 references
    M-estimator
    0 references
    quantization
    0 references
    piecewise linear interpolation of order statistics
    0 references
    universally optimal groupings
    0 references
    number of groups
    0 references
    continuous population
    0 references
    sample quantile function
    0 references
    asymptotic nonparametric confidence interval
    0 references
    within-group error
    0 references
    central limit theory
    0 references
    bounded locally optimal groupings
    0 references
    Donsker-type theorem
    0 references
    quantile process
    0 references
    consistency
    0 references

    Identifiers