Detecting multiple outliers in linear regression using a cluster method combined with graphical visualization (Q2271696): Difference between revisions

From MaRDI portal
Added link to MaRDI item.
RedirectionBot (talk | contribs)
Removed claim: author (P16): Item:Q477983
Property / author
 
Property / author: Wojtek Janusz Krzanowski / rank
Normal rank
 

Revision as of 14:13, 15 February 2024

scientific article
Language Label Description Also known as
English
Detecting multiple outliers in linear regression using a cluster method combined with graphical visualization
scientific article

    Statements

    Detecting multiple outliers in linear regression using a cluster method combined with graphical visualization (English)
    0 references
    0 references
    8 August 2009
    0 references
    A new algorithm is proposed for multiple outliers detection in regression models. The idea is to apply a hierarchical clustering algorithm to the scatterplot of standardized predicted values and residuals (obtained by ordinary least squares) in order to find a single largest cluster which is considered as a set of inliers. Then the \(t\)-statistic is used to test all other observations as candidate outliers individually against the group of inliers. Results of real data sets analyses are presented.
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    hierarchical clustering
    0 references
    least squares
    0 references
    swamping
    0 references
    minimal spanning trees
    0 references