Detecting multiple outliers in linear regression using a cluster method combined with graphical visualization (Q2271696): Difference between revisions
From MaRDI portal
Set profile property. |
Set OpenAlex properties. |
||
Property / full work available at URL | |||
Property / full work available at URL: https://doi.org/10.1007/s00180-007-0026-3 / rank | |||
Normal rank | |||
Property / OpenAlex ID | |||
Property / OpenAlex ID: W1995900471 / rank | |||
Normal rank |
Revision as of 02:26, 20 March 2024
scientific article
Language | Label | Description | Also known as |
---|---|---|---|
English | Detecting multiple outliers in linear regression using a cluster method combined with graphical visualization |
scientific article |
Statements
Detecting multiple outliers in linear regression using a cluster method combined with graphical visualization (English)
0 references
8 August 2009
0 references
A new algorithm is proposed for multiple outliers detection in regression models. The idea is to apply a hierarchical clustering algorithm to the scatterplot of standardized predicted values and residuals (obtained by ordinary least squares) in order to find a single largest cluster which is considered as a set of inliers. Then the \(t\)-statistic is used to test all other observations as candidate outliers individually against the group of inliers. Results of real data sets analyses are presented.
0 references
hierarchical clustering
0 references
least squares
0 references
swamping
0 references
minimal spanning trees
0 references