Fast Improvised Influential Distance for the Identification of Influential Observations in Multiple Linear Regression
Author(s) - Habshah Midi, Muhammad Sani, Shelan Saied Ismaeel, Jayanthi Arasan
Publication year - 2021
Publication title - Sains Malaysiana
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.251
H-Index - 29
eISSN - 2735-0118
pISSN - 0126-6039
DOI - 10.17576/jsm-2021-5007-22
Subject(s) - leverage (statistics) , linear regression , identification (biology) , monte carlo method , computer science , regression , regression analysis , statistics , algorithm , econometrics , artificial intelligence , machine learning , mathematics , botany , biology
Influential observations (IO) are observations that are responsible for misleading conclusions about the fit of a multiple linear regression model. Existing IO identification methods, such as the influential distance (ID), are not very successful in detecting IO. It is suspected that the ID employs an inefficient method, with a long computational running time, to identify the suspected IO at the initial step. Moreover, this method declares good leverage observations to be IO, resulting in misleading conclusions. In this paper, we propose the fast improvised influential distance (FIID), which can successfully identify IO, good leverage observations, and regular observations with a shorter computational running time. A Monte Carlo simulation study and real data examples show that the FIID correctly identifies genuine IO in a multiple linear regression model, with no masking and a negligible swamping rate.
