Association Journal of CSIAM
Supervised by Ministry of Education of PRC
Sponsored by Xi'an Jiaotong University
ISSN 1005-3085  CN 61-1269/O1

Chinese Journal of Engineering Mathematics ›› 2023, Vol. 40 ›› Issue (1): 55-68.doi: 10.3969/j.issn.1005-3085.2023.01.004

Previous Articles     Next Articles

Statistical Diagnosis of Location Regression Model Based on Pena Distance under Skew Laplace Normal Data

ZHENG Guifen1,   WANG Danlu2,   WU Liucang2   

  1. 1. College of Artificial Intelligence, Wenshan University, Wenshan, Yunnan 663000;
    2. Faculty of Science, Kunming University of Science and Technology, Kunming, Yunnan 650093
  • Online:2023-02-15 Published:2023-04-11
  • Supported by:
    The National Natural Science Foundation of China (11861041; 11261025).

Abstract:

At present, data with sharp peaks, thick tails and skew appear in medicine, sociology, biology and other fields. For such data, adopting the Skew Laplace normal data to fit will get more accurate results. At the same time, in statistics, abnormal points or strong influence points will have a great impact on the results of statistical diagnosis, and hence the diagnosis of abnormal points or strong influence points is particularly important. Common methods such as Likelihood distance, Cook distance, etc., study the impact of deleting a point (group) on the regression analysis and predicted value. In the reasearch, the influence of Pena distance on the regression value and predicted value of a specific point after the deletion of each point in the sample is studied. Moreover, the influence of Pena distance on the Location regression model in the Skew Laplace normal data is studied, and the EM algorithm is applied to make a statistical diagnosis of the location regression model in the Skew Laplace normal distribution. The expression of the Pena distance and the discrimination method of high-leverage outliers under the location regression model with Skew Laplace normal data are obtained. The comparsion shows the Pena distance is compared with Cook distance and Likelihood distance, and the Pena distance is better than Cook distance and Likelihood distance in some cases. Through Monte Carlo simulation and a real example analysis, the proposed model and the proposed method are shown to be reasonable.

Key words: Pena distance, skew Laplace normal distribution, location regression model, EM algorithm

CLC Number: