Association Journal of CSIAM
Supervised by Ministry of Education of PRC
Sponsored by Xi'an Jiaotong University
ISSN 1005-3085  CN 61-1269/O1

Chinese Journal of Engineering Mathematics ›› 2026, Vol. 42 ›› Issue (6): 1149-1170.doi: 10.3969/j.issn.1005-3085.2025.06.012

Previous Articles     Next Articles

Variable Selection for High-dimensional Partially Linear Models with Longitudinal Data Based on Orthogonal Projection#br#

YANG Yiping1,2,  QIN Renyu1,  ZHAO Peixin1   

  1. 1. School of Mathematics and Statistics, Chongqing Technology and Business University, Chongqing, 400067

    2. Chongqing Key Laboratory of Statistical Intelligent Computing and Monitoring, Chongqing Technology and Business University, Chongqing, 400067

  • Received:2023-03-22 Accepted:2024-06-10 Online:2025-12-15 Published:2026-02-15
  • Supported by:
    The Natural Science Foundation of Chongqing (cstc2021jcyj-msxmX0079); the Education Commission Humanities and Social Sciences General Project of Chongqing (21SIGH118).

Abstract:

The variable selection problem of a partially linear model is considered when the number of the parameters diverges for longitudinal data. The nonparametric components are first eliminated by B-spline and QR decomposition, and the penalized objective function of the regression coefficient in the partial linear model is constructed by combining SCAD penalty and quadratic inference function, the variable selection and estimation of regression coefficients are obtained at the same time. Further, the estimates of the nonparametric components are obtained by combination with the quadratic inference function. The asymptotic properties of the regression coefficient estimates and nonparametric component estimates are demonstrated under some regular conditions. The simulation results show that the proposed method has a good estimation effect regardless of whether the correlation structure of longitudinal data is specified correctly. Finally, the method proposed is used to analyze the factors affecting the operating performance of real estate listed companies.

Key words: longitudinal data, QR decomposition, quadratic inference function, SCAD penalty, variable selection

CLC Number: