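Journal of the China Society for Industrial and Applied Mathematics
Supervised by: Ministry of Education of the People's Republic of China
Sponsored by: Xi'an Jiaotong University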
ISSN 1005-3085  CN 61-1269/O1

Chinese Journal of Engineering Mathematics ›› 2024, Vol. 41 ›› Issue (2): 199-216. doi: 10.3969/j.issn.1005-3085.2024.02.001

Research on Weight Adjustment of Sampling Survey of Industrial Enterprises under the Designated Size

JIANG Tianying1,  JIN Yongjin2

  1. School of Statistics and Data Science, Beijing Wuzi University, Beijing 101149
  2. Center for Applied Statistics, Renmin University of China, Beijing 100872
  • Received: 2022-01-17 Accepted: 2022-12-12 Online: 2024-04-15 Published: 2024-06-15
  • Corresponding author: JIN Yongjin. E-mail: jinyongj_519@aliyun.com
  • Supported by:
    The National Social Science Foundation of China Western Project (21XTJ006); the Youth Research Fund Project of Beijing Wuzi University (2022XJQN34).

Abstract:

To address problems in estimation for industrial enterprises below the designated size, this paper extends the scope of existing weight adjustment methods in order to improve estimation accuracy. First, it addresses the unnatural extinction of catalog enterprises. Two cases are discussed separately: unnatural extinction of sample units and unnatural extinction of an entire sample stratum. Unnatural extinction is treated as a form of unit nonresponse, and a sample matching method is introduced that pairs each unnaturally extinct enterprise with the most "similar" normally reporting enterprise, so that the weights of the unnaturally extinct sample enterprises are transferred to normally reporting sample enterprises. Second, it addresses estimation bias for non-catalog enterprises. Weight adjustment approaches based on superpopulation model estimation and on inverse propensity score weighting are discussed; both linear and nonlinear superpopulation models are considered. For inverse propensity score weighting, the focus is on estimating the propensity score: weights are introduced into the iterative fitting procedure of the generalized boosted model (GBM) algorithm, yielding the proposed w-GBM algorithm. A combined estimation method is also proposed that takes a weighted combination of the parametric logistic regression estimate and the nonparametric w-GBM or GBM estimate. Empirical results show that these approaches are feasible.

Key words: catalog enterprises, non-catalog enterprises, weight adjustment, propensity score, superpopulation model, GBM algorithm
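
As a rough illustration of the combined propensity-score idea summarized in the abstract, the sketch below blends two sets of estimated propensity scores (one standing in for a logistic regression fit, one for a GBM or w-GBM fit) and uses the inverse of the blended score as the adjusted weight. The function names, the mixing parameter `alpha`, the convex-combination form, the clipping threshold `eps`, and the toy numbers are all assumptions made for illustration; the paper's actual combination rule and the w-GBM algorithm itself are not reproduced here.

```python
from typing import List, Sequence


def combine_scores(p_logit: Sequence[float],
                   p_gbm: Sequence[float],
                   alpha: float) -> List[float]:
    """Blend two propensity-score estimates with a convex combination.

    Hypothetical rule: alpha * p_logit + (1 - alpha) * p_gbm.
    """
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must lie in [0, 1]")
    if len(p_logit) != len(p_gbm):
        raise ValueError("score vectors must have equal length")
    return [alpha * a + (1.0 - alpha) * b for a, b in zip(p_logit, p_gbm)]


def ipw_total(y: Sequence[float],
              scores: Sequence[float],
              eps: float = 1e-6) -> float:
    """Inverse-propensity-weighted estimate of a population total.

    Each observed value is weighted by 1 / score; scores are clipped
    to [eps, 1] so that a near-zero score cannot blow up the weight.
    """
    total = 0.0
    for yi, pi in zip(y, scores):
        pi = min(max(pi, eps), 1.0)
        total += yi / pi
    return total


# Toy data (made up): three responding enterprises.
y = [10.0, 20.0, 30.0]
p_logit = [0.4, 0.2, 0.5]   # stand-in for logistic regression scores
p_gbm = [0.6, 0.3, 0.5]     # stand-in for GBM / w-GBM scores

scores = combine_scores(p_logit, p_gbm, alpha=0.5)
estimate = ipw_total(y, scores)
print(scores, estimate)
```

With these toy inputs the blended scores are 0.5, 0.25 and 0.5, so the weighted total is 10/0.5 + 20/0.25 + 30/0.5 = 160, up to floating-point rounding. In practice `alpha` would have to be chosen by some criterion, which this sketch leaves open.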

CLC Number: