高级检索

农业转移人口就业质量的驱动因素识别基于SHAP算法的可解释性分析

Identifying the driving factors of employment quality for the rural transfer population: an interpretable analysis based on the SHAP algorithm

  • 摘要: 促进高质量就业是推进新型城镇化、实现城乡融合发展与共同富裕的关键。本文基于2014—2022年中国家庭追踪调查(CFPS)数据,构建了融合经济报酬、社会保障、工作稳定性、主观评价和工作负担的多维评价指标体系。通过系统比较多元线性回归、集成学习与深度学习的算法性能,采用XGBoost模型识别农业转移人口就业质量的核心驱动因素,并运用SHAP值进行可解释性分析。研究结果表明:1)集成学习模型在识别农业转移人口就业质量驱动因素上表现出预测精度与泛化能力的显著优势;2)受教育程度、产业结构、家庭其他成员收入、工会参与以及地区生产总值是影响就业质量的重要因素,相关变量对就业质量的提升存在阶段性与阈值特征;3)不同经济发展阶段中,“投资于物”与“投资于人”的驱动效能呈现动态演化;4)就业质量的核心驱动因素随务工年限呈现生命周期演化规律,且在地理空间上呈现差异格局,市场化程度较高、产业体系更完善的区域,人力资本定价机制更为成熟,而亲缘地缘与行业网络则对就业质量发挥补充作用。基于上述结论,提出构建精准扶持体系、分阶段分层次优化政策供给、坚持投资于物和投资于人紧密结合等政策建议。

     

    Abstract: Promoting high-quality employment is crucial for advancing new urbanization and achieving urban–rural integration and common prosperity. Based on data from the China Family Panel Studies (CFPS) from 2014 to 2022, this study constructs a multidimensional evaluation index system that integrates economic compensation, social security, job stability, subjective evaluation, and workload. By systematically comparing the performance of multiple linear regression, ensemble learning, and deep learning algorithms, the XGBoost model is employed to identify the core driving factors of employment quality among the rural transfer population, and SHAP values are applied to enhance interpretability. The results show that: 1) ensemble learning models exhibit clear advantages in predictive accuracy and generalization performance in identifying the driving factors of employment quality among the rural transfer population; 2) education level, industrial structure, income of other household members, labor union participation, and regional GDP are key determinants of employment quality, and their effects display stage-specific and threshold characteristics; 3) across different stages of economic development, the effects of investing in physical capital and investing in human capital evolve dynamically; 4) the core driving factors of employment quality follow a life-cycle evolution pattern with years of work experience and present spatial heterogeneity, with regions characterized by higher levels of marketization and more developed industrial systems exhibiting more mature human capital pricing mechanisms, while kinship, geographical, and industrial networks play a complementary role. Based on these findings, this study proposes policy recommendations, including the establishment of a targeted support system, the optimization of policy supply across different stages and levels, and the coordinated promotion of investing in physical capital and investing in human capital.

     

/

返回文章
返回