在线咨询
中国工业与应用数学学会会刊
主管:中华人民共和国教育部
主办:西安交通大学
ISSN 1005-3085  CN 61-1269/O1

工程数学学报 ›› 2026, Vol. 43 ›› Issue (1): 128-142.doi: 10.3969/j.issn.1005-3085.2026.01.008cstr: 32411.14.cjem.CN61-1269/O1.2026.01.008

• • 上一篇    下一篇

基于秘密共享的高效纵向联邦逻辑回归

毛正雄1,  李  辉1,  黄祖源1,  杨传旭1,  赵  鹏2,  赵方圆2,  杨树森3   

  1. 1. 云南电网有限责任公司信息中心,昆明 650011
    2. 西安交通大学计算机科学与技术学院,西安 710049
    3. 西安交通大学数学与统计学院,西安 710049
  • 收稿日期:2023-04-07 接受日期:2023-05-18 出版日期:2026-02-15 发布日期:2026-04-15
  • 基金资助:
    云南电网科技项目 (YNKJXM20210141).

Efficient Vertical Federated Logistic Regression Based on Secret Sharing

MAO Zhengxiong1,  LI Hui1,  HUANG Zuyuan1,  YANG Chuanxu1,  ZHAO Peng2,  ZHAO Fangyuan2,  YANG Shusen3   

  1. 1. Network Information Center, Yunnan Power Grid Co., Ltd., Kunming 650011
    2. School of Computer Science and Technology, Xi'an Jiaotong University, Xi'an 710049
    3. School of Mathematics and Statistics, Xi'an Jiaotong University, Xi'an 710049
  • Received:2023-04-07 Accepted:2023-05-18 Online:2026-02-15 Published:2026-04-15
  • Supported by:
    The Science and Technology Project of Yunnan Power Grid (YNKJXM20210141).

摘要:

逻辑回归是一种广泛应用于现实分类任务的机器学习模型。随着数据孤岛问题的涌现,如何针对多参与主体非贯通数据联合构建逻辑回归模型成为一个关键问题。纵向联邦学习可实现数据明文不暴露前提下多主体跨样本特征的联合机器学习模型训练。然而,现有纵向联邦逻辑回归方法主要基于同态加密技术,具有计算和通信开销大的短板。针对逻辑回归模型,研究安全高效的纵向联邦学习算法,目标实现数据隐私保护和模型学习效率的较优权衡。具体地,基于秘密共享提出了一种面向逻辑回归模型的高效率纵向联邦学习算法(Vertical Federated Logistic Regression algorithm based on Secret Sharing, VFLR-SS),通过将跨域分析过程中的中间数据随机分解为多个秘密份额进行交互从而实现隐私保护,同时避免了同态加密引发的计算和通信开销。对VFLR-SS的安全性进行了分析,并基于真实数据对算法进行了验证。实验结果表明VFLR-SS可实现与集中式逻辑回归算法可比的效用和性能,大幅降低了传统同态加密方法中的计算及通信开销。

关键词: 隐私保护, 联邦学习, 逻辑回归, 秘密共享, 同态加密

Abstract:

Logistic regression is a widely used machine learning model for real-world classification tasks. With the emerging ``isolated data islands" problem, it has become a key issue to collaboratively construct a logistic regression model based on non-penetrating data distributed in different data owners (multi-owner distribution). Vertical federated learning can realize the collaborative model training under multi-owner cross-feature data distribution while protecting the data plain-text from exposure. However, existing vertical federated logistic regression methods are mainly based on homomorphic encryption schemes, which would incur tremendous computation and communication costs. Therefore, this paper studies the secure and efficient vertical federated learning algorithm for the logistic regression model, aiming to achieve a better trade-off between data privacy protection and model learning efficiency. In particular, this paper proposes an efficient vertical federated learning algorithm VFLR-SS based on secret sharing. By randomly decomposing the intermediate data in the modeling process into multiple secret shares,  VFLR-SS can achieve privacy protection while avoiding the computational and communication overhead caused by homomorphic encryption algorithms. In this paper, the security analysis of VFLR-SS is provided, and the effectiveness and efficiency of the algori-thm are verified based on real data. The experimental results demonstrate that VFLR-SS can achieve comparable effectiveness and efficiency to centralized logistic regression algorithms, and significantly reduce the computation and communication costs in traditional homomorphic encryption schemes.

Key words: privacy protection, vertical federated learning, logistic regression, secret sharing, homomorphic encryption

中图分类号: