在线咨询
中国工业与应用数学学会会刊
主管:中华人民共和国教育部
主办:西安交通大学
ISSN 1005-3085  CN 61-1269/O1

工程数学学报 ›› 2023, Vol. 40 ›› Issue (3): 341-354.doi: 10.3969/j.issn.1005-3085.2023.03.001

• •    下一篇

异源在线网络话题早发现及演化特征研究

徐小艳1,   吕  伟1,   张贝贝2,   周帅鹏3,    魏  嵬2   

  1. 1. 西安石油大学理学院,西安  710065
    2. 西安理工大学计算机科学与工程学院,西安  710048
    3. 西安麦仓数据服务有限公司,西安  710061
  • 收稿日期:2020-08-21 接受日期:2022-11-18 出版日期:2023-06-15 发布日期:2023-08-15
  • 通讯作者: 吕 伟 E-mail: 276715147@qq.com
  • 基金资助:
    国家重点研发计划 (2018YFB0203900).

Early Detection and Evolution Law Discovery of New Emerging Topic of Heterogeneous On-line Social Networks

XU Xiaoyan1,  LV Wei1,  ZHANG Beibei2,  ZHOU Shuaipeng3,  WEI Wei2   

  1. 1. School of Science, Xi'an Shiyou University, Xi'an 710065
    2. School of Computer and Engineering, Xi'an University of Technology, Xi'an 710048
    3. Company of Aamaze Data, Xi'an 710061
  • Received:2020-08-21 Accepted:2022-11-18 Online:2023-06-15 Published:2023-08-15
  • Contact: W. Lv. E-mail address: 276715147@qq.com
  • Supported by:
    The National Key Research and Development Program (2018YFB0203900).

摘要:

从海量新闻标题、微博等短文本异源数据及早、准确地发现舆情话题及其演化规律,以期为政府和企业监管舆情态势提供参考。以大规模异源在线社会网络数据为研究源,以文本关键词共现为建模依据点构建时变话题网络模型,将异源在线网络话题早发现及演化追踪问题转变为时变话题网络的动态社团发现与社团演化规律研究问题。进一步,提出以Louvain算法为迭代核心且以模块度增益和网络局部增量为算法量化对象的时变网络动态Louvain社团发现方法,通过与静态Louvain社团发现算法在大量计算机仿真和真实异源社会媒体数据下实验对比分析,证明所建话题网络和动态Louvain社团算法可在较少时空代价下快速、有效地实现对异源在线社会网络隐匿话题的早发现和演化规律追踪。

关键词: 异源在线社会网络, 动态社团算法, Louvain算法, 模块度增益

Abstract: The purpose of this study is to accurately identify the new emerging topic immediately and to identify its evolution law from all kinds of heterogeneous on-line social network data which are composed of short texts like news titles and micro-blogs, the results of which will provide valuable decision support for government officers and company administrators. Firstly, all kinds of heterogeneous on-line social network data are acquired and modeled as the time-varying network by using the co-occurrence of short texts' keywords, from which the topic early detection problem could be changed into the problem of dynamic community detection on the time-varying topic network; Secondly, we propose the dynamic community detection method with static Louvain algorithm embedded and with modularity gain and local network variation as quantification values. The proposed method  yields preferable results under both large amount of computer-generated data and real heterogeneous online social network data, community detection and propagation identification results under computer-generated and real heterogeneous online social network data validates the algorithms' efficiency and feasibility.

Key words: heterogeneous online social network, dynamic community detection, Louvain algo-rithm, modularity gain

中图分类号: