Journal of Computer Applications (计算机应用), 2012, Vol. 32, Issue 05: 1343-1346.

• Artificial Intelligence •

A dynamic topic-model correction method based on feedback stories

郑燕1,2,鲁燃1,2,赵爱华1,2   

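  1. Shandong Provincial Key Laboratory for Distributed Computer Software Novel Technology, Jinan 250014
    2. School of Information Science and Engineering, Shandong Normal University, Jinan 250014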
  • Received: 2011-10-28; Revised: 2011-12-16; Online: 2012-05-01; Published: 2012-05-01
  • Corresponding author: ZHENG Yan
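  • About the authors: ZHENG Yan (born 1989), female, from Tai'an, Shandong, master's student; research interests: network information security, topic tracking. LU Ran (born 1972), male, from Heze, Shandong, associate professor; research interests: network information security, network models, computer networks. ZHAO Ai-hua (born 1987), female, from Weifang, Shandong, master's student; research interests: network information security, topic detection and tracking.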
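  • Supported by:

    National Natural Science Foundation of China (60873247); Shandong Province High-Tech Independent Innovation Special Project (2008ZZ28); Natural Science Foundation of Shandong Province (ZR2009GZ007); Jinan Young Science and Technology Star Program (20080201)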

Dynamic topic model amending method based on feedback stories

ZHENG Yan1,2,LU Ran1,2,ZHAO Ai-hua1,2   

  1. School of Information Science and Engineering, Shandong Normal University, Jinan Shandong 250014, China
    2. Shandong Provincial Key Laboratory for Distributed Computer Software Novel Technology, Jinan Shandong 250014, China
  • Received:2011-10-28 Revised:2011-12-16 Online:2012-05-01 Published:2012-05-01
  • Contact: ZHENG Yan

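Abstract (from the Chinese): During topic tracking, the topic model is inaccurate because few on-topic stories are given initially and because topics evolve dynamically. To address this problem, a topic correction model is built from feedback stories collected with a dynamic threshold, so that the topic model is corrected dynamically. In addition, because named entities distinguish different topics more effectively, the weight of relevant named entities is increased when the topic model is corrected, which yields a more accurate topic representation model. Experimental results show that the method effectively avoids topic drift and lowers the miss rate and false-alarm rate in topic tracking.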

Keywords (from the Chinese): topic tracking, topic model, dynamic threshold, named entity, feedback stories

Abstract: In topic tracking, only a few on-topic stories are available initially and the topic evolves dynamically, so the topic model cannot represent the topic accurately. To address this problem, an amended topic model is built from feedback stories collected with a dynamic threshold, so that the topic model is amended dynamically. Because named entities differentiate topics more effectively, the weight of relevant named entities is also increased when the topic model is amended, giving a more accurate topic representation. The experimental results indicate that this method effectively avoids topic drift and reduces both the miss rate and the false-alarm rate in topic tracking.
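
The abstract gives no formulas, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes a bag-of-words topic vector, cosine similarity, a fixed multiplicative boost for named-entity terms, and a feedback threshold that moves toward the similarity of each accepted story. All names and parameter values (`story_vector`, `track`, `entity_boost`, `alpha`, `beta`, the threshold defaults) are hypothetical.

```python
import math
from collections import Counter


def cosine(a, b):
    """Cosine similarity between two sparse term-weight dicts."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def story_vector(tokens, entities, entity_boost=2.0):
    """Term-frequency vector; named-entity terms get a larger weight.

    The boost factor is an illustrative assumption, not a value from the paper.
    """
    counts = Counter(tokens)
    return {t: c * (entity_boost if t in entities else 1.0)
            for t, c in counts.items()}


def track(topic, stories, entities,
          track_threshold=0.3, feedback_threshold=0.5,
          alpha=0.3, beta=0.5):
    """Track stories against a topic and amend the topic from feedback.

    A story is reported as on-topic if its similarity reaches
    track_threshold. Only stories that also reach the (stricter)
    feedback threshold are used to amend the topic model. The update
    rules below are assumptions made for illustration.
    """
    decisions = []
    for tokens in stories:
        vec = story_vector(tokens, entities)
        sim = cosine(topic, vec)
        decisions.append(sim >= track_threshold)
        if sim >= feedback_threshold:
            # Amend the topic model: blend in the feedback story.
            terms = set(topic) | set(vec)
            topic = {t: (1 - alpha) * topic.get(t, 0.0) + alpha * vec.get(t, 0.0)
                     for t in terms}
            # Dynamic threshold: move toward the accepted story's similarity.
            feedback_threshold = (1 - beta) * feedback_threshold + beta * sim
    return decisions, topic, feedback_threshold
```

Keeping the feedback threshold stricter than the tracking threshold is one way to limit drift: a borderline story can still be reported as on-topic without being allowed to change the model.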

Key words: topic tracking, topic model, dynamic threshold, named entity, feedback stories
