Journal of Computer Applications, 2012, Vol. 32, Issue (11): 3030-3033. DOI: 10.3724/SP.J.1087.2012.03030


Document sensitive information retrieval based on interest ontology

CHEN Hua-cheng,DU Xue-hui,CHEN Xing-yuan,XIA Chun-tao   

  1. Institute of Electronic Technology, Information Engineering University, Zhengzhou Henan 450004, China
  • Received: 2012-05-18 Revised: 2012-07-05 Online: 2012-11-12 Published: 2012-11-01
  • Contact: CHEN Hua-cheng
  • Supported by:
    National Basic Research Program of China (973 Program) (2011CB311801); Henan Science and Technology Innovation Talents Scheme (114200510001)

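Method for detecting sensitive information in documents based on interest ontology

CHEN Hua-cheng, DU Xue-hui, CHEN Xing-yuan, XIA Chun-tao

  1. Institute of Electronic Technology, Information Engineering University, Zhengzhou 450004
  • Corresponding author: CHEN Hua-cheng
  • About the authors: CHEN Hua-cheng (born 1986), male, from Zhangzhou, Fujian, M.S. candidate, research interests: network security and information retrieval; DU Xue-hui (born 1968), female, from Xinxiang, Henan, professor, Ph.D., research interest: network security; CHEN Xing-yuan (born 1963), male, from Wuwei, Anhui, professor, Ph.D., research interest: network security; XIA Chun-tao (born 1979), male, from Xuchang, Henan, lecturer, M.S., research interest: network security.
  • Funding:
    National Basic Research Program of China (973 Program) (2011CB311801); Henan Science and Technology Innovation Talents Scheme (114200510001)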

Abstract: With the development of computer technology and the Internet, more and more office hosts have been connected to the Internet, and the threat of sensitive information leakage has become serious. It is therefore necessary to detect whether documents contain sensitive information. To address the low precision and low recall of traditional query expansion retrieval methods, this paper builds an interest ontology describing the sensitive information of interest to the user, proposes a concept similarity query expansion algorithm based on that interest ontology, and describes an experimental case that verifies the feasibility of the algorithm. The experimental results show that the proposed algorithm improves on the precision and recall of the traditional methods.

Key words: sensitive information, retrieval, interest ontology, concept similarity, query expansion

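Abstract (translated from the Chinese): With the rapid development of computer technology and the Internet, more and more office hosts are connected to the Internet and the risk of sensitive information leakage is growing, which makes the detection of sensitive information in documents especially necessary. To address the low precision and recall of traditional query expansion detection methods, an interest ontology representing the monitor's interest in sensitive information is constructed, a concept similarity query expansion algorithm based on the interest ontology is proposed, and the feasibility of the algorithm is verified. Experiments show that the algorithm effectively improves the recall and precision of sensitive information detection in documents.

Key words (translated from the Chinese): sensitive information, detection, interest ontology, concept similarity, query expansion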
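
The abstract names a concept similarity query expansion algorithm over an interest ontology but does not give its formula. As a rough illustration of the general idea only, the sketch below expands a query with ontology concepts whose similarity to a query term passes a threshold. It substitutes the well-known Wu-Palmer path-based measure for the paper's own similarity definition; the toy ontology, the threshold of 0.7, and the names `PARENT`, `similarity`, and `expand_query` are all assumptions made for this example.

```python
# Hypothetical toy interest ontology: child concept -> parent concept (is-a).
# The concepts are invented for illustration and are not from the paper.
PARENT = {
    "military": "sensitive",
    "finance": "sensitive",
    "weapon": "military",
    "troop": "military",
    "missile": "weapon",
    "radar": "weapon",
    "budget": "finance",
}


def ancestors(concept):
    """Return the path from a concept up to the root, concept first."""
    path = [concept]
    while path[-1] in PARENT:
        path.append(PARENT[path[-1]])
    return path


def depth(concept):
    """Depth counted in nodes, so the root has depth 1."""
    return len(ancestors(concept))


def similarity(a, b):
    """Wu-Palmer similarity (a stand-in, not the paper's measure):
    2 * depth(lcs) / (depth(a) + depth(b))."""
    path_b = set(ancestors(b))
    # The first ancestor of a that also lies above b is the deepest
    # common ancestor; the shared root guarantees one exists.
    lcs = next(c for c in ancestors(a) if c in path_b)
    return 2 * depth(lcs) / (depth(a) + depth(b))


def expand_query(terms, threshold=0.7):
    """Add every ontology concept similar enough to some query term."""
    concepts = set(PARENT) | set(PARENT.values())
    expanded = set(terms)
    for term in terms:
        if term not in concepts:
            continue  # terms outside the ontology are kept but not expanded
        for other in concepts:
            if similarity(term, other) >= threshold:
                expanded.add(other)
    return expanded
```

With this toy data, `expand_query(["missile"])` adds "weapon" and "radar", while more distant concepts such as "troop" and "budget" fall below the threshold. Raising the threshold favors precision, and lowering it favors recall.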
