计算机应用 ›› 2015, Vol. 35 ›› Issue (8): 2178-2183.DOI: 10.11772/j.issn.1001-9081.2015.08.2178

• 信息安全 • 上一篇    下一篇

社会网络子集(θ, k)-匿名方法

张晓琳, 王萍, 郭彦磊, 王静宇   

  1. 内蒙古科技大学 信息工程学院, 内蒙古 包头 014010
  • 收稿日期:2015-03-19 修回日期:2015-04-16 出版日期:2015-08-10 发布日期:2015-08-14
  • 通讯作者: 王萍(1990-),女,吉林九台人,硕士研究生,主要研究方向:社会网络隐私保护,wpwygogo@126.com
  • 作者简介:张晓琳(1966-),女,内蒙古包头人,教授,博士,CCF会员,主要研究方向:数据库;郭彦磊(1990-),男,河北邢台人,硕士研究生,主要研究方向:社会网络隐私保护; 王静宇(1976-),男,河南开封人,副教授,博士,主要研究方向:云计算、信息安全。
  • 基金资助:

    国家自然科学基金资助项目(61163015)。

(θ, k)-anonymous method in the subsets of social networks

ZHANG Xiaolin, WANG Ping, GUO Yanlei, WANG Jingyu   

  1. School of Information Engineering, Inner Mongolia University of Science and Technology, Baotou Nei Mongol 014010, China
  • Received:2015-03-19 Revised:2015-04-16 Online:2015-08-10 Published:2015-08-14

摘要:

针对目前社会网络邻域隐私保护相关研究并没有考虑对子集的保护,并且邻域子集中的特定属性分布情况也会造成个体隐私泄露这一问题,提出了一种新的(θ, k)-匿名模型。该模型移除社会网络中需要被保护的节点邻域子集标签后,基于k-同构思想,利用邻域组件编码技术和节点精炼方法处理候选集中的节点及其邻域子集信息,完成同构操作,其中考虑特定敏感属性分布问题。最终,该模型满足邻域子集中的每个节点都存在至少k-1个节点与其邻域同构,同时要求每个节点的属性分布在邻域子集内和在整个子集的差值不大于θ。实验结果表明,(θ, k)-匿名模型能够降低匿名成本并且最大化数据效用。

关键词: 社会网络, 邻域子集, 属性分布, k-同构, (θ, k)-匿名模型

Abstract:

Focusing on the issue that the current related research about social network do not consider subsets for neighborhood's privacy preserving, and the specific properties of neighborhood subsets also lead individual privacy disclosure, a new (θ, k)-anonymous model was proposed. According to the k-isomorphism ideology, the model removed labels of neighborhood subsets which needed to be protected in social network, made use of neighborhood component coding technique and the method of node refining to process nodes in candidate set and their neighborhood information, then completed the operation of specific subsets isomorphism with considering the sensitive attribute distribution. Ultimately, the model satisfies that each node in neighborhood subset meets neighborhood isomorphism with at least k-1 nodes, as well the model requires the difference between the attribute distribution of each node in the neighborhood subset and the throughout subsets is not bigger than θ. The experimental results show that, (θ, k)-anonymous model can reduce the anonymization cost and maximize the utility of the data.

Key words: social network, subset of neighbourhood, distribution of attribute, k-isomorphism, (θ, k)-anonymous model

中图分类号: