计算机应用 ›› 2017, Vol. 37 ›› Issue (9): 2567-2571.DOI: 10.11772/j.issn.1001-9081.2017.09.2567

• 网络空间安全 • 上一篇    下一篇

PTDC:路网环境中感知隐私的轨迹数据采集技术

霍峥, 王卫红, 曹玉辉   

  1. 河北经贸大学 信息技术学院, 石家庄 050061
  • 收稿日期:2017-03-29 修回日期:2017-05-03 出版日期:2017-09-10 发布日期:2017-09-13
  • 通讯作者: 霍峥,huozheng123@gmail.com
  • 作者简介:霍峥(1982-),女,河北邯郸人,讲师,博士,CCF会员,主要研究方向:移动对象数据库、隐私保护;王卫红(1970-),女,河北涿州人,教授,博士,CCF会员,主要研究方向:移动协同计算、移动云计算;曹玉辉(1969-),男,河北正定人,教授,博士,CCF会员,主要研究方向:资源聚合、移动云计算。
  • 基金资助:
    国家自然科学基金资助项目(61502279);河北省自然科学基金资助项目(F2015207009);河北省高等学校青年拔尖人才计划项目(BJ2016019);出国留学择优资助项目(C2015003042)。

PTDC:privacy-aware trajectory data collection technology under road network constraint

HUO Zheng, WANG Weihong, CAO Yuhui   

  1. School of Information Technology, Hebei University of Economics and Business, Shijiazhuang Hebei 050061, China
  • Received:2017-03-29 Revised:2017-05-03 Online:2017-09-10 Published:2017-09-13
  • Supported by:
    This research was partially supported by the National Natural Science Foundation of China (61502279), the Natural Science Foundation of Hebei Province (F2015207009), Young Talents Program in Colleges and Universities of Hebei Province (BJ2016019), the Advanced Program for Study Abroad (C2015003042).

摘要: 针对路网环境中移动对象轨迹隐私泄露以及语义位置同质性攻击等问题,提出了一种路网环境中感知隐私的轨迹数据采集(PTDC)算法。首先,通过兴趣位置(POI)访问人次的信息墒计算路网中POI的敏感性;其次,根据顶点间敏感性和距离的混合差距,定义了θ-边权,并建立路网空间的图模型、定义了k-θ-D匿名模型以抵御语义位置同质性攻击;最后,以无向图的广度优先遍历为基础,设计了满足POI语义差异性的匿名算法,将用户的敏感采样位置用匿名区域取代,并衡量了PTDC算法处理后数据的可用性。通过实验对PTDC算法进行了验证,并和自由空间中的基于语义位置的隐私保护算法——YCWA进行了比对。理论上讲,YCWA算法的隐私保护度低于PTDC算法。实验表明,PTDC算法的信息丢失率平均在15%左右,空间范围查询误差平均在12%左右,略逊于YCWA算法;然而,PTDC算法的运行时间在5 s以内,远远优于YCWA算法,可满足实时在线数据采集的需求。

关键词: 路网, 隐私保护, 轨迹数据, 数据采集, 语义位置

Abstract: Since the problem of trajectory privacy violation and homogeneous semantic location attack of moving objects in road network environment is very serious, a Privacy-aware Trajectory Data Collection (PTDC) algorithm was proposed. Firstly, through visits' entropy of Points Of Interests (POI), the sensitivity of each POI was computed; secondly, based on the mixture distance of sensitivity and Euclidean distance, θ-weight was defined and a weighted model of vertices and edges in the network environment was established to reach a k-θ-D anonymity, which can resist the semantic location homogeneity attack; finally, based on the bread-first traversal algorithm of undirected graph, an anonymous algorithm was proposed to satisfy the semantic difference of POIs, so that user's sensitive sampling location was replaced by an anonymous region. Data utility caused by PTDC algorithm was theoretically evaluated. A set of experiments were implemented to test PTDC algorithm, and compare it with the privacy-preserving algorithm named YCWA (You Can Walk Alone) in free space. In theory, the privacy level of YCWA algorithm was lower than PTDC algorithm. The experimental results show that the PTDC algorithm has an average information loss of about 15%, and average range count query error rate of about 12%, which performs slightly worse than YCWA algorithm, while the running time of PTDC algorithm is less than 5 seconds, which is much better than YCWA algorithm. PTDC algorithm meets the needs of real-time online data collection.

Key words: road network, privacy-preserving, trajectory data, data collection, semantic location

中图分类号: