计算机应用 ›› 2017, Vol. 37 ›› Issue (3): 854-859.DOI: 10.11772/j.issn.1001-9081.2017.03.854

• 数据科学与技术 • 上一篇    下一篇

基于时空模式的轨迹数据聚类算法

石陆魁1,2, 张延茹1, 张欣1   

  1. 1. 河北工业大学 计算机科学与软件学院, 天津 300401;
    2. 河北省大数据计算重点实验室(河北工业大学), 天津 300401
  • 收稿日期:2016-07-21 修回日期:2016-09-12 出版日期:2017-03-10 发布日期:2017-03-22
  • 通讯作者: 石陆魁
  • 作者简介:石陆魁(1974-),男,河北邯郸人,教授,博士,CCF会员,主要研究方向:机器学习、数据挖掘;张延茹(1990-),女,河北张家口人,硕士研究生,主要研究方向:机器学习、数据挖掘;张欣(1992-),女,河北衡水人,硕士研究生,主要研究方向:机器学习、数据挖掘。
  • 基金资助:
    天津市应用基础与前沿技术研究计划项目(14JCZDJC31600);河北省自然科学基金专项(F2016202144);河北省高等学校科学技术研究项目(ZD2014030)。

Trajectory data clustering algorithm based on spatio-temporal pattern

SHI Lukui1,2, ZHANG Yanru1, ZHANG Xin1   

  1. 1. School of Computer Science and Engineering, Hebei University of Technology, Tianjin 300401, China;
    2. Hebei Province Key Laboratory of Big Data Calculation(Hebei University of Technology), Tianjin 300401, China
  • Received:2016-07-21 Revised:2016-09-12 Online:2017-03-10 Published:2017-03-22
  • Supported by:
    This work was partially supported by Natural Science Foundation of Hebei Province (F2016202144), the Science and Technology Research Project of Colleges and Universities in Hebei Province(ZD2014030), the Tianjin Research Project of Application Foundation and Advanced Technology (14JCZD JC31600).

摘要: 针对轨迹聚类算法在相似性度量中多以空间特征为度量标准,缺少对时间特征的度量,提出了一种基于时空模式的轨迹数据聚类算法。该算法以划分再聚类框架为基础,首先利用曲线边缘检测方法提取轨迹特征点;然后根据轨迹特征点对轨迹进行子轨迹段划分;最后根据子轨迹段间时空相似性,采用基于密度的聚类算法进行聚类。实验结果表明,使用所提算法提取的轨迹特征点在保证特征点具有较好简约性的前提下较为准确地描述了轨迹结构,同时基于时空特征的相似性度量因同时兼顾了轨迹的空间与时间特征,得到了更好的聚类结果。

关键词: 时空模式, 轨迹数据, 曲线边缘检测, 相似性度量, 密度聚类

Abstract: Because the existing trajectory clustering algorithms in the similarity measurement usually used the spatial characteristics as the standards the characteristics lacking the consideration of temporal, a trajectory data clustering algorithm based on spatial-temporal pattern was proposed. The proposed algorithm was based on partition-and-group framework. Firstly, the trajectory feature points were extracted by using the curve edge detection method. Then the sub-trajectory segments were divided according to the trajectory feature points. Finally, the clustering algorithm based on density was used according to the spatio-temporal similarity between sub-trajectory segments. The experimental results show that the trajectory feature points extracted using the proposed algorithm are more accurate to describe the trajectory structure under the premise that the feature points have better simplicity. At the same time, the similarity measurement based on spatio-temporal feature obtains better clustering result by taking into account both spatial and temporal characteristics of trajectory.

Key words: spatio-temporal pattern, trajectory data, curve edge detection, similarity measurement, density based clustering

中图分类号: