计算机应用 ›› 2018, Vol. 38 ›› Issue (11): 3326-3331.DOI: 10.11772/j.issn.1001-9081.2018051023

• 应用前沿、交叉与综合 • 上一篇    下一篇

基于空间邻近搜索的移动轨迹相对时间模式挖掘方法

张海涛1, 周欢2, 张国楠3   

  1. 1. 南京邮电大学 地理与生物信息学院, 南京 210023;
    2. 南京邮电大学 通信与信息工程学院, 南京 210003;
    3. 南京邮电大学 计算机学院、软件学院、网络空间安全学院, 南京 210023
  • 收稿日期:2018-05-17 修回日期:2018-06-15 出版日期:2018-11-10 发布日期:2018-11-10
  • 通讯作者: 周欢
  • 作者简介:张海涛(1978-),男,河南商水人,副教授,博士,主要研究方向:移动GIS理论方法与关键技术、时空数据挖掘、LBS隐私保护;周欢(1993-),男,江苏连云港人,硕士研究生,主要研究方向:轨迹数据挖掘、隐私保护;张国楠(1997-),男,青海西宁人,主要研究方向:轨迹数据挖掘、隐私保护。
  • 基金资助:
    国家自然科学基金资助项目(41201465);江苏省自然科学基金资助项目(BK2012439);江苏省社会发展项目(BE2016774)。

Mining method of trajectory interval pattern based on spatial proximity searching

ZHANG Haitao1, ZHOU Huan2, ZHANG Guonan3   

  1. 1. School of Geographic and Biologic Information, Nanjing University of Posts and Telecommunications, Nanjing Jiangsu 210023, China;
    2. School of Telecommunications & Information Engineering, Nanjing University of Posts and Telecommunications, Nanjing Jiangsu 210003, China;
    3. School of Computer Science, Nanjing University of Posts and Telecommunications, Nanjing Jiangsu 210003, China
  • Received:2018-05-17 Revised:2018-06-15 Online:2018-11-10 Published:2018-11-10
  • Supported by:
    This work is partially supported by the National Natural Science Foundation of China (41201465), the Jiangsu Provincial Natural Science Foundation (BK2012439), the Jiangsu Social Development Project (BE2016774).

摘要: 针对传统移动轨迹模式挖掘方法挖掘速度慢、占用最大内存大的问题,提出一种基于空间邻近搜索的移动轨迹相对时间模式挖掘方法。该方法包括5个阶段:1)对移动轨迹数据进行时空划分,并基于移动轨迹数据与时空格的匹配得到移动轨迹数据对应的时空格序列。2)扫描所有的时空格序列数据得到空间网格集合,并通过空间网格与时空格序列的包含运算得到所有的频繁空间网格。3)频繁空间网格转变为长度为1的频繁相对时间模式。4)基于空间邻近搜索的方式进行模式增长,得到以频繁空间网格为单元的候选相对时间模式,并通过相对时间模式与时空格序列的匹配运算,计算相对时间模式的支持度。5)基于设定的支持度阈值,得到所有频繁的相对时间模式。实验结果表明:所提方法由于采用了基于空间邻近搜索的方式进行模式扩展,大幅减小候选相对时间模式的搜索范围。与传统方法相比,所提方法具有挖掘速度快、占用最大内存少的优点。另外,方法在运行时间上具有更好的稳定性和可扩展性,而在占用最大内存上的稳定性与可扩展性与传统方法基本相近。该方法有助于移动轨迹模式挖掘方法提升挖掘速度、减少占用最大内存。

关键词: 移动轨迹数据, 空间邻近搜索, 时空格, 支持度, 相对时间模式

Abstract: Concerning the problem that traditional trajectory pattern mining methods have the problems of slow mining and large maximum amount of memory, a method of mining trajectory interval patterns based on spatial proximity searching was proposed. The implementation of the proposed method consists of five phases:1) Space-time discretization is performed on the trajectories, and space-time cell sequences corresponding to trajectories are achieved. 2) All the space-time cell sequences are scanned to get all no-duplication spatial cells, and all frequent spatial cells are obtained by the inclusion operation of the spatial cells and the cell sequences. 3) Frequent spatial cells are transformed into frequent interval patterns of length one. 4) Candidate interval patterns with the frequent spatial cells as units are generated by spatial proximity searching, and the support value of the candidate patterns are calculated by matching the patterns and the space-time cell sequences. 5) Based on the set support threshold, all frequent interval patterns are obtained. The experimental results show that the proposed method has the advantages of faster mining and less maximum amount of memory than traditional methods. Furthermore, in terms of running time, the proposed method has better stability and scalability performance than traditional methods. This method is helpful to the trajectory pattern mining methods to increase the mining speed and reduce the maximum amount of memory.

Key words: trajectory data, spatial proximity searching, space-time cell, support value, interval pattern

中图分类号: