计算机应用 ›› 2016, Vol. 36 ›› Issue (1): 33-38.DOI: 10.11772/j.issn.1001-9081.2016.01.0033

• 第32届中国数据库学术会议(NDBC 2015) • 上一篇    下一篇

基于深度表示模型的移动模式挖掘

陈勐, 禹晓辉, 刘洋   

  1. 山东大学 计算机科学与技术学院, 济南 250101
  • 收稿日期:2015-07-10 修回日期:2015-08-04 出版日期:2016-01-10 发布日期:2016-01-09
  • 通讯作者: 刘洋(1977-),女,山东济南人,副教授,博士,CCF会员,主要研究方向:情感分析、文本挖掘
  • 作者简介:陈勐(1990-),男,山东滕州人,博士研究生,主要研究方向:轨迹挖掘、城市计算;禹晓辉(1977-),男,山东济南人,教授,博士生导师,博士,CCF会员,主要研究方向:大数据管理、数据挖掘。
  • 基金资助:
    国家自然科学基金资助项目(61272092);山东省自然科学基金资助项目(ZR2012FZ004);山东省科技发展计划基金资助项目(2014GGE27178);国家973计划项目(2015CB352500);泰山学者计划基金资助项目。

Mining mobility patterns based on deep representation model

CHEN Meng, YU Xiaohui, LIU Yang   

  1. College of Computer Science and Technology, Shandong University, Jinan Shandong 250101, China
  • Received:2015-07-10 Revised:2015-08-04 Online:2016-01-10 Published:2016-01-09
  • Supported by:
    This work is partially supported by the National Natural Science Foundation of China (61272092), the Natural Science Foundation of Shandong Province (ZR2012FZ004), the Science and Technology Development Program of Shandong Province (2014GGE27178), the National Basic Research Program (973 Program) of China (2015CB352500) and the Research Fund for the Taishan Scholar Project of Shandong Province.

摘要: 针对时空轨迹中位置顺序和时间对于理解用户移动模式的重要性,提出了一种新的用户轨迹深度表示模型。该模型考虑到时空轨迹的特点:1)不同的位置顺序表示不同的移动模式;2)轨迹有周期性并且在不同的时间段有变化。首先,将两个连续的位置点组合成位置序列;然后,将位置序列和对应的时间块组合成时间位置序列,作为描述轨迹特征的基本单位;最后,利用深度表示模型为每个序列训练特征向量。为了验证深度表示模型的有效性,设计实验将时间位置序列向量应用到用户移动模式发现中,并利用Gowalla签到数据集进行了实验评测。实验结果显示提出的模型能够发现"上班""购物"等明确的模式,而Word2Vec很难发现有意义的移动模式。

关键词: 时空轨迹挖掘, 用户移动模式, 深度表示模型, 时间位置序列向量, 哈夫曼编码

Abstract: Focusing on the fact that the order of locations and time play a pivotal role in understanding user mobility patterns for spatio-temporal trajectories, a novel deep representation model for trajectories was proposed. The model considered the characteristics of spatio-temporal trajectories: 1) different orders of locations indicate different user mobility patterns; 2) trajectories tend to be cyclical and change over time. First, two time-ordered locations were combined in location sequence; second, the sequence and its corresponding time bin were combined in the temporal location sequence, which was the basic unit of describing the features of a trajectory; finally, the deep representation model was utilized to train the feature vector for each sequence. To verify the effectiveness of the deep representation model, experiments were designed to apply the temporal location sequence vectors to user mobility patterns mining, and empirical studies were performed on a real check-in dataset of Gowalla. The experimental results confirm that the proposed method is able to discover explicit movement patterns (e.g., working, shopping) and Word2Vec is difficult to discover the valuable patterns.

Key words: spatio-temporal trajectory mining, user mobility pattern, deep representation model, temporal location sequence vector, Huffman coding

中图分类号: