计算机应用 ›› 2017, Vol. 37 ›› Issue (5): 1282-1286.DOI: 10.11772/j.issn.1001-9081.2017.05.1282

• 先进计算 • 上一篇    下一篇

基于MapReduce的轨迹压缩并行化方法

吴家皋1,2, 夏轩1, 刘林峰1,2   

  1. 1. 南京邮电大学 计算机学院, 南京 210003;
    2. 计算机网络和信息集成教育部重点实验室(东南大学), 南京 211189
  • 收稿日期:2016-11-07 修回日期:2016-12-14 出版日期:2017-05-10 发布日期:2017-05-16
  • 通讯作者: 吴家皋
  • 作者简介:吴家皋(1969-),男,江苏苏州人,副教授,博士,CCF会员,主要研究方向:移动网络、云计算;夏轩(1989-),男,江苏宿迁人,硕士研究生,主要研究方向:云计算、移动模型;刘林峰(1980-),男,江苏丹阳人,副教授,博士,CCF会员,主要研究方向:水下传感器网络。
  • 基金资助:
    国家自然科学基金资助项目(61373139,41571389,71301081);东南大学计算机网络和信息集成教育部重点实验室开放基金资助项目(K93-9-2014-05B);南京邮电大学科研基金资助项目(NY214063)。

Parallel trajectory compression method based on MapReduce

WU Jiagao1,2, XIA Xuan1, LIU Linfeng1,2   

  1. 1. School of Computer, Nanjing University of Posts and Telecommunications, Nanjing Jiangsu 210003, China;
    2. Key Laboratory of Computer Network and Information Integration of Ministry of Education(Southeast University), Nanjing Jiangsu 211189, China
  • Received:2016-11-07 Revised:2016-12-14 Online:2017-05-10 Published:2017-05-16
  • Supported by:
    This work is partially supported by the National Natural Science Foundation of China (61373139,41571389,71301081), the Open Research Fund from Key Laboratory of Computer Network and Information Integration (Southeast University), Ministry of Education, China (K93-9-2014-05B), the Scientific Research Foundation of Nanjing University of Posts and Telecommunications (NY214063).

摘要: 带有全球定位系统(GPS)功能设备的增多,产生大量的时空轨迹数据,给数据的存储、传输和处理带来了沉重的负担。为了减轻这种负担,各种轨迹压缩方法也随之产生。提出了一种基于MapReduce的并行化轨迹压缩方法,针对并行化导致的分段点前后轨迹的相关性被破坏的问题,首先,采用两种分段点相互交错的划分方法划分轨迹;然后,将分段轨迹分配到多个节点上进行并行化压缩;最后,对压缩结果进行匹配合并。性能测试分析结果表明,所提出的并行化轨迹压缩方法能够大幅提高压缩效率,而且能完全消除因分段导致分段点前后相关性被破坏带来的误差。

关键词: 轨迹压缩, 分布式存储, MapReduce, Hadoop, 全球定位系统轨迹

Abstract: The massive spatiotemporal trajectory data is a heavy burden to store, transmit and process, which is caused by the increase Global Positioning System (GPS)-enable devices. In order to reduce the burden, many kinds of trajectory compression methods were generated. A parallel trajectory compression method based on MapReduce was proposed in this paper. In order to solve the destructive problem of correlation nearby segmentation points caused by the parallelization, in this method, the trajectory was divided by two segmentation methods in which the segmentation points were interleaving firstly. Then, the trajectory segments were assigned to different nodes for parallel compression. Lastly, the compression results were matched and merged. The performance test and analysis results show that the proposed method can not only increase the compression efficiency significantly, but also eliminate the error which is caused by the destructive problem of correlation.

Key words: trajectory compression, distributed storage, MapReduce, Hadoop, Global Positioning System (GPS) trajectory

中图分类号: