计算机应用 ›› 2019, Vol. 39 ›› Issue (11): 3370-3375.DOI: 10.11772/j.issn.1001-9081.2019040670

• 应用前沿、交叉与综合 • 上一篇    下一篇

基于融合时空数据的车辆加油行为多视图深度异常检测框架

丁景全1,2, 马博1,2,3, 李晓1,2,3   

  1. 1. 中国科学院 新疆理化技术研究所, 乌鲁木齐 830011;
    2. 中国科学院大学, 北京 100049;
    3. 新疆民族语音语言信息处理实验室, 乌鲁木齐 830011
  • 收稿日期:2019-04-22 修回日期:2019-06-08 出版日期:2019-11-10 发布日期:2019-08-21
  • 通讯作者: 马博
  • 作者简介:丁景全(1973-),男,内蒙古赤峰人,副研究员,博士研究生,主要研究方向:大数据治理与分析;马博(1984-),男,辽宁鞍山人,副研究员,博士,CCF会员,主要研究方向:大数据分析、知识图谱;李晓(1957-),男,四川邛崃人,研究员,博士生导师,硕士,CCF会员,主要研究方向:多语种信息处理、信息管理系统。
  • 基金资助:
    新疆维吾尔自治区自然科学基金资助项目(2019D01A92)。

Multi-view deep anomaly detection framework for vehicle refueling behaviors based on spatio-temporal data fusion

DING Jingquan1,2, MA Bo1,2,3, LI Xiao1,2,3   

  1. 1. The Xinjiang Technical Institute of Physics and Chemistry, Chinese Academy of Sciences, Urumqi Xinjiang 830011, China;
    2. University of Chinese Academy of Sciences, Beijing 100049, China;
    3. Xinjiang Laboratory of Minority Speech and Language Information Processing, Urumqi Xinjiang 830011, China
  • Received:2019-04-22 Revised:2019-06-08 Online:2019-11-10 Published:2019-08-21
  • Supported by:
    This work is partially supported by the Natural Science Foundation of Xinjiang (2019D01A92).

摘要: 车辆加油时空数据多源异构、关系复杂,现有成熟的异常检测方法难以对时空离散的加油活动数据进行分析,因此提出基于融合时空数据的车辆加油行为多视图深度异常检测框架。首先基于统一概念模型(UCM)对静态信息和动态活动数据进行关联融合管理,然后从空间视图、时间视图和语义视图角度对时空数据进行编码和转换,最后基于三种视图构建深度时空异常分析检测框架。车辆加油时空数据集上的实验结果表明,多种异常检测方法在融合时空数据上均可取得更低均方根误差(RMSE),平均降低10.73%,所提方法比现有主流方法中结果最好的长短时记忆网络(LSTM)的RMSE降低19.36%。在信用卡欺诈公开数据集上的实验结果表明,所提方法较之逻辑回归模型,马修斯系数(MCC)提高了32.78%。以上实验验证了所提方法的有效性。

关键词: 时空数据, 车辆加油, 数据融合, 异常检测, 深度学习

Abstract: The multi-source heterogeneity and complicated relationships of spatio-temporal data of vehicle refueling bring great challenges to existing anomaly detection approaches. Aiming at the problem, a multi-view deep anomaly detection framework for vehicle refueling based on spatio-temporal data fusion was proposed. Firstly, the static information and dynamic activity data were correlated, fused and managed based on Unified Conceptual Model (UCM). Secondly, the spatio-temporal data were encoded and converted according to spatial view, temporal view and semantic view. Finally, a deep anomaly detection framework was constructed based on the above multi-views. The experimental results on vehicle refueling spatio-temporal dataset show that all anomaly detection approaches tested can achieve an average decrease in the Root Mean Square Error (RMSE) by 10.73%, and the proposed multi-view spatio-temporal anomaly detection framework can obtain a decrease in the RMSE by 19.36% compared to LSTM (Long Short-Term Memory), which gets the best results in the-state-of-the-art methods. And the Matthews Correlation Coefficient (MCC) of the proposed method on the credit card fraud dataset is increased by 32.78% compared with that of Logistic Regression model. All experimental results demonstrate the effectiveness of the proposed anomaly detection framework.

Key words: spatio-temporal data, vehicle refueling, data fusion, anomaly detection, deep learning

中图分类号: