Journal of Computer Applications ›› 2019, Vol. 39 ›› Issue (3): 924-929.DOI: 10.11772/j.issn.1001-9081.2018081681

Abnormal time series data detection of gas station by Seq2Seq model based on bidirectional long short-term memory

TAO Tao1,2,3, ZHOU Xi1,3, MA Bo1,3, ZHAO Fan1,3   

  1. Xinjiang Technical Institute of Physics and Chemistry, Chinese Academy of Sciences, Urumqi Xinjiang 830011, China;
    2. University of Chinese Academy of Sciences, Beijing 100049, China;
    3. Xinjiang Laboratory of Minority Speech and Language Information Processing, Xinjiang Technical Institute of Physics and Chemistry, Urumqi Xinjiang 830011, China
  • Received:2018-08-14 Revised:2018-09-13 Online:2019-03-11 Published:2019-03-10
  • Supported by:

    This work is partially supported by the Program of Introducing High-Level Talents of Xinjiang (Y639401201) and the West Light Foundation of the Chinese Academy of Sciences (2016-QNXZ-A-3).


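  • Corresponding author: ZHOU Xi
  • About the authors: TAO Tao (born 1994), male, from Bijie, Guizhou, M.S. candidate; main research interests: big data analysis, data mining. ZHOU Xi (born 1978), male, from Shuangfeng, Hunan, research fellow, Ph.D., CCF member; main research interests: Internet of Things, big data analysis. MA Bo (born 1984), male, from Anshan, Liaoning, associate research fellow, Ph.D., CCF member; main research interests: data analysis and knowledge discovery, machine learning. ZHAO Fan (born 1980), male, from Jiexiu, Shanxi, associate research fellow, Ph.D. candidate, CCF member; main research interests: information security, big data analysis.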



Time series data from gas stations carry multi-dimensional information about fueling behavior, but the data for any specific gas station are sparse. Existing anomaly detection algorithms are poorly suited to such data: they flag many pseudo-outliers and miss many real anomalies. To address these problems, an anomaly detection method based on deep learning was proposed to detect vehicles with abnormal fueling behavior. Firstly, features were extracted from the data collected at the gas station using an autoencoder. Then, a Seq2Seq deep learning model with embedded Bidirectional Long Short-Term Memory (Bi-LSTM) was used to predict fueling behavior. Finally, the outlier threshold was defined by comparing the predicted values with the original values. Experiments on a fueling dataset and a credit card fraud dataset verify the effectiveness of the proposed method. Compared with existing methods, the proposed method reduces the Root Mean Squared Error (RMSE) by 21.1% on the fueling dataset and improves anomaly detection accuracy by 1.4% on the credit card fraud dataset. Therefore, the proposed method can be applied to detect vehicles with abnormal fueling behavior, improving the management and operational efficiency of gas stations.
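
The final step of the method, comparing predicted and original values to set an outlier threshold, can be illustrated with a minimal sketch. The abstract does not state the exact threshold rule, so the rule used here (mean plus k standard deviations of the absolute prediction error, calibrated on data assumed to be normal) is an assumption for illustration only, as are the function names `rmse`, `fit_threshold`, and `flag_outliers` and all sample values. The predictions would in practice come from the Seq2Seq Bi-LSTM model; here they are hard-coded.

```python
import math
import statistics


def rmse(actual, predicted):
    """Root Mean Squared Error between two equal-length sequences."""
    if len(actual) != len(predicted):
        raise ValueError("sequences must have the same length")
    squared = [(a - p) ** 2 for a, p in zip(actual, predicted)]
    return math.sqrt(sum(squared) / len(squared))


def fit_threshold(calib_actual, calib_predicted, k=3.0):
    """Assumed rule (not taken from the paper): mean + k * std of the
    absolute prediction errors on calibration data presumed normal."""
    residuals = [abs(a - p) for a, p in zip(calib_actual, calib_predicted)]
    return statistics.mean(residuals) + k * statistics.pstdev(residuals)


def flag_outliers(actual, predicted, threshold):
    """Return indices whose absolute prediction error exceeds the threshold."""
    return [
        i
        for i, (a, p) in enumerate(zip(actual, predicted))
        if abs(a - p) > threshold
    ]


# Hypothetical fueling volumes and hard-coded stand-in predictions.
calib_actual = [40.0, 42.0, 41.0, 39.0, 40.0]
calib_predicted = [41.0, 41.0, 40.0, 40.0, 42.0]
threshold = fit_threshold(calib_actual, calib_predicted, k=3.0)

new_actual = [40.0, 120.0, 41.0]
new_predicted = [41.0, 42.0, 40.0]
outliers = flag_outliers(new_actual, new_predicted, threshold)
print("threshold:", threshold, "outlier indices:", outliers)
```

Calibrating the threshold on separate data, rather than on the series being screened, keeps a single large anomaly from inflating the standard deviation and masking itself. The same `rmse` helper corresponds to the prediction-error metric reported for the fueling dataset.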

Key words: gas station time-series data, deep learning, Seq2Seq, Bidirectional Long Short-Term Memory (Bi-LSTM), outlier detection




