计算机应用 ›› 2019, Vol. 39 ›› Issue (10): 2955-2959.DOI: 10.11772/j.issn.1001-9081.2019030573

• 数据科学与技术 • 上一篇    下一篇

基于随机性分析的虚假趋势时间序列判别

李建勋, 马美玲, 郭建华, 严峻   

  1. 西安理工大学 经济与管理学院, 西安 710054
  • 收稿日期:2019-04-09 修回日期:2019-05-18 出版日期:2019-10-10 发布日期:2019-06-03
  • 通讯作者: 李建勋
  • 作者简介:李建勋(1977-),男,陕西西安人,教授,博士,主要研究方向:决策支持系统、数据异常识别;马美玲(1995-),女,陕西宝鸡人,硕士研究生,主要研究方向:决策支持系统、数据质量分析;郭建华(1972-),男,陕西宝鸡人,讲师,硕士,主要研究方向:水利信息化、数据挖掘;严峻(1973-),男,上海人,讲师,硕士,主要研究方向:信息管理、数据融合。
  • 基金资助:
    陕西省教育厅专项科研计划项目(18JK0577)。

False trend time series detection based on randomness analysis

LI Jianxun, MA Meiling, GUO Jianhua, YAN Jun   

  1. School of Economics and Management, Xi'an University of Technology, Xi'an Shaanxi 710054, China
  • Received:2019-04-09 Revised:2019-05-18 Online:2019-10-10 Published:2019-06-03
  • Supported by:
    This work is partially supported by the Special Scientific research Program of Education Department of Shaanxi Province (18JK0577).

摘要: 针对符合一定数据模式或规律的虚假数据识别问题,提出一种基于随机性分析的虚假趋势时间序列判别方法。该方法在分析时间序列组成的基础上,首先探索虚假趋势时间序列的简单伪造和复杂伪造方式,并将其分解为虚假趋势和虚假随机两部分;然后通过基函数逼近进行时间序列虚假趋势部分的提取,采用随机性理论开展虚假随机部分的分析;最终借助单比特频数和块内频数对虚假随机部分是否具备随机性进行检测,为具有一定趋势特征的虚假时间序列的判别提供了一个解决方案。实验结果表明:该方法能够有效地分解虚假时间序列和提取虚假趋势部分,实现简单伪造数据和复杂伪造数据的判别,支持对通过观测手段或者检测设备所获取的数值型数据的真伪分析,进一步提高了虚假数据可判别范围,平均判别正确率可达74.7%。

关键词: 虚假数据, 时间序列, 趋势性, 随机性分析, 基函数

Abstract: Focusing on the detection problem of false data that conform to a certain pattern or rule, a false trend time series detection method based on randomness analysis was proposed. Based on the analysis of time series composition, firstly the simple forgery method and complex forgery method of false trend time series were explored, and decomposed into two parts:false trendness and false randomness. Then the false trend of time series was extracted by the approximation of base function, the false random of time series was analyzed with the randomness theory. Finally, monobit frequency and frequency within a block were adopted to test whether the false random part has randomness, which established a detection method of false time series with a certain trend feature. The simulation results show that proposed method can decompose the false time series and extract the false trend part effectively, meanwhile realize the detection of simple and complex forged data. It also supports the authenticity analysis for the numerical data obtained by means of observation or monitoring equipment, which improves the discrimination range of false data with average detection accuracy of 74.7%.

Key words: false data, time series, trend, randomness analysis, base function

中图分类号: