Journal of Computer Applications ›› 2024, Vol. 44 ›› Issue (11): 3442-3448. DOI: 10.11772/j.issn.1001-9081.2023111684

• Data science and technology •

Multivariate long-term series forecasting model based on decomposition and frequency domain feature extraction

Yiyang FAN1,2, Yang ZHANG1,2,3, Shang ZENG1,2, Yu ZENG1,2, Maoli FU1,2,3

  1. Chengdu Institute of Computer Application, Chinese Academy of Sciences, Chengdu, Sichuan 610213, China
  2. School of Computer Science and Technology, University of Chinese Academy of Sciences, Beijing 100049, China
  3. Shenzhen CBPM-KEXIN Banking Technology Company Limited, Shenzhen, Guangdong 518206, China
  • Received: 2023-12-08 Revised: 2024-03-08 Accepted: 2024-03-12 Online: 2024-03-22 Published: 2024-11-10
  • Contact: Yang ZHANG
  • About the authors: FAN Yiyang, born in 1998, M.S. candidate. His research interests include time series analysis and data mining.
    ZENG Shang, born in 1995, Ph.D. candidate. His research interests include big data analysis and data mining.
    ZENG Yu, born in 1999, M.S. candidate. His research interests include time series analysis and data mining.
    FU Maoli, born in 1988, Ph.D. candidate, engineer. His research interests include artificial intelligence, image processing, and pattern recognition.
  • Supported by:
    Sichuan Province Science Program(2023YFG0113)

Multivariate long time series forecasting model based on decomposition and frequency-domain feature extraction

FAN Yiyang (范艺扬)1,2, ZHANG Yang (张洋)1,2,3, ZENG Shang (曾尚)1,2, ZENG Yu (曾渝)1,2, FU Maoli (付茂栗)1,2,3

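  1. Chengdu Institute of Computer Application, Chinese Academy of Sciences, Chengdu 610213
  2. School of Computer Science and Technology, University of Chinese Academy of Sciences, Beijing 100049
  3. Shenzhen CBPM-KEXIN Banking Technology Company Limited, Shenzhen, Guangdong 518206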
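  • Corresponding author: ZHANG Yang (张洋)
  • About the authors: FAN Yiyang (范艺扬), born in 1998, male, native of Chengdu, Sichuan, M.S. candidate. His research interests include time series analysis and data mining.
    ZENG Shang (曾尚), born in 1995, male, native of Jingmen, Hubei, Ph.D. candidate. His research interests include big data analysis and data mining.
    ZENG Yu (曾渝), born in 1999, male, native of Chongqing, M.S. candidate. His research interests include time series analysis and data mining.
    FU Maoli (付茂栗), born in 1988, male, native of Suining, Sichuan, engineer, Ph.D. candidate. His research interests include artificial intelligence, image processing, and pattern recognition.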
  • Funding:
    Sichuan Province Science and Technology Program (2023YFG0113)

Abstract:

Existing Transformer-based Multivariate Long-Term Series Forecasting (MLTSF) models extract features mainly from the time domain, where it is difficult to find reliable dependencies directly among the dispersed time points of a long series. To address these problems, a new model based on decomposition and frequency domain feature extraction was proposed. Firstly, a frequency domain method for decomposing a series into a periodic term and a trend term was proposed, which reduced the time complexity of the decomposition process. Then, with trend features extracted by the periodic term-trend term decomposition, a Transformer network performing frequency domain feature extraction based on the Gabor transform was used to capture periodic dependencies, which enhanced the stability and robustness of forecasting. Experimental results on five benchmark datasets show that, compared with current state-of-the-art methods, the proposed model reduces the Mean Squared Error (MSE) in MLTSF by an average of 7.6%, with a maximum reduction of 18.9%, which demonstrates that the proposed model effectively improves forecasting accuracy.

Key words: multivariate long-term series forecasting, frequency domain feature extraction, Gabor transform, Transformer, time series, deep learning
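
The two building blocks named in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the abstract does not specify the decomposition rule or the Gabor parameters, so the low-pass FFT mask, the function names `frequency_decompose` and `gabor_transform`, and the parameters `cutoff`, `window_len`, `hop` and `sigma` are all assumptions made here for illustration. The sketch only shows why a frequency domain split costs O(L log L) through the FFT, and that a Gabor transform is a short-time Fourier transform with a Gaussian window.

```python
import numpy as np


def frequency_decompose(x, cutoff=3):
    """Split a series into trend and periodic terms with a low-pass FFT mask.

    Assumption: the `cutoff` lowest-frequency bins form the trend term and
    the remainder is the periodic term. Works along axis 0, so `x` may be
    a 1-D series or a (time, variables) array.
    """
    x = np.asarray(x, dtype=float)
    spectrum = np.fft.rfft(x, axis=0)
    low = np.zeros_like(spectrum)
    low[:cutoff] = spectrum[:cutoff]           # keep only low frequencies
    trend = np.fft.irfft(low, n=x.shape[0], axis=0)
    periodic = x - trend                       # residual carries the cycles
    return trend, periodic


def gabor_transform(x, window_len=32, hop=8, sigma=6.0):
    """Discrete Gabor transform of a 1-D series (Gaussian-windowed STFT).

    Returns complex coefficients of shape (num_frames, window_len // 2 + 1).
    """
    x = np.asarray(x, dtype=float)
    n = np.arange(window_len) - (window_len - 1) / 2.0
    window = np.exp(-0.5 * (n / sigma) ** 2)   # Gaussian analysis window
    starts = range(0, len(x) - window_len + 1, hop)
    frames = np.stack([x[s:s + window_len] * window for s in starts])
    return np.fft.rfft(frames, axis=1)


# Toy series: a linear drift plus a cycle with a period of 16 steps.
t = np.arange(256)
series = 0.01 * t + np.sin(2 * np.pi * t / 16)
trend, periodic = frequency_decompose(series, cutoff=3)
coeffs = gabor_transform(periodic)
```

In a forecasting model of the kind described, the time-frequency coefficients of the periodic term would presumably feed the Transformer, while the trend term is modeled separately; how the paper combines the two is not stated in the abstract.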

Abstract:

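Existing Transformer-based Multivariate Long-Term Series Forecasting (MLTSF) models extract features mainly from the time domain and struggle to find reliable dependencies directly among the scattered time points of a long series. To address this problem, a new model based on decomposition and frequency domain feature extraction is proposed. First, a frequency domain method for periodic term-trend term decomposition is proposed to reduce the time complexity of the decomposition process. Second, building on the trend features extracted by the periodic term-trend term decomposition, a Transformer network that performs frequency domain feature extraction based on the Gabor transform is used to capture periodic dependencies, improving the stability and robustness of forecasting. Experimental results on five benchmark datasets show that, compared with existing advanced methods, the proposed model reduces the Mean Squared Error (MSE) on MLTSF by 7.6% on average and by up to 18.9%, effectively improving forecasting accuracy.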

Key words: multivariate long-term series forecasting, frequency domain feature extraction, Gabor transform, Transformer, time series, deep learning

CLC Number: