《计算机应用》唯一官方网站

• •    下一篇

时频域多尺度交叉注意力融合的时间序列分类方法

王美1,苏雪松2,刘佳2,殷若南1,黄珊2   

  1. 1. 中国石油大学(华东)
    2. 胜利油田技术检测中心
  • 收稿日期:2023-06-07 修回日期:2023-09-08 发布日期:2023-09-27 出版日期:2023-09-27
  • 通讯作者: 殷若南

Time series classification method based on multi-scale cross-attention fusion in time-frequency domain

  • Received:2023-06-07 Revised:2023-09-08 Online:2023-09-27 Published:2023-09-27

摘要: 摘 要: 针对时间序列子序列间的潜在信息交互不足导致分类准确率低的问题,提出时频域多尺度交叉注意力融合的时间序列分类方法TFFormer(Time-Frequency Transformer)。首先,将原始时间序列的时频域谱分别划分为等长子序列,经线性投影后加入位置信息解决时间序列的点值耦合问题;然后通过改进的多头自注意力模块使模型关注更重要的序列特征,解决长时间序列的前后依赖问题;最后,构造多尺度交叉注意力模块增强时间序列在时域和频域之间的信息交互,使模型进一步挖掘序列的频域信息。实验结果显示,相比于全卷积网络(Fully Convolutional Networks, FCN),提出的方法在Trace、StarLightCurves和UWaveGestureLibraryAll数据集上的分类准确率分别为100%、97.6%和92.3%,优于FCN的99.7%、96.7%和90.9%,证明了通过增强时间序列时域和频域间的信息交互,可以提高模型收敛速度和分类精度。

关键词: 关键词: 时间序列, 注意力机制, 位置编码, 深度神经网络, 多尺度融合

Abstract: To address the problem of low classification accuracy caused by insufficient potential information interaction between time series subsequences, a time series classification method based on multi-scale cross-attention fusion in time-frequency domain called TFFormer (Time-Frequency Transformer) was proposed. First, time and frequency spectrums of the original series were divided into subsequences, and the point-value coupling problem was solved by adding positional embedding after linear projection. Then, through the improved multi-head self-attention mechanism, the long-term sequence dependency problem was solved, and the model is made to focus on more important features. Finally, a multi-scale cross-attention module was proposed to enhance the interaction between the time domain and frequency domain, so that the model can further mine the frequency information of the sequence. The experimental results show that compared with Fully Convolutional Networks (FCN), the classification accuracy of the proposed method on the Trace, StarLightCurves and UWaveGestureLibraryAll datasets reaches 100%, 97.6% and 92.3%, respectively, which is better than 99.7%, 96.7% and 90.9% of FCN. It is proved that by enhancing the information interaction between time domain and frequency domain of the time series, the model convergence speed and classification accuracy can be improved.

Key words: time series, attention mechanism, positional embedding, deep neural network, multi-scale fusion

中图分类号: