融合衍生特征的时间序列事件分类方法

doi:10.11772/j.issn.1001-9081.2024020202

《计算机应用》唯一官方网站 ›› 2025, Vol. 45 ›› Issue (2): 428-435.DOI: 10.11772/j.issn.1001-9081.2024020202

• 数据科学与技术 • 上一篇

融合衍生特征的时间序列事件分类方法

张翰林, 王俊陆, 宋宝燕()

辽宁大学信息学院，沈阳 110036

收稿日期:2024-02-29 修回日期:2024-05-14 接受日期:2024-05-17 发布日期:2024-06-04 出版日期:2025-02-10
通讯作者: 宋宝燕
作者简介:张翰林（1993—），男，辽宁沈阳人，博士研究生，CCF会员，主要研究方向：时序图查询、机器学习
王俊陆（1988—），男，辽宁沈阳人，讲师，博士，CCF会员，主要研究方向：深度学习、区块链；
基金资助:
国家重点研发计划项目(2021YFF0901004);辽宁省应用基础研究计划项目(2022JH2/101300250);辽宁省教育厅高校基本科研项目（理工类）面上项目（揭榜挂帅服务地方项目）(JYTMS20230761)

Time series event classification method fused with derived features

Hanlin ZHANG, Junlu WANG, Baoyan SONG()

School of Information，Liaoning University，Shenyang Liaoning 110036，China

Received:2024-02-29 Revised:2024-05-14 Accepted:2024-05-17 Online:2024-06-04 Published:2025-02-10
Contact: Baoyan SONG
About author:ZHANG Hanlin， born in 1993， Ph. D. candidate. His research interests include temporal graph query， machine learning.
WANG Junlu， born in 1988， Ph. D.， lecturer. His research interests include deep learning， blockchain.
Supported by:
National Key Research and Development Program of China(2021YFF0901004);Liaoning Province Applied Basic Research Program(2022JH2/101300250);General Project of Educational Department of Liaoning Province’s Higher Education Institution Basic Research Project(Engineering┫ （Leading the Way to Serve Local Projects） ┣JYTMS20230761)

摘要/Abstract

摘要：

时间序列分类是时间序列分析的基础。然而，现有的时间序列分类方法对应的形态特征并不能作为分类依据，且通道间的特征通过图上的单一权重刻画不够准确，导致分类精度不高。因此，提出一种融合衍生特征的时间序列事件分类方法（TSEC-FDF）。首先，在时间序列上构建时间序列事件集合后，根据每个时间序列事件构建突变图、协同图、启发图，以减少噪声对高维特征的干扰；其次，融合多图的特征作为衍生特征，并抽取时间序列事件的多个时间级别的特征；最后，提出一种融合衍生特征的多图卷积分类模型级联时间序列和图特征作为时间序列事件的高维特征。实验结果表明，与TF-C（Time-Frequency Consistency）和BiLSTM+隐马尔可夫模型（Bi-directional Long Short-Term Memory-Hidden Markov Model， BL-HMM）方法相比，TSEC-FDF在4个真实数据集上的准确率、精确率、查全率、F1值、AUROC（Area Under the Receiver Operating Characteristic curve）以及AUPRC（Area Under the Precision versus Recall Curve）至少提升了3.2%、4.7%、7.8%、6.3%、0.9%和2.2%。

关键词: 转换图, 衍生特征, 图卷积神经网络, 多图融合, 时间序列分类, 图构建

Abstract:

Time series classification is the foundation of time series analysis. However， the morphological features corresponding to the existing time series classification methods cannot serve as the basis for classification， and the features between channels are not characterized accurately by the single weight on the graph， resulting in low classification accuracy. Therefore， a Time Series Event Classification method Fused with Derived Features （TSEC-FDF） was proposed. Firstly， after constructing a time series event set on the time series， the mutation graphs， collaborative graphs， and heuristic graphs were constructed on the basis of each time series event to reduce noise interference to high-dimensional features. Secondly， the features of multiple graphs were fused and treated as derived features， and features of time series events at multiple time levels were extracted. Finally， a multi-graph convolutional classification model fusing derived features was proposed， where time series and graph features were cascaded as high-dimensional features of time series events. Experimental results show that TSEC-FDF improves the accuracy， precision， recall， F1 score， AUROC（Area Under the Receiver Operating Characteristic） curve， and AUPRC（Area Under the Precision versus Recall Curve） on 4 real datasets by 3.2%， 4.7%， 7.8%， 6.3%， 0.9%， and 2.2%， at least， compared to TF-C （Time-Frequency Consistency） and Bi-directional Long Short-Term Memory-Hidden Markov Model （BL-HMM） methods.

Key words: transfer graph, derived feature, graph convolutional neural network, multi-graph fusion, time series classification, graph construction

中图分类号:

TP183

张翰林, 王俊陆, 宋宝燕. 融合衍生特征的时间序列事件分类方法[J]. 计算机应用, 2025, 45(2): 428-435.

Hanlin ZHANG, Junlu WANG, Baoyan SONG. Time series event classification method fused with derived features[J]. Journal of Computer Applications, 2025, 45(2): 428-435.

图/表 8

图1 由于开采活动产生时间序列事件的过程

Fig. 1 Process of generating time series events due to mining activities

图2 事件对应时间序列

Fig. 2 Events corresponding to time series

图3 时间序列事件集合及其特征抽取过程

Fig. 3 Time series event set and its feature extraction process

图4 多图卷积分类模型结构

Fig. 4 Structure of multi-graph convolutional classification model

表1 实验数据集信息

Tab. 1 Experimental dataset information

数据集	采用频率/Hz	样本数	类别数	实例长度/s	数据集大小/MB
SleepEEG	100	60 000	5	30	1 484.8
HAR	50	10 299	6	11	46.2
BD	64 000	1 280	4	4	10 639.3
Earthquake	10~40	1 156	4	120	116.5

表2 不同数据集上5个模型的准确率等指标的对比结果 (%)

Tab. 2 Comparison results of accuracy and other indicators of five models on different datasets

模型	数据集	准确率	精确率	查全率	F1值	AUROC	AUPRC
TSEC-FDF	SleepEEG	98.01 $±$ 0.01	99.01 $±$ 0.01	99.00 $±$ 0.01	99.10 $±$ 0.01	99.08 $±$ 0.01	99.19 $±$ 0.04
	BD	84.05 $±$ 0.02	85.06 $±$ 0.01	89.78 $±$ 0.02	87.60 $±$ 0.01	89.70 $±$ 0.02	89.50 $±$ 0.05
	HAR	79.64 $±$ 0.01	75.43 $±$ 0.03	69.88 $±$ 0.02	72.19 $±$ 0.01	79.59 $±$ 0.02	79.86 $±$ 0.03
	Earthquake	75.64 $±$ 0.05	73.21 $±$ 0.04	76.23 $±$ 0.06	77.06 $±$ 0.03	78.01 $±$ 0.03	78.64 $±$ 0.05
KNN	SleepEEG	85.25 $±$ 0.20	86.39 $±$ 0.10	64.31 $±$ 0.30	67.91 $±$ 0.10	64.34 $±$ 0.10	62.79 $±$ 0.20
	BD	67.66 $±$ 0.20	65.00 $±$ 0.30	68.21 $±$ 0.10	64.42 $±$ 0.02	81.90 $±$ 0.02	52.31 $±$ 0.30
	HAR	75.34 $±$ 0.20	74.39 $±$ 0.60	76.38 $±$ 0.30	73.28 $±$ 0.20	74.09 $±$ 0.20	77.28 $±$ 0.40
	Earthquake	29.94 $±$ 0.10	31.01 $±$ 0.50	27.51 $±$ 0.30	28.33 $±$ 0.50	45.29 $±$ 0.10	50.17 $±$ 0.10
TF-C	SleepEEG	94.95 $±$ 0.80	94.56 $±$ 0.10	89.08 $±$ 0.20	91.49 $±$ 0.20	98.11 $±$ 0.02	97.03 $±$ 0.05
	BD	78.24 $±$ 0.30	79.82 $±$ 0.02	80.11 $±$ 0.50	79.91 $±$ 0.40	90.52 $±$ 0.02	78.61 $±$ 0.08
	HAR	75.29 $±$ 0.40	71.08 $±$ 0.07	67.23 $±$ 0.80	68.22 $±$ 0.50	70.31 $±$ 0.30	76.19 $±$ 0.02
	Earthquake	68.34 $±$ 0.40	65.36 $±$ 0.05	61.24 $±$ 0.70	63.31 $±$ 0.30	68.77 $±$ 0.20	69.13 $±$ 0.03
BL-HMM	SleepEEG	90.74 $±$ 0.03	92.39 $±$ 0.03	91.77 $±$ 0.03	93.21 $±$ 0.04	95.34 $±$ 0.04	92.19 $±$ 0.08
	BD	61.29 $±$ 0.01	58.18 $±$ 0.02	59.87 $±$ 0.01	62.31 $±$ 0.02	65.98 $±$ 0.01	71.39 $±$ 0.01
	HAR	61.30 $±$ 0.05	58.94 $±$ 0.04	57.61 $±$ 0.07	62.04 $±$ 0.05	65.49 $±$ 0.03	66.99 $±$ 0.05
	Earthquake	61.57 $±$ 0.03	62.39 $±$ 0.02	59.71 $±$ 0.04	58.66 $±$ 0.04	60.39 $±$ 0.01	62.60 $±$ 0.04
LB-SimTSC	SleepEEG	67.13 $±$ 0.10	69.06 $±$ 0.30	64.88 $±$ 0.05	72.87 $±$ 0.20	69.35 $±$ 0.07	74.56 $±$ 0.40
	BD	56.29 $±$ 0.50	49.36 $±$ 0.40	51.71 $±$ 0.70	56.73 $±$ 0.50	54.00 $±$ 0.20	57.09 $±$ 0.80
	HAR	51.60 $±$ 0.30	52.77 $±$ 0.10	48.19 $±$ 0.60	50.61 $±$ 0.20	51.45 $±$ 0.60	53.16 $±$ 0.40
	Earthquake	41.30 $±$ 0.10	40.23 $±$ 0.50	45.76 $±$ 0.10	48.73 $±$ 0.50	48.61 $±$ 0.30	49.77 $±$ 0.40

表2 不同数据集上5个模型的准确率等指标的对比结果 (%)

Tab. 2 Comparison results of accuracy and other indicators of five models on different datasets

模型	数据集	准确率	精确率	查全率	F1值	AUROC	AUPRC
TSEC-FDF	SleepEEG	98.01 $±$ 0.01	99.01 $±$ 0.01	99.00 $±$ 0.01	99.10 $±$ 0.01	99.08 $±$ 0.01	99.19 $±$ 0.04
	BD	84.05 $±$ 0.02	85.06 $±$ 0.01	89.78 $±$ 0.02	87.60 $±$ 0.01	89.70 $±$ 0.02	89.50 $±$ 0.05
	HAR	79.64 $±$ 0.01	75.43 $±$ 0.03	69.88 $±$ 0.02	72.19 $±$ 0.01	79.59 $±$ 0.02	79.86 $±$ 0.03
	Earthquake	75.64 $±$ 0.05	73.21 $±$ 0.04	76.23 $±$ 0.06	77.06 $±$ 0.03	78.01 $±$ 0.03	78.64 $±$ 0.05
KNN	SleepEEG	85.25 $±$ 0.20	86.39 $±$ 0.10	64.31 $±$ 0.30	67.91 $±$ 0.10	64.34 $±$ 0.10	62.79 $±$ 0.20
	BD	67.66 $±$ 0.20	65.00 $±$ 0.30	68.21 $±$ 0.10	64.42 $±$ 0.02	81.90 $±$ 0.02	52.31 $±$ 0.30
	HAR	75.34 $±$ 0.20	74.39 $±$ 0.60	76.38 $±$ 0.30	73.28 $±$ 0.20	74.09 $±$ 0.20	77.28 $±$ 0.40
	Earthquake	29.94 $±$ 0.10	31.01 $±$ 0.50	27.51 $±$ 0.30	28.33 $±$ 0.50	45.29 $±$ 0.10	50.17 $±$ 0.10
TF-C	SleepEEG	94.95 $±$ 0.80	94.56 $±$ 0.10	89.08 $±$ 0.20	91.49 $±$ 0.20	98.11 $±$ 0.02	97.03 $±$ 0.05
	BD	78.24 $±$ 0.30	79.82 $±$ 0.02	80.11 $±$ 0.50	79.91 $±$ 0.40	90.52 $±$ 0.02	78.61 $±$ 0.08
	HAR	75.29 $±$ 0.40	71.08 $±$ 0.07	67.23 $±$ 0.80	68.22 $±$ 0.50	70.31 $±$ 0.30	76.19 $±$ 0.02
	Earthquake	68.34 $±$ 0.40	65.36 $±$ 0.05	61.24 $±$ 0.70	63.31 $±$ 0.30	68.77 $±$ 0.20	69.13 $±$ 0.03
BL-HMM	SleepEEG	90.74 $±$ 0.03	92.39 $±$ 0.03	91.77 $±$ 0.03	93.21 $±$ 0.04	95.34 $±$ 0.04	92.19 $±$ 0.08
	BD	61.29 $±$ 0.01	58.18 $±$ 0.02	59.87 $±$ 0.01	62.31 $±$ 0.02	65.98 $±$ 0.01	71.39 $±$ 0.01
	HAR	61.30 $±$ 0.05	58.94 $±$ 0.04	57.61 $±$ 0.07	62.04 $±$ 0.05	65.49 $±$ 0.03	66.99 $±$ 0.05
	Earthquake	61.57 $±$ 0.03	62.39 $±$ 0.02	59.71 $±$ 0.04	58.66 $±$ 0.04	60.39 $±$ 0.01	62.60 $±$ 0.04
LB-SimTSC	SleepEEG	67.13 $±$ 0.10	69.06 $±$ 0.30	64.88 $±$ 0.05	72.87 $±$ 0.20	69.35 $±$ 0.07	74.56 $±$ 0.40
	BD	56.29 $±$ 0.50	49.36 $±$ 0.40	51.71 $±$ 0.70	56.73 $±$ 0.50	54.00 $±$ 0.20	57.09 $±$ 0.80
	HAR	51.60 $±$ 0.30	52.77 $±$ 0.10	48.19 $±$ 0.60	50.61 $±$ 0.20	51.45 $±$ 0.60	53.16 $±$ 0.40
	Earthquake	41.30 $±$ 0.10	40.23 $±$ 0.50	45.76 $±$ 0.10	48.73 $±$ 0.50	48.61 $±$ 0.30	49.77 $±$ 0.40

图5 5个模型的准确率随样本数变化的对比

Fig. 5 Comparison of accuracies of five models varying with number of samples

表3 不同模型间在不同实验指标下的T测试分析（α<0.05时显著）

Tab. 3 T-test analysis among different models on different experimental indicators（α<0.05 means remarkable）

模型	对比模型	准确率	精确率	查全率	F1值	AUROC	AUPRC
TSEC-FDF	KNN	0.002	0.002	0.001	0.001	0.001	0.000
	TF-C	0.453	0.534	0.294	0.464	0.434	0.443
	BL-HMM	0.011	0.019	0.031	0.036	0.014	0.026
	LB-SimTSC	0.000	0.000	0.000	0.000	0.000	0.000
KNN	TSEC-FDF	0.002	0.002	0.001	0.001	0.001	0.000
	TF-C	0.014	0.013	0.012	0.008	0.009	0.001
	BL-HMM	0.528	0.433	0.152	0.178	0.346	0.053
	LB-SimTSC	0.085	0.145	0.303	0.545	0.063	0.341
TF-C	TSEC-FDF	0.453	0.534	0.294	0.464	0.434	0.443
	KNN	0.014	0.013	0.012	0.008	0.009	0.001
	BL-HMM	0.062	0.076	0.251	0.159	0.082	0.132
	LB-SimTSC	0.000	0.000	0.001	0.001	0.000	0.000
BL-HMM	TSEC-FDF	0.011	0.019	0.031	0.036	0.014	0.026
	KNN	0.528	0.433	0.152	0.178	0.346	0.053
	TF-C	0.062	0.076	0.251	0.159	0.082	0.132
	LB-SimTSC	0.021	0.028	0.016	0.054	0.007	0.005
LB-SimTSC	TSEC-FDF	0.000	0.000	0.000	0.000	0.000	0.000
	KNN	0.085	0.145	0.303	0.545	0.063	0.341
	TF-C	0.000	0.000	0.001	0.001	0.000	0.000
	BL-HMM	0.021	0.028	0.016	0.054	0.007	0.005

参考文献 23

1	KAUR P， JOSHI J C， AGGARWAL P. A Multi-Model Decision Support System （MM-DSS） for avalanche hazard prediction over North-West Himalaya［J］. Natural Hazards， 2022， 110（1）： 563-585.
2	ELDELE E， CHEN Z， LIU C， et al. An attention-based deep learning approach for sleep stage classification with single-channel EEG［J］. IEEE Transactions on Neural Systems and Rehabilitation Engineering， 2021， 29： 809-818.
3	LIU J， PAN C， LEI F， et al. Fault prediction of bearings based on LSTM and statistical process analysis［J］. Reliability Engineering and System Safety， 2021， 214： No.107646.
4	JAIN P K， TIWARI A K. Heart monitoring systems： a review［J］. Computers in Biology and Medicine， 2014， 54： 1-13.
5	YU B， YIN H， ZHU Z. Spatio-temporal graph convolutional networks： a deep learning framework for traffic forecasting［C］// Proceedings of the 27th International Joint Conference on Artificial Intelligence. California： ijcai.org， 2018： 3634-3640.
6	CHAI D， WANG L， YANG Q. Bike flow prediction with multi-graph convolutional networks［C］// Proceedings of the 26th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems. New York： ACM， 2018： 397-400.
7	宋凌云，马卓源，李战怀，等. 面向金融风险预测的时序图神经网络综述［J］. 软件学报， 2024， 35（8）：3897-3922.
	SONG L Y， MA Z Y， LI Z H， et al. Review on temporal graph neural networks for financial risk prediction［J］. Journal of Software， 2024， 35（8）：3897-3922.
8	WU Z， PAN S， CHEN F， et al. A comprehensive survey on graph neural networks［J］. IEEE Transactions on Neural Networks and Learning Systems， 2020， 32（1）： 4-24.
9	JIA Z， LIN Y， WANG J， et al. Multi-view spatial-temporal graph convolutional networks with domain generalization for sleep stage classification［J］. IEEE Transactions on Neural Systems and Rehabilitation Engineering， 2021， 29： 1977-1986.
10	JIA Z， LIN Y， WANG J， et al. GraphSleepNet： adaptive spatial-temporal graph convolutional networks for sleep stage classification［C］// Proceedings of the 29th International Joint Conference on Artificial Intelligence. California： ijcai.org， 2020： 1324-1330.
11	ZAIDI T F， FAROOQ O. EEG sub-bands based sleep stages classification using Fourier Synchrosqueezed transform features［J］. Expert Systems with Applications， 2023， 212： No.118752.
12	DUAN Z， XU H， WANG Y， et al. Multivariate time-series classification with hierarchical variational graph pooling［J］. Neural Networks， 2022， 154： 481-490.
13	XU K， HU W， LESKOVEC J， et al. How powerful are graph neural networks？［EB/OL］. ［2024-05-08］..
14	ZHANG X， ZEMAN M， TSILIGKARIDIS T， et al. Graph-guided network for irregularly sampled multivariate time series［EB/OL］. ［2024-05-08］..
15	ZHANG X， ZHAO Z， TSILIGKARIDIS T， et al. Self-supervised contrastive pre-training for time series via time-frequency consistency［C］// Proceedings of the 36th International Conference on Neural Information Processing Systems. Red Hook： Curran Associates Inc.， 2022： 3988-4003.
16	XI L， YUN Z， LIU H， et al. Semi-supervised time series classification model with self-supervised learning［J］. Engineering Applications of Artificial Intelligence， 2022， 116： No.105331.
17	XI W， JAIN A， ZHANG L， et al. LB-SimTSC： an efficient similarity-aware graph neural network for semi-supervised time series classification［EB/OL］. ［2024-05-08］..
18	BRAEI M， WAGNER S. Anomaly detection in univariate time-series： a survey on the state-of-the-art［EB/OL］. ［2024-05-08］..
19	ZOVKO I I. Topics in market microstructure［M］. Amsterdam： Amsterdam University Press， 2008：13-17.
20	肖国庆，李雪琪，陈玥丹，等. 大规模图神经网络研究综述［J］. 计算机学报， 2024， 47（1）：148-171.
	XIAO G Q， LI X Q， CHEN Y D， et al. A survey of large-scale graph neural networks［J］. Chinese Journal of Computers， 2024， 47（1）：148-171.
21	KELLER J M， GRAY M R， GIVENS J A. A fuzzy k-nearest neighbor algorithm［J］. IEEE Transactions on Systems， Man， and Cybernetics， 1985， SMC-15（4）： 580-585.
22	GHIMATGAR H， KAZEMI K， HELFROUSH M S， et al. Neonatal EEG sleep stage classification based on deep learning and HMM［J］. Journal of Neural Engineering， 2020， 17（3）： No.036031.
23	ELDELE E， RAGAB M， CHEN Z， et al. Time-series representation learning via temporal and contextual contrasting［C］// Proceedings of the 30th International Joint Conference on Artificial Intelligence. California： ijcai.org， 2021： 2352-2359.

[1]	张春雪, 仇丽青, 孙承爱, 荆彩霞. 基于两阶段动态兴趣识别的购买行为预测模型[J]. 《计算机应用》唯一官方网站, 2024, 44(8): 2365-2371.
[2]	顾焰杰, 张英俊, 刘晓倩, 周围, 孙威. 基于时空多图融合的交通流量预测[J]. 《计算机应用》唯一官方网站, 2024, 44(8): 2618-2625.
[3]	姚迅, 秦忠正, 杨捷. 生成式标签对抗的文本分类模型[J]. 《计算机应用》唯一官方网站, 2024, 44(6): 1781-1785.
[4]	沈君凤, 周星辰, 汤灿. 基于改进的提示学习方法的双通道情感分析模型[J]. 《计算机应用》唯一官方网站, 2024, 44(6): 1796-1806.
[5]	王星, 刘贵娟, 陈志豪. 高斯混合模型与文本图卷积网络结合的虚假评论识别算法[J]. 《计算机应用》唯一官方网站, 2024, 44(2): 360-368.
[6]	郭晓, 陈艳平, 唐瑞雪, 黄瑞章, 秦永彬. 融合行为词的罪名预测多任务学习模型[J]. 《计算机应用》唯一官方网站, 2024, 44(1): 159-166.
[7]	李豆豆, 李汪根, 夏义春, 束阳, 高坤. 基于特征交互与自适应融合的骨骼动作识别[J]. 《计算机应用》唯一官方网站, 2023, 43(8): 2581-2587.
[8]	何嘉明, 杨巨成, 吴超, 闫潇宁, 许能华. 基于多模态图卷积神经网络的行人重识别方法[J]. 《计算机应用》唯一官方网站, 2023, 43(7): 2182-2189.
[9]	樊小宇, 蔺素珍, 王彦博, 刘峰, 李大威. 基于残差图卷积神经网络的高倍欠采样核磁共振图像重建算法[J]. 《计算机应用》唯一官方网站, 2023, 43(4): 1261-1268.
[10]	陈容均, 严宣辉, 杨超城. 面向时间序列的混合图像化循环胶囊分类网络[J]. 《计算机应用》唯一官方网站, 2023, 43(3): 692-699.
[11]	王若莹, 吕凡, 赵柳清, 胡伏原. 融合用户需求和边界约束的平面图生成算法[J]. 《计算机应用》唯一官方网站, 2023, 43(2): 575-582.
[12]	席志红, 温家旭. 基于目标检测的室内动态场景定位与建图[J]. 《计算机应用》唯一官方网站, 2022, 42(9): 2853-2857.
[13]	玄英律, 万源, 陈嘉慧. 基于多尺度卷积和注意力机制的LSTM时间序列分类[J]. 《计算机应用》唯一官方网站, 2022, 42(8): 2343-2352.
[14]	陈浩杰, 范江亭, 刘勇. 深度强化学习解决动态旅行商问题[J]. 《计算机应用》唯一官方网站, 2022, 42(4): 1194-1200.
[15]	李晓杰, 崔超然, 宋广乐, 苏雅茜, 吴天泽, 张春云. 基于时序超图卷积神经网络的股票趋势预测方法[J]. 《计算机应用》唯一官方网站, 2022, 42(3): 797-803.

融合衍生特征的时间序列事件分类方法

Time series event classification method fused with derived features

RichHTML

PDF

可视化

摘要/Abstract

引用本文

使用本文

图/表 8

参考文献 23

相关文章 15

编辑推荐

Metrics