《计算机应用》唯一官方网站

• •    下一篇

基于CNN-Transformer编码器的多维脑电特征融合与抑郁状态识别模型

董思语1,贾鑫2,张密2,陈新文1,孙昕霙1,程方骁2   

  1. 1. 北京大学公共卫生学院
    2. 北京大学医学部医学技术研究院
  • 收稿日期:2025-08-11 修回日期:2025-09-08 发布日期:2025-11-05 出版日期:2025-11-05
  • 通讯作者: 董思语

CNN-Transformer encoder-based model for depressive state recognition with multidimensional EEG feature fusion

  • Received:2025-08-11 Revised:2025-09-08 Online:2025-11-05 Published:2025-11-05

摘要: 针对脑电(EEG)信号识别抑郁症任务中存在的特征融合不足、分类粒度有限等问题,提出了一种基于CNN-Transformer编码器的多维脑电特征融合抑郁状态识别模型(CTE-DSR)。首先,利用一维卷积神经网络 (1D-CNN) 提取EEG的局部空间序列特征;然后,引入Transformer编码器建模各导联间的全局相关性;同时,通过多层感知机 (MLP) 将抑郁相关非序列特征嵌入高维空间;最后,将两类特征融合并通过全连接层实现对抑郁状态的细粒度分类。采用损失加权策略改善模型对少数类的识别能力。实验结果表明,与其他五类机器学习模型(RF、AdaBoost等)和两类组合模型(CNN-LSTM、CNN-RNN)相比,CTE-DSR在二分类任务中的召回率和AUC值分别至少提升了23.0%和6.7%,在三分类中的F1值和宏平均AUC值分别至少提升了15.6%和1.0%。模型在抑郁状态识别中展现出性能优势,可用于辅助抑郁症早期筛查和复发风险评估。

Abstract: Considering the challenges of insufficient feature fusion and limited classification granularity in electroencephalogram (EEG)-based depression recognition task, a CNN-Transformer encoder-based model for Depressive State Recognition (CTE-DSR) that integrates multidimensional EEG features was proposed. Firstly, one-dimensional convolutional neural network (1D-CNN) was used to extract local spatial sequential features from EEG. Then, Transformer encoder module was introduced to model global correlations between leads. Simultaneously, multi-layer perceptron (MLP) was used to embed depression-related non-sequential features into a high-dimensional space. Finally, the two features were fused and implemented in a fully connected layer to achieve fine-grained classification of depression states. A loss weighting strategy was employed to improve the model's ability to identify minority classes. Experimental results show that compared with other five-class machine learning models (such as RF, AdaBoost) and two-class combination models (CNN-LSTM, CNN-RNN), CTE-DSR achieves at least 23.0% and 6.7% higher recall and AUC in binary classification, respectively, and at least 15.6% and 1.0% higher F1-score and macro-average AUC in ternary classification. The model has performance advantages in identifying depressive states and can be used to assist in early screening of depression and relapse risk assessment.

中图分类号: