《计算机应用》唯一官方网站

• •    下一篇

基于多尺度卷积和自注意力特征融合的多模态情感识别方法

陈田1,蔡从虎2,3,4,袁晓辉5,罗蓓蓓2   

  1. 1. 合肥工业大学
    2. 合肥工业大学计算机与信息学院
    3. 情感计算与先进智能机器安徽省重点实验室
    4. 智能互联系统安徽省实验室
    5. 北德克萨斯州大学计算机与工程学院
  • 收稿日期:2023-02-27 修回日期:2023-04-02 发布日期:2023-08-14 出版日期:2023-08-14
  • 通讯作者: 陈田
  • 基金资助:
    3D芯片可重构自测试与自适应测试的方法研究;纳米集成电路边缘缺陷测试分析仪研制

Multimodal emotion recognition method based on multiscale convolution and self-attentive feature fusion

  • Received:2023-02-27 Revised:2023-04-02 Online:2023-08-14 Published:2023-08-14

摘要: 基于生理信号的情感识别受噪声等因素影响,准确率和跨个体泛化能力较差。对此,提出一种基于脑电、心电和眼动信号的多模态情感识别方法。首先对生理信号进行多尺度卷积,获取更高维度的信号特征和减小参数量;其次,自注意力机制被用于多模态信号特征的融合中,以提升关键特征的权重和减少模态之间的特征干扰;最后,双向长短期记忆网络被用于提取融合特征的时序信息和分类。实验结果表明,方法在效价、唤醒度和效价/唤醒度四分类任务上分别取得90.29%、91.38%和83.53%的识别准确率,相比脑电单模态和脑电/心电双模态方法,准确率上提升了7.11和3.15个百分点。方法的多尺度卷积和自注意力特征融合具有有效性,方法能够准确识别情感,在个体间的识别稳定性更好。

关键词: 脑电, 自注意力, 心电, 眼动, 多模态, 情感识别

Abstract: Emotion recognition based on physiological signals was affected by noise and other factors, resulting in poor accuracy and weak cross-individual generalization ability. Concerned the issue, a multimodal emotion recognition method based on electroencephalogram (EEG), electrocardiogram (ECG), and eye movement signals was proposed. Firstly, multi-scale convolution was used to obtain higher-dimensional signal features and reduce parameter size. Secondly, Self-Attention was employed in the fusion of multimodal signal features to enhance the weights of key features and reduce feature interference between modalities. Finally, a bidirectional LSTM network was used to extract temporal information of fused features and classification. As was shown in the experimental results, recognition accuracies of 90.29%, 91.38%, and 83.53% for valence, arousal, and valence/arousal four-class recognition tasks, respectively, were achieved, which showed an improvement of 7.11% and 3.15% compared to the EEG single-modality and EEG/ECG bimodal methods. The multi-scale convolution and the feature fusion based on Self-Attention were effective, emotions could be accurately recognized by the method with better recognition stability between individuals.

Key words: Electroencephalogram (EEG), Self-Attention, electrocardiogram (ECG), eye movement, multimodal, emotion recognition

中图分类号: