EEG decoding via dual-branch representation fusion and EEG-text modal alignment

doi:10.11772/j.issn.1001-9081.2025081016

Abstract

Decoding text from electroencephalography (EEG) signals is an active topic in brain-computer interface (BCI) research. Existing methods emphasize global context modeling, neglect local correlations between channels, and do not combine the two; in addition, aligning EEG and text representations in a shared semantic space remains an open problem. To address these limitations, this study proposes a dual-branch EEG decoding model with a cross-modal alignment strategy. The spatiotemporal branch applies, in sequence, a bidirectional long short-term memory network, depthwise separable convolution, and gated axial self-attention to capture short-term dependencies, spatial correlations between adjacent channels, and long-range temporal relationships, respectively. In parallel, the context branch uses a multi-layer Transformer encoder to model global context, and a cross-attention module fuses the features of the two branches so that they complement each other. For cross-modal alignment, the model is trained with a joint loss that combines a triplet loss and a covariance alignment term: the triplet loss constrains the geometric distance between paired EEG and text representations, while the covariance alignment term matches the second-order statistics of the two modalities. Together, these constraints are intended to narrow the semantic gap between EEG and text representations. Experiments on the public ZuCo dataset show that the model improves BLEU-1 by about 1 percentage point over a mainstream baseline, supporting its effectiveness for EEG-to-text decoding.

Key words: electroencephalography (EEG), deep neural network, feature fusion, modality discrepancy, text generation
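
The joint alignment loss described in the abstract can be illustrated with a minimal sketch. The abstract does not give the formulas, so everything specific here is an assumption: Euclidean distance in the triplet term, the margin value, the CORAL-style 1/(4d^2) scaling of the covariance term, the weight `lam`, and all function names are hypothetical and are not taken from the paper. How negative text samples are chosen is also not stated; the sketch simply takes them as an input.

```python
import math


def l2_distance(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def triplet_loss(anchors, positives, negatives, margin=1.0):
    """Batch mean of max(0, d(a, p) - d(a, n) + margin).

    anchors: EEG embeddings; positives: matching text embeddings;
    negatives: non-matching text embeddings (sampling is an assumption).
    """
    losses = [
        max(0.0, l2_distance(a, p) - l2_distance(a, n) + margin)
        for a, p, n in zip(anchors, positives, negatives)
    ]
    return sum(losses) / len(losses)


def covariance(rows):
    """Unbiased sample covariance matrix (d x d) of n vectors of size d."""
    n, d = len(rows), len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(d)]
    centered = [[r[j] - means[j] for j in range(d)] for r in rows]
    return [
        [sum(c[i] * c[j] for c in centered) / (n - 1) for j in range(d)]
        for i in range(d)
    ]


def covariance_alignment_loss(eeg, text):
    """Squared Frobenius distance between the two covariance matrices,
    scaled by 1 / (4 d^2). The scaling follows the CORAL convention and
    is an assumption, not the paper's stated formula."""
    d = len(eeg[0])
    c_eeg, c_text = covariance(eeg), covariance(text)
    squared = sum(
        (c_eeg[i][j] - c_text[i][j]) ** 2 for i in range(d) for j in range(d)
    )
    return squared / (4 * d * d)


def joint_alignment_loss(eeg, text_pos, text_neg, margin=1.0, lam=0.5):
    """Triplet term plus lam times the covariance term (lam is hypothetical)."""
    return triplet_loss(eeg, text_pos, text_neg, margin) + lam * (
        covariance_alignment_loss(eeg, text_pos)
    )


if __name__ == "__main__":
    eeg = [[0.0, 0.0], [1.0, 0.0], [0.0, 1.0]]
    text_pos = [[0.1, 0.0], [0.9, 0.1], [0.0, 1.2]]
    text_neg = [[0.0, 1.0], [0.0, 0.0], [1.0, 0.0]]
    print(joint_alignment_loss(eeg, text_pos, text_neg))
```

The two terms act at different levels: the triplet term works on individual EEG-text pairs, while the covariance term compares batch-level statistics, so it needs at least two samples per batch to be defined.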


CLC Number: TP18

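Xiaocui XU (徐晓翠), Bo LI (李波), Yutong ZOU (邹宇童). EEG decoding via dual-branch representation fusion and EEG-text modal alignment (基于双分支表征融合与跨模态对齐的脑电信号解码模型) [J]. Journal of Computer Applications, DOI: 10.11772/j.issn.1001-9081.2025081016.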

