Multi-scale feature fusion sentiment classification based on bidirectional cross attention

doi:10.11772/j.issn.1001-9081.2024081193

Journal of Computer Applications

Received: 2024-08-23 Revised: 2024-12-03 Online: 2024-12-17 Published: 2024-12-17

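Yiming LIANG (梁一鸣)¹, Jing FAN (范菁)², Wenze CHAI (柴汶泽)²

1. Yunnan Minzu University
2. School of Electrical and Information Engineering, Yunnan Minzu University

Corresponding author: Jing FAN
Supported by:
Youth Fund Project for Humanities and Social Sciences Research of the Ministry of Education; National Natural Science Foundation of China; Wu Zhonghai Expert Workstation of Yunnan Province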

Abstract

Abstract: This paper addresses three problems in fine-grained sentiment classification: the limited ability of existing models to capture deep sentiment, the unidirectional nature of traditional attention mechanisms, and class imbalance in natural language processing data. It proposes a sentiment classification model that combines Multi-scale BERT features with Bidirectional Cross Attention (M-BCA). The model first extracts multi-scale features from the lower, middle, and upper layers of BERT, which represent surface information, syntactic information, and deep semantic information, respectively. A three-channel GRU then extracts further deep semantic features. To strengthen the interaction among the multi-scale features, a bidirectional cross-attention mechanism is introduced so that features at different scales can interact with and learn from one another. To handle class imbalance, a data augmentation strategy and a hybrid loss function are designed to improve the model's learning of minority-class samples. Experimental results show that M-BCA performs notably well on fine-grained sentiment classification tasks, particularly on minority-class samples, offering a new research perspective and technical approach for sentiment classification.

Key words: BERT, fine-grained sentiment classification, multi-scale feature fusion, data augmentation, hybrid loss function, bidirectional cross attention
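
The abstract does not give the exact formulation of the bidirectional cross-attention step, so the following is only a minimal sketch of the general idea, under stated assumptions: two feature sequences (standing in, hypothetically, for features from two different BERT scales) each act as queries over the other, using plain single-head scaled dot-product attention with no learned projections. The function names `attend` and `bidirectional_cross_attention` and the toy numbers are illustrative and are not taken from the paper.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attend(queries, keys_values):
    """Scaled dot-product attention (assumption: no learned projections).

    Each query row is replaced by a weighted average of the rows in
    keys_values, with weights given by softmax(q . k / sqrt(d)).
    """
    d = len(queries[0])
    attended = []
    for q in queries:
        scores = [
            sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d)
            for k in keys_values
        ]
        weights = softmax(scores)
        attended.append(
            [sum(w * v[j] for w, v in zip(weights, keys_values)) for j in range(d)]
        )
    return attended


def bidirectional_cross_attention(feats_a, feats_b):
    """Run cross attention in both directions between two feature sequences."""
    a_from_b = attend(feats_a, feats_b)  # scale A queries scale B
    b_from_a = attend(feats_b, feats_a)  # scale B queries scale A
    return a_from_b, b_from_a


if __name__ == "__main__":
    # Toy inputs: 3 tokens, 2 dimensions per token (hypothetical values).
    low_level = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
    high_level = [[0.5, 0.5], [2.0, 0.0], [0.0, 2.0]]
    a_from_b, b_from_a = bidirectional_cross_attention(low_level, high_level)
    print(a_from_b)
    print(b_from_a)
```

In this sketch each direction yields a sequence with the same length as its query side, so the two outputs could be concatenated or summed with the original features before classification. How M-BCA actually fuses them is not stated in the abstract.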

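Yiming LIANG, Jing FAN, Wenze CHAI. Multi-scale feature fusion sentiment classification based on bidirectional cross attention [J]. Journal of Computer Applications, DOI: 10.11772/j.issn.1001-9081.2024081193.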
