Journal of Computer Applications ›› 2024, Vol. 44 ›› Issue (6): 1781-1785.DOI: 10.11772/j.issn.1001-9081.2023050662

Special Issue: Artificial Intelligence

• Artificial intelligence •

Generative label adversarial text classification model

Xun YAO1, Zhongzheng QIN1, Jie YANG2()   

  1. School of Computer Science and Artificial Intelligence, Wuhan Textile University, Wuhan Hubei 430200, China
    2. School of Computer and Information Technology, University of Wollongong, Wollongong, New South Wales 2522, Australia
  • Received: 2023-05-29  Revised: 2023-09-04  Accepted: 2023-09-22  Online: 2023-09-26  Published: 2024-06-10
  • Contact: Jie YANG
  • About author: YAO Xun, born in 1969, Ph. D., lecturer. His research interests include image processing, computer vision, and natural language processing.
    QIN Zhongzheng, born in 1999, M. S. candidate. His research interests include text classification.


Abstract:

Text classification is a fundamental task in Natural Language Processing (NLP) that aims to assign text data to predefined categories. Combining a Graph Convolutional neural Network (GCN) with the large-scale pre-trained model BERT (Bidirectional Encoder Representations from Transformers) has achieved excellent results on text classification tasks. However, the undirected information propagation of a GCN over a large-scale heterogeneous graph introduces information noise, which misleads the model and reduces its classification ability. To solve this problem, a generative label adversarial model, the Class Adversarial Graph Convolutional Network (CAGCN) model, was proposed to reduce the interference of irrelevant information during classification and improve the classification performance of the model. Firstly, the graph construction method of TextGCN (Text Graph Convolutional Network) was used to build the adjacency matrix, which was combined with the GCN and BERT models to form a Class Generator (CG). Secondly, a pseudo-label feature training method was used during model training to construct a cluster, and the cluster and the class generator were trained jointly. Finally, experiments were carried out on several widely used datasets. Experimental results show that the classification accuracy of the CAGCN model is 1.2, 0.1, 0.5, 1.7 and 0.5 percentage points higher than that of the RoBERTaGCN model on the widely used classification datasets 20NG, R8, R52, Ohsumed and MR, respectively.
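The TextGCN-style graph construction referred to above builds a single heterogeneous graph whose nodes are documents plus vocabulary words, with document-word edges weighted by TF-IDF and word-word edges weighted by positive pointwise mutual information (PMI) over sliding windows. A minimal sketch of that construction is given below; the function name, the toy sliding-window handling, and the dense-matrix representation are illustrative assumptions, not the paper's actual code.

```python
import math
from collections import Counter

def build_textgcn_adjacency(docs, window=3):
    """Sketch of TextGCN heterogeneous graph construction.
    Nodes: documents first, then vocabulary words.
    Edges: doc-word weighted by TF-IDF, word-word weighted by
    positive PMI over sliding windows, plus unit self-loops."""
    vocab = sorted({w for d in docs for w in d})
    w2i = {w: i for i, w in enumerate(vocab)}
    n_doc = len(docs)
    n = n_doc + len(vocab)
    A = [[0.0] * n for _ in range(n)]

    # Doc-word edges: term frequency * inverse document frequency.
    df = Counter(w for d in docs for w in set(d))
    for di, d in enumerate(docs):
        for w, c in Counter(d).items():
            weight = (c / len(d)) * math.log(n_doc / df[w])
            A[di][n_doc + w2i[w]] = A[n_doc + w2i[w]][di] = weight

    # Word-word edges: positive PMI over fixed-size sliding windows.
    windows = [d[i:i + window] for d in docs
               for i in range(max(1, len(d) - window + 1))]
    nw = len(windows)
    occ, pair = Counter(), Counter()
    for win in windows:
        ws = set(win)
        occ.update(ws)
        pair.update((a, b) for a in ws for b in ws if a < b)
    for (a, b), c in pair.items():
        pmi = math.log((c / nw) / ((occ[a] / nw) * (occ[b] / nw)))
        if pmi > 0:
            ia, ib = n_doc + w2i[a], n_doc + w2i[b]
            A[ia][ib] = A[ib][ia] = pmi

    # Self-loops for every node.
    for i in range(n):
        A[i][i] = 1.0
    return A, vocab
```

In the CAGCN pipeline described in the abstract, a matrix built this way would then be fed to the GCN branch alongside BERT document embeddings; the joint pseudo-label training of the cluster and the class generator is the paper's contribution and is not reproduced here.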

Key words: text classification, Graph Convolutional neural Network (GCN), BERT (Bidirectional Encoder Representations from Transformers), pseudo-label, heterogeneous graph


CLC Number: