Multi-domain fake news detection model enhanced by APK-CNN and Transformer

doi:10.11772/j.issn.1001-9081.2023091359

Journal of Computer Applications ›› 2024, Vol. 44 ›› Issue (9): 2674-2682.DOI: 10.11772/j.issn.1001-9081.2023091359

• Artificial intelligence • Previous Articles Next Articles

Multi-domain fake news detection model enhanced by APK-CNN and Transformer

Jinjin LI, Guoming SANG(), Yijia ZHANG

Information Science and Technology College，Dalian Maritime University，Dalian Liaoning 116026，China

Received:2023-10-09 Revised:2023-12-08 Accepted:2023-12-11 Online:2024-03-21 Published:2024-09-10
Contact: Guoming SANG
About author:LI Jinjin， born in 2000， M. S. candidate. Her research interests include natural language processing， rumor detection.
ZHANG Yijia， born in 1979， Ph. D.， professor. His research interests include natural language processing， social media computing.
Supported by:
National Natural Science Foundation of China(62072070);Fundamental Research Funds for Central University(3132019207)

APK-CNN和Transformer增强的多域虚假新闻检测模型

李金金, 桑国明(), 张益嘉

大连海事大学信息科学技术学院，辽宁大连 116026

通讯作者: 桑国明
作者简介:李金金（2000—），女，河南漯河人，硕士研究生，CCF会员，主要研究方向：自然语言处理、谣言检测
桑国明（1971—），男，辽宁大连人，副教授，硕士，主要研究方向：自然语言处理、人工智能
张益嘉（1979—），男，辽宁大连人，教授，博士，主要研究方向：自然语言处理、社会媒体计算。
基金资助:
国家自然科学基金资助项目(62072070);中央高校基本科研业务费项目(3132019207)

Abstract

Abstract:

In order to solve the problems of domain shifting and incomplete domain labeling in social media news， as well as to explore more efficient multi-domain news feature extraction and fusion networks， a multi-domain fake news detection model based on enhancement by APK-CNN （Adaptive Pooling Kernel Convolutional Neural Network） and Transformer was proposed， namely Transm3. Firstly， a three-channel network was designed for feature extraction and representation of semantic， emotional， and stylistic information of the text and view combination of these features using a multi-granularity cross-domain interactor. Secondly， the news domain labels were refined by optimized soft-shared memory networking and domain adapters. Then， Transformer was combined with a multi-granularity cross-domain interactor to dynamically and weighty aggregate the interaction features of different domains. Finally， the fused features were fed into the classifier for true/false news discrimination. Experimental results show that compared with M³FEND （Memory-guided Multi-view Multi-domain FakE News Detection） and EANN （Event Adversarial Neural Networks for multi-modal fake news detection）， Transm3 improves the comprehensive F1 value by 3.68% and 6.46% on Chinese dataset， and 6.75% and 11.93% on English dataset； and the F1 values on sub-domains are also significantly improved. The effectiveness of Transm3 for multi-domain fake news detection is fully validated.

Key words: fake news detection, domain shift, soft-shared memory networking, Transformer, APK-CNN (Adaptive Pooling Kernel Convolutional Neural Network)

摘要：

为解决社交媒体新闻中的领域转移、领域标签不完整问题，以及探索更高效的多域新闻文本特征提取和融合网络，提出一种基于APK-CNN（Adaptive Pooling Kernel Convolutional Neural Network）和Transformer增强的多域虚假新闻检测模型Transm3。首先，设计三通道网络对文本的语义、情感和风格信息进行特征提取和表示，并利用多粒度跨域交互器对这些特征进行视图组合；其次，通过优化的软共享内存网络和域适配器来完善新闻领域标签；再次，将Transformer与多粒度跨域交互器结合，使用更先进的融合网络动态加权聚合不同领域的交互特征；最后，将融合特征输入分类器中用于真/假新闻判别。实验结果表明，Transm3与M³FEND（Memory-guided Multi-view Multi-domain FakE News Detection）和EANN（Event Adversarial Neural Networks for multi-modal fake news detection）相比，综合F1值在中文数据集上分别提高了3.68%和6.46%，在英文数据集上分别提高了6.75%和11.93%，在各分领域上F1值也有明显的提高，充分验证了Transm3在多域虚假新闻检测工作上的有效性。

关键词: 虚假新闻检测, 领域转移, 软共享内存网络, Transformer, APK-CNN

CLC Number:

TP391.1

Jinjin LI, Guoming SANG, Yijia ZHANG. Multi-domain fake news detection model enhanced by APK-CNN and Transformer[J]. Journal of Computer Applications, 2024, 44(9): 2674-2682.

李金金, 桑国明, 张益嘉. APK-CNN和Transformer增强的多域虚假新闻检测模型[J]. 《计算机应用》唯一官方网站, 2024, 44(9): 2674-2682.

Figures/Tables 9

References 28

1	SILVA A， LUO L， KARUNASEKERA S， et al. Embracing domain differences in fake news： cross-domain fake news detection using multi-modal data ［C］// Proceedings of the 2021 AAAI Conference on Artificial Intelligence. Palo Alto： AAAI Press， 2021， 35（1）： 557-565.
2	NAN Q， CAO J， ZHU Y， et al. MDFEND： multi-domain fake news detection ［C］// Proceedings of the 30th ACM International Conference on Information & Knowledge Management. New York： ACM， 2021： 3343-3347.
3	ZHU Y， SHENG Q， CAO J， et al. Memory-guided multi-view multi-domain fake news detection ［J］. IEEE Transactions on Knowledge and Data Engineering， 2023， 35（7）： 7178-7191.
4	SINGHAL S， SHAH R R， CHAKRABORTY T， et al. Spotfake： a multi-modal framework for fake news detection ［C］// Proceedings of the 2019 IEEE 5th International Conference on Multimedia Big Data. Piscataway： IEEE， 2019： 39-47.
5	MA J， GAO W， K-F WONG. Detect rumors on twitter by promoting information campaigns with generative adversarial learning ［C］// Proceedings of the 2019 World Wide Web Conference. New York： ACM， 2019： 3049-3055.
6	GANIN Y， USTINOVA E， AJAKAN H， et al. Domain-adversarial training of neural networks ［J］. The Journal of Machine Learning Research， 2016， 17（1）： 2096-2030.
7	MA J， ZHAO Z， YI X， et al. Modeling task relationships in multi-task learning with multi-gate mixture-of-experts ［C］// Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. New York： ACM， 2018： 1930-1939.
8	ZHU Y， ZHUANG F， WANG D. Aligning domain-specific distribution and classifier for cross-domain classification from multiple sources ［C］// Proceedings of the 2019 AAAI Conference on Artificial Intelligence. Palo Alto： AAAI Press， 2019， 33（1）： 5989-5996.
9	ZADEH A， LIANG P P， MAZUMDER N， et al. Memory fusion network for multi-view sequential learning ［C］// Proceedings of the 2018 AAAI Conference on Artificial Intelligence. Palo Alto： AAAI Press， 2018， 32（1）： 5634-5641.
10	ZHANG X， CAO J， LI X， et al. Mining dual emotion for fake news detection ［C］// Proceedings of the Web Conference 2021. New York： ACM， 2021： 3465-3476.
11	YANG Y， CAO J， LU M， et al. How to write high-quality news on social network？ Predicting news quality by mining writing style ［EB/OL］. ［2022-08-17］. .
12	CASTILLO C， MENDOZA M， POBLETE B. Information credibility on twitter ［C］// Proceedings of the 20th International Conference on World Wide Web. New York： ACM， 2011： 675-684.
13	VASWANI A， SHAZEER N， PARMAR N， et al. Attention is all you need ［C］// Proceedings of the 31st Neural Information Processing Systems. Red Hook： Curran Associates Inc.， 2017： 6000-6010.
14	MA J， GAO W， MITRA P， et al. Detecting rumors from microblogs with recurrent neural networks ［C］// Proceedings of the 25th International Joint Conference on Artificial Intelligence. Palo Alto： AAAI Press， 2016： 3818-3824.
15	XIA H， WANG Y， ZHANG J Z， et al. COVID-19 fake news detection： a hybrid CNN-BiLSTM-AM model ［J］. Technological Forecasting and Social Change， 2023， 195： 122746.
16	SHAFIQ M， GU Z. Deep residual learning for image recognition： a survey ［J］. Applied Sciences， 2022， 12（18）： 8972.
17	ZENG G， CHI J， MA R， et al. ADAPT： adversarial domain adaptation with purifier training for cross-domain credit risk forecasting ［C］// Proceedings of the 27th International Conference on Database Systems for Advanced Applications. Cham： Springer， 2022： 353-369.
18	RAZA S， DING C. Fake news detection based on news content and social contexts： a Transformer-based approach ［J］. International Journal of Data Science and Analytics， 2022， 13（4）： 335-362.
19	DAVOUDI M， MOOSAVI M R， SADREDDINI M H. DSS： a hybrid deep model for fake news detection using propagation tree and stance network ［J］. Expert Systems with Applications， 2022， 198： 116635.
20	SHAHID W， JAMSHIDI B， HAKAK S， et al. Detecting and mitigating the dissemination of fake news： challenges and future research opportunities ［J］. IEEE Transactions on Computational Social Systems， 2024， 11（4）： 4649-4662.
21	HUANG K-H， McKEOWN K， NAKOV P， et al. Faking fake news for real fake news detection： propaganda-loaded training data generation ［EB/OL］. ［2023-03-13］. .
22	MOHAPATRA A， THOTA N， PRAKASAM P. Fake news detection and classification using hybrid BiLSTM and self-attention model ［J］. Multimedia Tools and Applications， 2022， 81（13）： 18503-18519.
23	KIM Y. Convolutional neural networks for sentence classification ［EB/OL］. ［2022-12-02］. .
24	LIU Y， OTT M， GOYAL N， et al. RoBERTa： a robustly optimized BERT pretraining approach ［EB/OL］. ［2023-02-12］. .
25	CUI Y， CHE W， LIU T， et al. Pre-training with whole word masking for Chinese BERT ［J］. IEEE/ACM Transactions on Audio， Speech， and Language Processing， 2021， 29： 3504-3514.
26	PRZYBYLA P. Capturing the style of fake news ［C］// Proceedings of the 2020 AAAI Conference on Artificial Intelligence. Palo Alto： AAAI Press， 2020， 34（1）： 490-497.
27	WANG Y， MA F， JIN Z， et al. EANN： event adversarial neural networks for multi-modal fake news detection ［C］// Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. New York： ACM， 2018： 849-857.
28	QIN Z， CHENG Y， ZHAO Z， et al. Multitask mixture of sequential experts for user activity streams ［C］// Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. New York： ACM， 2020： 3083-3091.

领域	样本数		领域	样本数
领域	真新闻	假新闻	领域	真新闻	假新闻
科技	143	93	健康	485	515
军事	121	222	金融	959	362
教育	243	248	娱乐	1 000	440
灾难	185	591	社会	1 198	1 471
政治	306	546	合计	4 640	4 488

领域	样本数		领域	样本数
领域	真新闻	假新闻	领域	真新闻	假新闻
科技	143	93	健康	485	515
军事	121	222	金融	959	362
教育	243	248	娱乐	1 000	440
灾难	185	591	社会	1 198	1 471
政治	306	546	合计	4 640	4 488

领域	样本数
领域	真新闻	假新闻
合计	22 001	6 763
Gossipcop	16 804	5 067
Politifact	447	379
COVID	4 750	1 317

领域	样本数
领域	真新闻	假新闻
合计	22 001	6 763
Gossipcop	16 804	5 067
Politifact	447	379
COVID	4 750	1 317

模型		不同领域上的F1			overall
模型		Gossipcop	Politifact	COVID	F1	Acc	AUC
单域	BIGRU	76.66	77.22	88.85	79.58	86.68	88.40
	TextCNN	77.86	80.11	90.40	80.79	86.92	90.23
	RoBERTa	78.10	85.83	92.88	81.84	88.02	91.08
混合域	BIGRU	74.79	73.39	74.48	75.01	83.21	85.04
	TextCNN	75.19	70.40	83.22	76.79	83.62	86.74
	RoBERTa	78.23	79.67	90.14	81.01	87.44	90.58
	StyleLSTM	80.07	79.37	92.52	82.85	88.26	92.50
	DualEmo	80.56	78.68	90.19	82.70	88.18	92.51
多域	EANN	79.37	75.58	88.36	81.23	87.43	90.53
	MMoE	80.22	84.77	93.79	83.61	89.20	92.65
	MoSE	79.81	85.76	93.26	83.18	88.85	92.52
	EDDFN	80.67	85.05	93.06	83.78	89.12	92.63
	MDFEND	80.80	84.73	93.31	83.90	89.36	92.37
	M³FEND	82.37	84.78	93.92	85.17	89.77	93.42
Transm3		84.67	89.82	94.86	90.92	92.12	96.54

Multi-domain fake news detection model enhanced by APK-CNN and Transformer

APK-CNN和Transformer增强的多域虚假新闻检测模型

RichHTML

PDF

Knowledge

Abstract

Cite this article

share this article

Figures/Tables 9

References 28

Related Articles 15

Recommended Articles

Metrics

数据集	Head	F1/%	Acc/%	数据集	Head	F1/%	Acc/%
En-3	1	87.45	88.01	Ch-9	1	91.14	91.15
	2	89.66	88.97		2	93.20	93.19
	4	90.92	92.12		4	95.55	95.57
	8	88.12	89.63		8	92.35	92.35

[1]	Liehong REN, Lyuwen HUANG, Xu TIAN, Fei DUAN. Multivariate long-term series forecasting method with DFT-based frequency-sensitive dual-branch Transformer [J]. Journal of Computer Applications, 2024, 44(9): 2739-2746.
[2]	Yunchuan HUANG, Yongquan JIANG, Juntao HUANG, Yan YANG. Molecular toxicity prediction based on meta graph isomorphism network [J]. Journal of Computer Applications, 2024, 44(9): 2964-2969.
[3]	Xin YANG, Xueni CHEN, Chunjiang WU, Shijie ZHOU. Short-term traffic flow prediction of urban highway based on variant residual model and Transformer [J]. Journal of Computer Applications, 2024, 44(9): 2947-2951.
[4]	Jiepo FANG, Chongben TAO. Hybrid internet of vehicles intrusion detection system for zero-day attacks [J]. Journal of Computer Applications, 2024, 44(9): 2763-2769.
[5]	Jieru JIA, Jianchao YANG, Shuorui ZHANG, Tao YAN, Bin CHEN. Unsupervised person re-identification based on self-distilled vision Transformer [J]. Journal of Computer Applications, 2024, 44(9): 2893-2902.
[6]	Yuwei DING, Hongbo SHI, Jie LI, Min LIANG. Image denoising network based on local and global feature decoupling [J]. Journal of Computer Applications, 2024, 44(8): 2571-2579.
[7]	Kaili DENG, Weibo WEI, Zhenkuan PAN. Industrial defect detection method with improved masked autoencoder [J]. Journal of Computer Applications, 2024, 44(8): 2595-2603.
[8]	Fan YANG, Yao ZOU, Mingzhi ZHU, Zhenwei MA, Dawei CHENG, Changjun JIANG. Credit card fraud detection model based on graph attention Transformation neural network [J]. Journal of Computer Applications, 2024, 44(8): 2634-2642.
[9]	Dahai LI, Zhonghua WANG, Zhendong WANG. Dual-branch low-light image enhancement network combining spatial and frequency domain information [J]. Journal of Computer Applications, 2024, 44(7): 2175-2182.
[10]	Shibin LI, Jun GONG, Shengjun TANG. Semi-supervised heterophilic graph representation learning model based on Graph Transformer [J]. Journal of Computer Applications, 2024, 44(6): 1816-1823.
[11]	Junfeng SHEN, Xingchen ZHOU, Can TANG. Dual-channel sentiment analysis model based on improved prompt learning method [J]. Journal of Computer Applications, 2024, 44(6): 1796-1806.
[12]	Mengyuan HUANG, Kan CHANG, Mingyang LING, Xinjie WEI, Tuanfa QIN. Progressive enhancement algorithm for low-light images based on layer guidance [J]. Journal of Computer Applications, 2024, 44(6): 1911-1919.
[13]	Xiting LYU, Jinghua ZHAO, Haiying RONG, Jiale ZHAO. Information diffusion prediction model based on Transformer and relational graph convolutional network [J]. Journal of Computer Applications, 2024, 44(6): 1760-1766.
[14]	Xun YAO, Zhongzheng QIN, Jie YANG. Generative label adversarial text classification model [J]. Journal of Computer Applications, 2024, 44(6): 1781-1785.
[15]	Zihan LIU, Dengwen ZHOU, Yukai LIU. Image super-resolution network based on global dependency Transformer [J]. Journal of Computer Applications, 2024, 44(5): 1588-1596.

模型		不同领域的F1									overall
模型		科技	军事	教育	灾难	政治	健康	金融	娱乐	社会	F1	Acc	AUC
单域	BIGRU	51.75	33.65	74.16	72.93	85.88	83.73	81.37	79.92	79.18	81.03	81.03	89.02
	TextCNN	40.74	33.65	80.59	43.88	84.82	88.19	82.15	79.73	86.15	83.69	83.70	90.94
	RoBERTa	74.63	73.69	81.46	75.47	80.44	88.73	83.61	85.13	83.00	84.77	84.77	92.26
混合域	BIGRU	72.69	87.24	81.38	79.35	83.56	88.68	82.91	86.29	84.85	85.95	85.98	93.09
	TextCNN	72.54	88.39	83.62	82.22	85.61	87.68	86.38	84.56	85.40	86.86	86.87	93.81
	RoBERTa	77.77	90.72	83.31	85.12	83.66	90.90	87.35	87.69	85.77	87.95	87.97	94.51
	StyleLSTM	77.29	91.87	83.41	85.32	84.87	90.84	88.02	88.46	85.52	88.20	88.21	94.71
	DualEmo	83.23	90.26	83.62	83.96	84.55	89.05	90.53	89.44	85.69	88.46	88.46	95.41
多域	EANN	82.25	92.74	86.24	86.66	87.05	91.05	87.10	89.57	88.77	89.75	89.77	96.10
	MMoE	87.55	91.12	87.06	87.70	86.20	93.64	85.67	88.86	87.50	89.47	89.48	95.47
	MoSE	85.02	88.58	88.15	86.72	88.08	91.79	86.72	89.13	87.29	89.39	89.40	95.43
	EDDFN	81.86	91.37	86.76	87.86	84.78	93.79	86.36	88.32	86.89	89.19	89.19	95.28
	MDFEND	83.01	93.89	89.17	90.03	88.65	94.00	89.51	90.66	89.80	91.37	91.38	97.08
	M³FEND	82.92	95.06	89.98	88.96	88.25	94.60	90.09	93.15	90.89	92.16	92.16	97.50
Transm3		89.43	98.07	91.11	92.36	90.74	96.90	92.56	94.90	91.95	95.55	95.57	98.95

模型		不同领域的F1									overall
模型		科技	军事	教育	灾难	政治	健康	金融	娱乐	社会	F1	Acc	AUC
单域	BIGRU	51.75	33.65	74.16	72.93	85.88	83.73	81.37	79.92	79.18	81.03	81.03	89.02
	TextCNN	40.74	33.65	80.59	43.88	84.82	88.19	82.15	79.73	86.15	83.69	83.70	90.94
	RoBERTa	74.63	73.69	81.46	75.47	80.44	88.73	83.61	85.13	83.00	84.77	84.77	92.26
混合域	BIGRU	72.69	87.24	81.38	79.35	83.56	88.68	82.91	86.29	84.85	85.95	85.98	93.09
	TextCNN	72.54	88.39	83.62	82.22	85.61	87.68	86.38	84.56	85.40	86.86	86.87	93.81
	RoBERTa	77.77	90.72	83.31	85.12	83.66	90.90	87.35	87.69	85.77	87.95	87.97	94.51
	StyleLSTM	77.29	91.87	83.41	85.32	84.87	90.84	88.02	88.46	85.52	88.20	88.21	94.71
	DualEmo	83.23	90.26	83.62	83.96	84.55	89.05	90.53	89.44	85.69	88.46	88.46	95.41
多域	EANN	82.25	92.74	86.24	86.66	87.05	91.05	87.10	89.57	88.77	89.75	89.77	96.10
	MMoE	87.55	91.12	87.06	87.70	86.20	93.64	85.67	88.86	87.50	89.47	89.48	95.47
	MoSE	85.02	88.58	88.15	86.72	88.08	91.79	86.72	89.13	87.29	89.39	89.40	95.43
	EDDFN	81.86	91.37	86.76	87.86	84.78	93.79	86.36	88.32	86.89	89.19	89.19	95.28
	MDFEND	83.01	93.89	89.17	90.03	88.65	94.00	89.51	90.66	89.80	91.37	91.38	97.08
	M³FEND	82.92	95.06	89.98	88.96	88.25	94.60	90.09	93.15	90.89	92.16	92.16	97.50
Transm3		89.43	98.07	91.11	92.36	90.74	96.90	92.56	94.90	91.95	95.55	95.57	98.95