Journal of Computer Applications


Deepfake speech detection model based on quantum-Transformer

  

  • Received:2025-11-03 Revised:2026-01-22 Accepted:2026-02-11 Online:2026-03-12 Published:2026-03-12
  • Contact: CHANG Yan

SONG Ziyang, CHANG Yan*, YAN Lili, ZHAO Yinshan, LIU Honglin, SONG Haiquan

  1. School of Cybersecurity (Xin Gu Industrial College), Chengdu University of Information Technology, Chengdu 610225, China
  • Corresponding author: CHANG Yan
  • Supported by:
    National Natural Science Foundation of China.

Abstract: Voice forgery technology poses a potential threat to people's daily lives. Classical fake speech detection models currently face challenges such as performance bottlenecks and excessive parameter counts. To address these issues, a quantum‑Transformer based fake speech detection model, the Quantum Security Speech Model (QSSM), was proposed. In this model, parameterized quantum circuits (PQC) were used to construct a quantum QKV mapping module that generates the Query, Key, and Value vectors; self‑attention between feature vectors was computed via the Swap test, and PQC‑based quantum attention pooling was employed to aggregate contextual information. Experimental results demonstrate that, on fake speech detection tasks, the model reduces the equal error rate by 0.5% to 4.5% compared with classical models such as RawNet2, while using 43% fewer parameters than a classical Transformer. The model provides a new pathway for deploying fake speech detection in resource‑constrained environments.
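To make the Swap-test attention idea concrete: a Swap test estimates the squared overlap |⟨ψ|φ⟩|² between two quantum states, which can replace the scaled dot product as an attention score. The sketch below is a classical NumPy simulation of that idea only; the function names and the softmax weighting are illustrative assumptions, not the paper's actual QSSM implementation, which runs the computation on parameterized quantum circuits.

```python
import numpy as np

def swap_test_score(psi, phi):
    # Swap test: the ancilla qubit measures 0 with probability
    # P(0) = (1 + |<psi|phi>|^2) / 2, so the circuit effectively
    # estimates the squared overlap |<psi|phi>|^2. Here we compute
    # that quantity directly on normalized state vectors.
    psi = psi / np.linalg.norm(psi)
    phi = phi / np.linalg.norm(phi)
    return float(np.abs(np.vdot(psi, phi)) ** 2)

def swap_test_attention(queries, keys, values):
    # Attention weights built from Swap-test overlaps instead of
    # scaled dot products (softmax weighting is an assumption here).
    scores = np.array([[swap_test_score(q, k) for k in keys]
                       for q in queries])
    weights = np.exp(scores) / np.exp(scores).sum(axis=1, keepdims=True)
    return weights @ values
```

Because the score is a squared overlap, it lies in [0, 1]: identical (up to phase) vectors score 1, orthogonal vectors score 0, so the attention is insensitive to sign, unlike a dot product.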

Key words: quantum computing, machine learning, deepfake, attention module, speech detection


