Anti-spoofing speaker verification method utilizing global-local feature dependency

doi:10.11772/j.issn.1001-9081.2023121877

Journal of Computer Applications (official website)

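Jialin ZHANG¹, Qinghua REN¹, Qirong MAO²

1. Jiangsu University
2. School of Computer Science and Communication Engineering, Jiangsu University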

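Received: 2024-01-08 Revised: 2024-02-27 Online: 2024-04-01 Published: 2024-04-01
Corresponding author: Qirong MAO
Funding:
Research and development of key technologies for intelligent voice interaction in complex environments, supporting dialects and emotions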

Abstract

To address the problem that existing anti-spoofing speaker verification methods, which are mainly based on convolutional models, do not capture global feature dependencies well, an anti-spoofing speaker verification method utilizing global-local feature dependency is proposed. First, for the spoofing speech detection module, two filter combinations are designed to filter the original speech, and sample augmentation is achieved by masking frequency sub-bands. Second, a multidimensional global attention mechanism is proposed: the channel, frequency, and time dimensions are pooled separately to obtain the global dependencies of each dimension, and this global information is fused with the original features by weighting. Finally, for the speaker verification part, a Statistical Pyramid Dense Time Delay Neural Network (SPD-TDNN) is introduced, which computes the standard deviation of the features to add global information while extracting multi-scale time-frequency features. The results show that, compared with Audio Anti-Spoofing using Integrated Spectro-Temporal graph attention network (AASIST), the proposed spoofing speech detection method reduces the equal error rate by 53% on the ASVspoof2019 dataset. Compared with the statistical pyramid dense time delay neural network method used alone, the proposed anti-spoofing speaker verification method reduces the equal error rate by 23 percentage points. These results verify that the proposed method achieves better classification with the help of global feature dependency.
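The sub-band masking augmentation described in the abstract can be illustrated with a minimal sketch. The abstract does not specify the two filter combinations, so the version below swaps in an ideal FFT-domain band-stop as a stand-in. The function name `mask_subband`, the band edges, and the test tones are all hypothetical and are not taken from the paper.

```python
import numpy as np


def mask_subband(wave, sample_rate, f_low, f_high):
    """Return a copy of `wave` with the band [f_low, f_high] Hz zeroed out.

    Assumption: an ideal FFT-domain band-stop stands in for the paper's
    filter combinations, which the abstract does not describe.
    """
    spectrum = np.fft.rfft(wave)
    # Frequency (in Hz) of each rfft bin.
    freqs = np.fft.rfftfreq(len(wave), d=1.0 / sample_rate)
    band = (freqs >= f_low) & (freqs <= f_high)
    spectrum[band] = 0.0
    # Pass n explicitly so the output length matches the input length.
    return np.fft.irfft(spectrum, n=len(wave))


# Hypothetical example: one second of audio with a 440 Hz and a 3000 Hz tone.
sample_rate = 16000
t = np.arange(sample_rate) / sample_rate
low_tone = np.sin(2 * np.pi * 440 * t)
high_tone = np.sin(2 * np.pi * 3000 * t)
augmented = mask_subband(low_tone + high_tone, sample_rate, 2500, 3500)
```

In this toy case the 3000 Hz tone falls inside the masked band and is removed, while the 440 Hz tone is left intact. In a training pipeline, masked copies like `augmented` would presumably be used alongside the original utterances.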

Keywords: speaker verification, data augmentation, frequency masking, attention mechanism, synthetic speech detection
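The multidimensional global attention idea from the abstract (pool the channel, frequency, and time dimensions separately, then fuse the global information with the original features by weighting) might look roughly like the sketch below. Everything beyond that one-sentence description is an assumption: average pooling, parameter-free sigmoid gates, and a residual sum are illustrative choices, and `multi_dim_global_attention` is a hypothetical name. The paper's actual module most likely contains learned layers that are not shown here.

```python
import numpy as np


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


def multi_dim_global_attention(features):
    """Re-weight a (channels, freq_bins, frames) feature map.

    Assumptions (not from the paper): average pooling, no learned
    parameters, and a residual sum as the fusion step.
    """
    # Pool over the other two axes: one global descriptor per index.
    channel_desc = features.mean(axis=(1, 2), keepdims=True)  # (C, 1, 1)
    freq_desc = features.mean(axis=(0, 2), keepdims=True)     # (1, F, 1)
    time_desc = features.mean(axis=(0, 1), keepdims=True)     # (1, 1, T)

    # Squash each descriptor into a (0, 1) gate; broadcasting the
    # product yields one weight per (channel, freq, frame) position.
    weights = sigmoid(channel_desc) * sigmoid(freq_desc) * sigmoid(time_desc)

    # Fuse the global weighting with the original (local) features.
    return features + features * weights


# Hypothetical input: 2 channels, 3 frequency bins, 4 frames.
example = np.ones((2, 3, 4))
fused = multi_dim_global_attention(example)
```

The output keeps the input shape, so a block like this could sit between convolutional layers without changing the rest of the network.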

CLC number: TN912.34 (sound recognition and devices)

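Jialin ZHANG, Qinghua REN, Qirong MAO. Anti-spoofing speaker verification method utilizing global-local feature dependency[J]. Journal of Computer Applications, DOI: 10.11772/j.issn.1001-9081.2023121877.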
