DPCS2017+8+Source Cell-phone Identification Based on Spectral Fusion Feature of Recorded Speech

An-Shan PEI¹, Rang-Ding WANG¹, Di-Qun YAN²

1. College of Information Science and Engineering, Ningbo University
2. Ningbo University

Received: 2017-07-28 Revised: 2017-08-10 Online: 2017-08-10
Contact: An-Shan PEI

Abstract: With the spread of cell-phone recording and of powerful, easy-to-use digital media editing software, identifying the source cell-phone of a speech recording has become an important problem in multimedia forensics. To address it, a source cell-phone identification algorithm based on spectral fusion features is proposed. First, spectrograms of the same speech recorded by different cell-phones are analyzed, showing that the spectral characteristics of the speech differ from phone to phone. Then three spectral features of the speech are studied: spectral information quantity, logarithmic spectrum, and phase spectrum. Next, the three features are concatenated to form the original fusion feature, and the original fusion features of all samples are used to build the sample feature space. Finally, the CfsSubsetEval evaluator of the WEKA platform, combined with best-first search, is used to select features from this space, and LibSVM is used for model training and sample identification on the selected features. The experiments report classification results for the selected single spectral features and the selected spectral fusion feature on speech databases covering 23 popular cell-phone models. The results show that the proposed spectral fusion feature effectively improves identification accuracy among phones of the same brand, with average identification accuracies of 99.96% on the TIMIT database and 99.91% on the self-built CKC-SD database. Compared with Hanilci's recording-device source identification algorithm based on Mel-frequency cepstral coefficients, the average identification accuracy improves by 6.58% and 5.14%, respectively. The proposed fusion feature therefore raises the average accuracy of source cell-phone identification and effectively reduces the misclassification rate among phones of the same brand.

Key words: audio forensics, source cell-phone identification, spectral fusion feature, feature selection, average recognition accuracy
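The feature-fusion step summarized in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the frame size, hop, naive DFT, and averaging over frames are illustrative choices, the function names are hypothetical, and the "spectral information quantity" is assumed here to be the per-bin entropy contribution -p log2 p of the normalized power spectrum, which may differ from the paper's definition. Feature selection (CfsSubsetEval) and LibSVM training are not shown.

```python
import cmath
import math


def split_frames(signal, size, hop):
    """Split a signal into overlapping frames of `size` samples."""
    return [signal[i:i + size] for i in range(0, len(signal) - size + 1, hop)]


def dft(frame):
    """Naive DFT, keeping only the non-negative frequency bins 0..N/2."""
    n = len(frame)
    return [
        sum(x * cmath.exp(-2j * math.pi * k * t / n) for t, x in enumerate(frame))
        for k in range(n // 2 + 1)
    ]


def fusion_feature(signal, size=16, hop=8, eps=1e-12):
    """Concatenate three per-bin spectral features averaged over all frames.

    Layout of the returned list (each part has size // 2 + 1 values):
    [log-amplitude spectrum | phase spectrum | assumed information quantity]
    """
    spectra = [dft(f) for f in split_frames(signal, size, hop)]
    if not spectra:
        raise ValueError("signal is shorter than one frame")
    n_frames = len(spectra)
    bins = size // 2 + 1

    # Mean log-amplitude per frequency bin (eps avoids log(0)).
    log_spec = [
        sum(math.log(abs(s[k]) + eps) for s in spectra) / n_frames
        for k in range(bins)
    ]

    # Mean phase per bin; plain averaging ignores phase wrapping (a simplification).
    phase = [
        sum(cmath.phase(s[k]) for s in spectra) / n_frames for k in range(bins)
    ]

    # Assumed information quantity: -p*log2(p) per bin, with p the bin's share
    # of the frame's total power, averaged over frames.
    info = [0.0] * bins
    for s in spectra:
        power = [abs(c) ** 2 for c in s]
        total = sum(power)
        for k in range(bins):
            p = power[k] / total if total > 0 else 0.0
            if p > 0:
                info[k] += -p * math.log2(p) / n_frames

    # Serial concatenation gives the "original fusion feature" of one sample.
    return log_spec + phase + info


if __name__ == "__main__":
    # Hypothetical input: a pure tone that falls exactly on DFT bin 2.
    tone = [math.sin(2 * math.pi * 2 * t / 16) for t in range(64)]
    print(len(fusion_feature(tone)))
```

Stacking one such vector per recording would give a sample feature space of the kind the abstract describes, on which a feature selector and a classifier could then be run.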

CLC Number: TP391

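An-Shan PEI, Rang-Ding WANG, Di-Qun YAN. DPCS2017+8+Source Cell-phone Identification Based on Spectral Fusion Feature of Recorded Speech[J]. Journal of Computer Applications.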
