• •    

DPCS2017+8+基于语音频谱融合特征的手机来源识别

裴安山1,王让定1,严迪群2   

  1. 1. 宁波大学信息科学与工程学院
    2. 宁波大学
  • 收稿日期:2017-07-28 修回日期:2017-08-10 发布日期:2017-08-10
  • 通讯作者: 裴安山

DPCS2017+8+Source Cell-phone Identification Based on Spectral Fusion Feature of Recorded Speech

  • Received:2017-07-28 Revised:2017-08-10 Online:2017-08-10
  • Contact: An-Shan PEI

摘要: 摘 要: 随着手机录音设备的普及以及各种功能强大且易于操作的数字媒体编辑软件的出现,语音的手机来源识别已成为多媒体取证领域重要的热点问题,针对该问题提出了一种基于频谱融合特征的手机来源识别算法。首先通过分析不同手机相同语音的语谱图,发现不同手机的语音频谱特征是不同的;然后对语音的频谱信息量、对数谱和相位谱特征进行了研究;其次将三个特征串联构成原始融合特征,并用每个样本的原始融合特征构建样本特征空间;最后采用WEKA 平台的CfsSubsetEval评价函数按照最佳优先搜索原则对所构建的特征空间进行特征选择,并采用LibSVM对特征选择后的样本特征空间进行模型训练和样本识别。实验部分给出了特征选择后的频谱单一特征和频谱融合特征在23款主流型号的手机语音库上分类的结果,结果表明该算法所提频谱融合特征有效提高了手机品牌类内的识别准确率,在TIMIT数据库和研究所自建的CKC-SD数据库上平均识别准确率分别达到99.96%和99.91%,另外,与Hanilci基于梅尔倒谱系数特征的录音设备来源识别算法进行了对比,平均识别准确率分别提高了6.58%和5.14%。因此可得本文算法所提融合特征能提高手机来源识别的平均识别准确率,有效降低手机类内识别的误判率。

关键词: 关键词: 关键词: 多媒体取证, 手机来源识别, 频谱融合特征, 特征选择, 平均识别准确率

Abstract: With the popularity of cell-phone recording devices and the availability of various powerful and easy to operate digital media editing software, source cell-phone identification has become a hot topic in multimedia forensics, a cell-phone source recognition algorithm based on spectral fusion features is proposed to solve this problem . First, the same speech spectrogram of the different cell-phone is analyzed, found that the speech spectrum characteristics of different cell-phone is different; then the speech spectral logarithmic spectrum, phase spectrum and information quantity characteristics are researched; Secondly, three features are connected in series to form the original fusion feature, and the sample feature space is constructed with the original fusion feature of each sample; finally, the CfsSubsetEval evaluation function of WEKA platform is selected according to the best priority search method to select feature, and LibSVM is used to model training and recognition of the sample after feature selection. Twenty-three popular models of the cell-phone are evaluated in the experiment, the results show that the proposed spectral fusion feature has better identification accuracy in cell-phone brands than spectral single feature and the average recognition rates achieved 99.96% and 99.91% on the TIMIT database and CKC-SD database. In addition, it is compared with the source identification algorithm of Hanilci based on Mel frequency cepstral coefficients, the average recognition accuracy was improved by 6.58% and 5.14%. Therefore, the proposed algorithm can improve the average recognition accuracy of cell-phone source identification, and effectively reduce the false positives rate of cell-phone identification.

Key words: audio forensics, source cell-phone identification, spectral fusion feature, feature selection, average recognition accuracy

中图分类号: