计算机应用 ›› 2012, Vol. 32 ›› Issue (09): 2542-2544.DOI: 10.3724/SP.J.1087.2012.02542

• 人工智能 • 上一篇    下一篇

基于梅尔频率倒谱系数与翻转梅尔频率倒谱系数的说话人识别方法

胡峰松,张璇*   

  1. 湖南大学 信息科学与工程学院,长沙 410082
  • 收稿日期:2012-03-13 修回日期:2012-06-04 发布日期:2012-09-01 出版日期:2012-09-01
  • 通讯作者: 张璇
  • 作者简介:胡峰松(1969-),男,湖南长沙人,副教授,博士,主要研究方向:数字图像处理、说话人识别; 张璇(1988-),女,湖南张家界人,硕士研究生,主要研究方向:说话人识别。

Speaker recognition method based on Mel frequency cepstrum coefficient and inverted Mel frequency cepstrum coefficient

HU Feng-song,ZHANG Xuan*   

  1. College of Information Science and Engineering,Hunan University,Changsha Hunan 410082,China
  • Received:2012-03-13 Revised:2012-06-04 Online:2012-09-01 Published:2012-09-01
  • Contact: Xuan ZHANG

摘要: 为提高说话人识别系统的识别率,提出了基于梅尔频率倒谱系数(MFCC)与翻转梅尔频率倒谱系数(IMFCC)为特征参数的特征提取新方法。该方法利用Fisher准则将MFCC和IMFCC相结合,构造了一种混合特征参数。实验结果表明,新的混合特征参数与MFCC相比,在纯净语音库及噪声环境中均具有较好的识别性能。

关键词: 说话人识别, 梅尔频率倒谱系数, 翻转梅尔频率倒谱系数, Fisher准则, 高斯混合模型

Abstract: To improve the performance of speaker recognition system, a new method of feature extraction was proposed based on Mel Frequency Cepstrum Coefficient (MFCC) and Inverted MFCC (IMFCC). This method constructed a mixed feature by combining MFCC with IMFCC using Fisher criterion. The experimental results show that the mixed feature proposed in this paper has better recognition performance compared with MFCC not only in the pure voice database but also in the noisy environments.

Key words: speaker recognition, Mel Frequency Cepstrum Coefficient (MFCC), Inverted MFCC (IMFCC), Fisher criterion, Gaussian Mixture Model (GMM)

中图分类号: