计算机应用

• 智能感知与模式识别 • 上一篇    下一篇

混合窗函数和子带频谱质心在低信噪比语音识别中的应用

赵欢 张林 陈珍文   

  1. 湖南大学 湖南长沙湖南大学计算机与通信学院
  • 收稿日期:2008-09-02 修回日期:2008-10-15 发布日期:2009-04-22 出版日期:2009-02-01
  • 通讯作者: 张林

Using mixed window function and subband spectrum centroid in MFCC feature extraction process

<a href="http://www.joca.cn/EN/article/advancedSearchResult.do?searchSQL=((([Author]) AND 1[Journal]) AND year[Order])" target="_blank"></a>H<a href="http://www.joca.cn/EN/article/advancedSearchResult.do?searchSQL=((([Author]) AND 1[Journal]) AND year[Order])" target="_blank"></a>u<a href="http://www.joca.cn/EN/article/advancedSearchResult.do?searchSQL=((([Author]) AND 1[Journal]) AND year[Order])" target="_blank"></a>a<a href="http://www.joca.cn/EN/article/advancedSearchResult.do?searchSQL=((([Author]) AND 1[Journal]) AND year[Order])" target="_blank"></a>n<a href="http://www.joca.cn/EN/article/advancedSearchResult.do?searchSQL=((([Author]) AND 1[Journal]) AND year[Order])" target="_blank"></a> <a href="http://www.joca.cn/EN/article/advancedSearchResult.do?searchSQL=((([Author]) AND 1[Journal]) AND year[Order])" target="_blank"></a>Z<a href="http://www.joca.cn/EN/article/advancedSearchResult.do?searchSQL=((([Author]) AND 1[Journal]) AND year[Order])" target="_blank"></a>h<a href="http://www.joca.cn/EN/article/advancedSearchResult.do?searchSQL=((([Author]) AND 1[Journal]) AND year[Order])" target="_blank"></a>a<a href="http://www.joca.cn/EN/article/advancedSearchResult.do?searchSQL=((([Author]) AND 1[Journal]) AND year[Order])" target="_blank"></a>o<a href="http://www.joca.cn/EN/article/advancedSearchResult.do?searchSQL=((([Author]) AND 1[Journal]) AND year[Order])" target="_blank"></a> <a href="http://www.joca.cn/EN/article/advancedSearchResult.do?searchSQL=((([Author]) AND 1[Journal]) AND year[Order])" target="_blank"></a>L<a href="http://www.joca.cn/EN/article/advancedSearchResult.do?searchSQL=((([Author]) AND 1[Journal]) AND year[Order])" target="_blank"></a>i<a href="http://www.joca.cn/EN/article/advancedSearchResult.do?searchSQL=((([Author]) AND 1[Journal]) AND year[Order])" target="_blank"></a>n<a href="http://www.joca.cn/EN/article/advancedSearchResult.do?searchSQL=((([Author]) AND 1[Journal]) AND year[Order])" target="_blank"></a> <a href="http://www.joca.cn/EN/article/advancedSearchResult.do?searchSQL=((([Author]) AND 1[Journal]) AND year[Order])" target="_blank"></a>Z<a href="http://www.joca.cn/EN/article/advancedSearchResult.do?searchSQL=((([Author]) AND 1[Journal]) AND year[Order])" target="_blank"></a>h<a href="http://www.joca.cn/EN/article/advancedSearchResult.do?searchSQL=((([Author]) AND 1[Journal]) AND year[Order])" target="_blank"></a>a<a href="http://www.joca.cn/EN/article/advancedSearchResult.do?searchSQL=((([Author]) AND 1[Journal]) AND year[Order])" target="_blank"></a>n<a href="http://www.joca.cn/EN/article/advancedSearchResult.do?searchSQL=((([Author]) AND 1[Journal]) AND year[Order])" target="_blank"></a>g<a href="http://www.joca.cn/EN/article/advancedSearchResult.do?searchSQL=((([Author]) AND 1[Journal]) AND year[Order])" target="_blank"></a> <a href="http://www.joca.cn/EN/article/advancedSearchResult.do?searchSQL=((([Author]) AND 1[Journal]) AND year[Order])" target="_blank"></a>   

  • Received:2008-09-02 Revised:2008-10-15 Online:2009-04-22 Published:2009-02-01
  • Contact: Lin Zhang

摘要: 为改善低信噪比环境下语音的质量,在传统MFCC特征提取的基础上,提出了两种提高识别系统鲁棒性的方法。一种方法利用混合窗函数对旁瓣的抑制来提高系统的鲁棒性;另一种方法是基于频谱峰值位置受背景噪声影响相对较小,将子带幅度信息和Mel子带频谱质心(MSSC)相结合。实验表明混合窗函数和子带频谱质心(MSSC)以及它们相结合的系统与使用传统MFCC的基准系统相比,在低信噪比的平稳噪声环境下系统的鲁棒性得到了一定的提高。

关键词: 语音识别, MFCC, 低信噪比, 子带频谱质心

Abstract: In order to improve the quality of speech in low SNR, two methods were proposed to improve the robustness of the system in this paper based on the traditional MFCC feature extraction. One is to use the side lobe suppression of mixed window function to improve the robustness of system; the other is to incorporate subband amplitude information with Mel-subband spectrum centroid(MSSC) because spectral peak position remains practically unaffected in the presence of background noise. Experimental results show that mixed window function and MSSC and their combination system could improve the robustness of system compared to the benchmark system based on traditional MFCC in the low SNR of stationary noises.

Key words: speech recognition, MFCC, low signal to noise ratio, subband spectrum centriod