计算机应用 ›› 2011, Vol. 31 ›› Issue (12): 3441-3445.

• 典型应用 • 上一篇    下一篇

脉冲噪声环境下基于卡尔曼滤波的语音增强

何志勇1,朱忠奎2   

  1. 1. 苏州大学 机电工程学院,江苏 苏州 215021
    2. 苏州大学 城市轨道交通学院,江苏 苏州 215021
  • 收稿日期:2011-06-21 修回日期:2011-07-31 发布日期:2011-12-12 出版日期:2011-12-01
  • 通讯作者: 何志勇
  • 基金资助:
    国家自然科学基金资助项目;江苏省高校自然科学基金资助项目

Removal of impulsive noise based on Kalman filtering for speech enhancement

HE Zhi-yong1,ZHU Zhong-kui2   

  1. 1. School of Mechanical and Electrical Engineering, Soochow University, Suzhou Jiangsu 215021, China
    2. School of Urban Rail Transportation, Soochow University, Suzhou Jiangsu 215021, China
  • Received:2011-06-21 Revised:2011-07-31 Online:2011-12-12 Published:2011-12-01
  • Contact: HE Zhi-yong

摘要: 语音增强的目标在于从含噪信号中提取纯净语音,纯净语音在某些环境下会被脉冲噪声所污染,但脉冲噪声的时域分布特征却给语音增强带来困难,使传统方法在脉冲噪声环境下难以取得满意效果。为在平稳脉冲噪声环境下进行语音增强,提出了一种新方法。该方法通过计算确定脉冲噪声样本的能量与含噪信号样本的能量之比最大的频段,利用该频段能量分布情况逐帧判别语音信号是否被脉冲噪声所污染。进一步地,该方法只在被脉冲噪声污染的帧应用卡尔曼滤波算法去噪,并改进了传统算法执行时的自回归(AR)模型参数估计过程。实验中,采用白色脉冲噪声以及有色脉冲噪声污染语音信号,并对低输入信噪比的信号进行语音增强,结果表明所提出的算法能显著地改善信噪比和抑制脉冲噪声。

关键词: 脉冲噪声, 语音增强, 卡尔曼滤波, AR模型, 滤波器组

Abstract: Speech enhancement aims at extracting clean speech from noisy speech. Traditional speech enhancement algorithms proposed for continuous noise can not work well when clean speech is polluted by impulsive noise. The feature of the distribution of impulsive noise in time domain has to be concerned for speech enhancement. This paper provided a new speech enhancement method for the removal of noise when clean speech is polluted by stationary impulsive noise. By analyzing the samples of impulsive noise and noisy speech, the method found the frequency band in which the energy of impulsive noise was most evident compared with the energy of noisy speech. Furthermore, filter banks were used in recognizing whether the speech frame was polluted. The method based on Kalman filtering removed impulsive noise from the polluted signal frame. Compared with the traditional algorithms based on Kalman filtering, the method also improved the process of the parameters estimation of Auto Regressive (AR) model. In some speech enhancement tests, speech was polluted by white impulsive noise or colored impulsive noise, the results show that the method has achieved an improved Signal-to-Noise Ratio (SNR) and removes impulsive noise even when the SNR of noisy speech is low.

Key words: impulse noise, speech enhancement, filter bank