计算机应用 ›› 2010, Vol. 30 ›› Issue (05): 1262-1265.

• 模式识别 • 上一篇    下一篇

基于子频带加权的语音活动检测算法

张玲1,顾彦飞2,何伟1   

  1. 1.
    2. 重庆大学
  • 收稿日期:2009-12-07 修回日期:2009-12-31 发布日期:2010-05-04 出版日期:2010-05-01
  • 通讯作者: 顾彦飞

Voice activity detection algorithm based on sub-band weighting

  • Received:2009-12-07 Revised:2009-12-31 Online:2010-05-04 Published:2010-05-01

摘要: 为了降低噪声及决策导向(DD)参数估计算法的帧延迟特性对语音活动检测(VAD)算法鲁棒性的影响,首先采用两步降噪(TSNR)技术估计算法提高语音瞬变时刻参数估计准确性,并针对语音噪声的频率选择性,通过频带分割,将噪声污染限制到孤立子频带中,构建了由子频带特征与可靠性因子结合提供判别结果的子频带加权VAD算法。实验表明,此子频带加权算法优于Sohn算法、Cho算法以及G.729B等全频带算法。

关键词: 统计模型, 语音活动检测, 两步降噪技术, 子频带, 决策导向

Abstract: To reduce the influence of noise and the frame delay of Decision-Directed (DD) parameter estimation algorithm on the robustness of Voice Activity Detection (VAD), Two-Step Noise Reduction (TSNR) technique was used to track the priori SNR of the current frame to improve the estimation accuracy of parameters in speech transient. Then, based on the frequency-selective property of noise signals, frequency spectrum was split into sub-bands to isolate noise corruption within particular sub-bands, so that the sub-bands weighting algorithm, in which sub-bands features were combined with their reliability to provide judgment, was constructed. The experiments show that the sub-band weighting method is superior to those full-band ones such as Sohn, Cho and G.729 B.

Key words: statistic model, Voice Activity Detection (VAD), Two-Step Noise Reduction (TSNR), sub-band, Decision-Directed (DD)