计算机应用 ›› 2013, Vol. 33 ›› Issue (01): 175-178.DOI: 10.3724/SP.J.1087.2013.00175

• 人工智能 • 上一篇    下一篇

基于临界频带及能量熵的语音端点检测

张婷1,何凌1,黄华1,刘肖珩2   

  1. 1. 四川大学 电气信息学院, 成都 610065
    2. 四川大学 华西基础医学与法医学院, 成都 610041
  • 收稿日期:2012-07-26 修回日期:2012-08-22 出版日期:2013-01-01 发布日期:2013-01-09
  • 通讯作者: 何凌
  • 作者简介:张婷(1987-),女,甘肃酒泉人,硕士研究生,主要研究方向:语音信号处理、医学电子学;何凌(1981-),女,四川成都人,讲师,博士,主要研究方向:语音信号处理;黄华(1961-),男,四川成都人,教授,博士生导师,博士,主要研究方向:医学电子学、医学信息工程;刘肖珩(1968-),男,四川成都人,教授,博士生导师,博士,主要研究方向:生物医学工程。
  • 基金资助:

    国家自然科学基金资助项目(40975015, 41275041)

Speech endpoint detection based on critical band and energy entropy

ZHANG Ting1,HE Ling1,HUANG Hua1,LIU Xiaoheng2   

  1. 1. School of Electrical Engineering and Information, Sichuan University, Chengdu Sichuan 610065, China
    2. College of Basic and Forensic Medicine, Sichuan University, Chengdu Sichuan 610041, China
  • Received:2012-07-26 Revised:2012-08-22 Online:2013-01-01 Published:2013-01-09
  • Contact: HE Ling

摘要: 语音端点检测的准确性直接关系着语音识别、合成、增强等语音领域的准确性,为了提高语音端点检测的有效性,提出了一种基于临界频带及能量熵的语音端点检测算法。算法充分利用人耳听觉特性的频率分布,将含噪语音信号进行临界频带划分,并结合各频带内信号的能量熵值在语音段和噪声段的不同分布,实现不同背景噪声下语音端点检测。实验结果表明,提出的语音端点检测算法与传统的短时能量法相比,检测正确率平均高1.6个百分点。所提方法在不同噪声的低信噪比(SNR)环境下均能实现语音端点检测。

关键词: 小波降噪, 临界频带, 能量熵, 语音端点检测

Abstract: The accuracy of the speech endpoint detection has a direct impact on the precision of speech recognition, synthesis, enhancement, etc. To improve the effectiveness of speech endpoint detection, an algorithm based on critical band and energy entropy was proposed. It took full advantage of the frequency distribution of human auditory characteristics, and divided the speech signals according to critical bands. Combined with the different distribution of energy entropy of each critical band of the signals respectively in the speech segments and noise segments, speech endpoint detection under different background noises was completed. The experimental results indicate that the average accuracy of the newly proposed algorithm is 1.6% higher than the traditional short-time energy algorithm. The proposed method can achieve the detection of speech endpoint under various noise environment of low Signal to Noise Ratio (SNR).

Key words: wavelet denoising, critical band, energy entropy, speech endpoint detection

中图分类号: