计算机应用 ›› 2013, Vol. 33 ›› Issue (10): 2945-2949.

• 多媒体技术 • 上一篇    下一篇

基于能量检测的复杂环境下的鸟鸣识别

张小霞,李应   

  1. 福州大学 数学与计算机科学学院,福州 350108
  • 收稿日期:2013-04-28 修回日期:2013-06-17 出版日期:2013-10-01 发布日期:2013-11-01
  • 通讯作者: 张小霞
  • 作者简介: 
    张小霞(1988-),女,福建南平人,硕士研究生,主要研究方向:声音识别;李应(1964-),男,福建福州人,教授,博士,主要研究方向:声音识别、多媒体数据检索、信息安全。
  • 基金资助:
    国家自然科学基金资助项目

Bird sounds recognition based on energy detection in complex environments

ZHANG Xiaoxia,LI Ying   

  1. College of Mathematics and Computer Science, Fuzhou University, Fuzhou Fujian 350108, China
  • Received:2013-04-28 Revised:2013-06-17 Online:2013-11-01 Published:2013-10-01
  • Contact: ZHANG Xiaoxia

摘要: 针对实际环境噪声使得鸟鸣识别准确率受到影响的问题,提出一种基于能量检测的抗噪鸟鸣识别方法。首先,对包含有噪声的鸟鸣信号用能量检测方法检测并筛选出有用鸟鸣信号;其次,根据梅尔尺度的分布,对有用鸟鸣信号提取小波包分解子带倒谱系数(WPSCC)特征;最后,用支持向量机(SVM)分类器分别对提取的小波包分解子带倒谱系数(WPSCC)和梅尔频率倒谱系数(MFCC)特征进行建模分类识别。同时还对比了在添加不同信噪比的噪声下15类鸟鸣在能量检测前后的识别性能差异。实验结果表明,提取的WPSCC特征具有较好的抗噪功能,且经过能量检测后的识别性能更佳,更适用于复杂环境下的鸟鸣识别

关键词: 能量检测, 小波包分解子带倒谱系数, 梅尔频率倒谱系数, 支持向量机, 鸟鸣识别

Abstract: For the purpose of improving the recognition accuracy of bird sounds in various kinds of noisy environments in real world, a new bird sounds recognition approach based on energy detection was proposed. First of all, the useful bird sound signals were detected and selected by the method of energy detection from the bird sounds with noises. Secondly, according to the distribution of Mel scale, the feature of Wavelet Packet decomposition Subband Cepstral Coefficient (WPSCC) was extracted from the useful signals. Finally, the classifier of Support Vector Machine (SVM) was applied to model on the WPSCC and Mel-Frequency Cepstral Coefficient (MFCC) respectively for classification and identification. Meanwhile, the comparisons of recognition performance difference were implemented on 15 kinds of bird sounds at different Signal-to-Noise Ratio (SNR) in different noises, before or after energy detection. The experimental results show that WPSCC has better noise immunity function, and the recognition performance after energy detection can be greatly improved, which means it is more suitable for the bird sounds recognition in complex environments.

Key words: energy detection, Wavelet Packet decomposition Subband Cepstral Coefficient (WPSCC), Mel-Frequency Cepstral Coefficient (MFCC), Support Vector Machine (SVM), bird sounds recognition

中图分类号: