计算机应用 ›› 2009, Vol. 29 ›› Issue (05): 1419-1422.

• 模式识别 • 上一篇    下一篇

基于单类支持向量机的音频分类

颜景斌1,吴石2,伊戈尔.艾杜阿尔达维奇3   

  1. 1. 哈尔滨理工大学
    2. 哈尔滨理工大学 电气与电子工程学院
    3. 白俄罗斯国立大学 无线电物理与电子系
  • 收稿日期:2008-11-14 修回日期:2009-01-14 发布日期:2009-06-09 出版日期:2009-05-01
  • 通讯作者: 颜景斌
  • 基金资助:
    白俄罗斯国立大学科学技术中心基金(2006-B-1375)

Audio classification based on one-class SVM

  • Received:2008-11-14 Revised:2009-01-14 Online:2009-06-09 Published:2009-05-01
  • Contact: Jing-Bin YAN

摘要: 研究一种基于单类支持向量机的音频分类方法,能够使每一类样本都独立地获得一个决策函数,通过决策函数的最大值来判断样本所属的类。通过使用小波包变换提取语音特征向量,并融合多特征向量,将音频分为5类:纯语音、音乐、环境音、含背景音语音和静音。实验结果表明这种方法具有较好的分类精度,性能优于贝叶斯、隐马尔可夫模型和神经网络分类器。

关键词: 单类支持向量机, 音频分类, 特征提取, 小波, One-Class Support Vector Machine (OCSVM), audio classification, character extraction

Abstract: The author studied an audio classification method based on One-Class Support Vector Machine (OCSVM), which could form a decision function for every single class sample and accordingly obtain the aim of classification based on maximum of decision function. By employing wavelet packed transformation to extract features of audio and integrating multiple features, five audio classes were made: pure speech, music, environmental sound, speech over background and silence. Experimental results show that OCSVM has better classification accuracy, and performs better than the other classification systems using the Bayes, Hidden Markov Model (HMM) and Neural Network (NN).

Key words: wavelet

中图分类号: