Journal of Computer Applications

• Graphics and image processing • Previous Articles     Next Articles

Audio retrieval with frame coefficients of wavelet packet best base and pyramidal algorithm

LI Ying   

  • Received:2007-10-30 Revised:2007-12-18 Online:2008-04-01 Published:2008-04-01
  • Contact: LI Ying

用小波包最好基结构系数和塔型算法检索音频数据

李应   

  1. 福州大学 数学与计算机科学学院
  • 通讯作者: 李应

Abstract: To solve the problem of query-by-example in multimedia audio data, the characteristics of wavelet multiresolution, wavelet packet transform and its best base were analyzed. A method for audio retrieval was proposed using wavelet frame coefficients of packet best base and wavelet multiresolution pyramidal algorithm. First, audio data files were prepocessed by transforming them into frame coefficients of best base and wavelet coefficient files with audio data. And then elementary classification for these files was carried out using frame coefficients of best base, and after that these files were searched using the different hierarchy pyramidal algorithms. By comparing our method with the method using different level wavelet approximate coefficient algorithm, it is found that our method is highly efficient and reduces the searching time without influencing the retrieval precision.

Key words: audio retrieval, best base, wavelet transform, pyramidal algorithm

摘要: 提出一种用小波包最好基结构系数和多分辨塔型算法检索音频数据的方法。这种方法首先对音频数据文件进行预处理,即把音频原数据文件变换成小波包最好基结构系数和小波不同级多分辨分析系数;最后用最好基结构系数对这些文件进行初步分类;最后再用塔型算法进行不同层次的检索。把这种方法与使用不同级小波逼近系数算法比较,结果表明这种方法对音频数据文件检索是有效的。

关键词: 音频数据检索, 最好基, 小波变换, 塔形算法