计算机应用

• 智能感知与模式识别(Intelligence percepti • 上一篇    下一篇

基于混合微粒群算法的说话人识别

许允喜 陈方   

  1. 湖州师范学院 南京航天航空大学
  • 收稿日期:2007-12-24 修回日期:2008-01-28 发布日期:2008-06-01 出版日期:2008-06-01
  • 通讯作者: 许允喜

Speaker recognition based on hybrid particle swarm optimization algorithm

Xun-xi XU Fang CHENG   

  • Received:2007-12-24 Revised:2008-01-28 Online:2008-06-01 Published:2008-06-01
  • Contact: Xun-xi XU

摘要: 为了解决传统高斯混合模型(GMM)对初值敏感,在实际训练中极易得到局部最优参数的问题,提出了一种采用微粒群算法优化GMM参数的新方法。该方法将最大似然估计融入到微粒群算法迭代过程中,形成了新的混合算法。它利用微粒群算法的全局优化性及最大似然估计的局部寻优性求解高斯混合模型的参数,以提高参数精度。说话人辨认实验表明,与传统的方法相比,新方法可以得到更优的模型参数,使得系统的识别率进一步提高。

关键词: 说话人识别, 微粒群算法, 高斯混合模型

Abstract: The traditional training methods of Gaussian Mixture Model (GMM) are sensitive to the initial model parameters, which often leads to a local optimal parameter in practice. To resolve this problem, a new GMM optimization method was proposed based on Particle Swarm Optimization (PSO). It utilized Maximum Likelihood (ML) algorithm in the PSO iteration and provided a new architecture of hybrid algorithm. Because of the global optimization characteristic of the particle swarm optimizer method and the strong local searching capacity of ML, it can obtain model parameters with high precision. Experiment for text-independent speaker identification shows that this method can obtain more optimum GMM parameters and better results than the traditional method.

Key words: speaker identification, Particle Swarm Optimization (PSO), Gaussian Mixture Model (GMM)