Journal of Computer Applications

• Artificial intelligence

Research and implementation of parallel speech recognition based on HTK

Yong-jin LIU, Xiao-dong SHI

  1. Department of Computer Science, School of Information Science and Technology, Xiamen University, Xiamen, Fujian
  • Received: 2008-10-10 Revised: 2008-12-08 Online: 2009-04-01 Published: 2009-04-01
  • Corresponding author: Yong-jin LIU

Abstract: The speech recognition process is analyzed in detail, the corresponding algorithm is described, and the feasibility of parallelizing it is examined. Parallel computing is then applied to the recognition algorithm: using multithreading, together with a mechanism that avoids race conditions, the likelihoods of the Hidden Markov Model (HMM) nodes are computed in parallel on a multi-core computer, which yields a parallel speech recognition algorithm. The performance of this parallel algorithm is analyzed, and the algorithm is implemented in the speech recognition toolkit HTK 3.4. Experiments on the WSJ0 corpus show that the parallel algorithm effectively improves the real-time performance of speech recognition without affecting the recognition results.

Key words: multi-core, Hidden Markov Model Toolkit (HTK), parallel computing, speech recognition
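
The abstract does not give the algorithm itself, so the following is only a minimal sketch of the general idea it describes: scoring HMM states against one observation frame on several threads while avoiding race conditions. It is not the authors' HTK implementation. All names here (`log_gaussian`, `parallel_state_loglikes`, `serial_state_loglikes`) are hypothetical, and each state is assumed to be a single diagonal-covariance Gaussian, which is simpler than the mixture models HTK normally uses.

```python
import math
import threading
from concurrent.futures import ThreadPoolExecutor


def log_gaussian(x, mean, var):
    """Log density of a diagonal-covariance Gaussian at observation x."""
    return sum(
        -0.5 * (math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v)
        for xi, m, v in zip(x, mean, var)
    )


def parallel_state_loglikes(obs, states, n_workers=4):
    """Score one observation frame against every state using worker threads.

    states: list of (mean, var) pairs, one assumed single-Gaussian state each.
    Returns (scores, best), where best is the maximum score over all states.
    """
    scores = [0.0] * len(states)  # each index is written by exactly one worker
    best = [-math.inf]            # shared value, updated only under best_lock
    best_lock = threading.Lock()

    def work(worker_id):
        local_best = -math.inf
        # Strided partition: worker k handles states k, k + n_workers, ...
        for i in range(worker_id, len(states), n_workers):
            mean, var = states[i]
            scores[i] = log_gaussian(obs, mean, var)
            local_best = max(local_best, scores[i])
        # One short critical section per worker protects the shared maximum.
        with best_lock:
            if local_best > best[0]:
                best[0] = local_best

    with ThreadPoolExecutor(max_workers=n_workers) as pool:
        # list() forces completion and re-raises any worker exception.
        list(pool.map(work, range(n_workers)))

    return scores, best[0]


def serial_state_loglikes(obs, states):
    """Single-threaded reference used to check the parallel result."""
    scores = [log_gaussian(obs, mean, var) for mean, var in states]
    return scores, max(scores)
```

Two design choices carry the race avoidance in this sketch: the states are partitioned so that no two threads write the same slot of `scores`, and the only truly shared value (the running maximum, of the kind a decoder might use for beam pruning) is updated under a lock once per worker rather than once per state. In CPython the global interpreter lock prevents pure-Python threads from running on several cores at once, so this illustrates the structure rather than the speedup; a C implementation with native threads would be needed to measure real-time gains.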
