Journal of Computer Applications ›› 2005, Vol. 25 ›› Issue (02): 291-293. DOI: 10.3724/SP.J.1087.2005.0291

• Database and Data Mining •

Classification analysis of speech recognition errors

FU Yue-wen (付跃文), DU Li-min (杜利民)

  1. Institute of Acoustics, Chinese Academy of Sciences
  • Online: 2005-02-01 Published: 2005-02-01
  • Supported by:

    National 973 Program of China (G1998030505)

Abstract:

A large vocabulary continuous speech recognition system consists of several components, and its recognition errors are influenced by many factors. System developers therefore need to analyze the different causes of these errors. Starting from the basic theory of speech recognition, this paper derives a principle for classifying errors and assigns each recognition error, according to its cause, to one of four classes: decoding errors, acoustic model errors, language model errors, and combined acoustic and language model errors. A statistical analysis of the classified errors is then performed. Experiments show that classification analysis of recognition errors provides a reference for improving the system.

Key words: large vocabulary continuous speech recognition, recognition errors, classification
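
As a rough illustration of how an error might be assigned to one of the four classes named in the abstract, the sketch below compares the acoustic and language model log-scores of the reference transcript with those of the recognizer's output. The decision rules, the `Scores` type, the `classify_error` function, and the `lm_weight` parameter are all assumptions made for illustration; the abstract does not state the authors' exact criteria.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class Scores:
    """Hypothetical per-utterance scores (names are assumptions, not from the paper)."""
    acoustic: float  # acoustic model log-likelihood, log P(X | W)
    language: float  # language model log-probability, log P(W)

    def total(self, lm_weight: float = 1.0) -> float:
        # Combined log-score the decoder is assumed to maximize.
        return self.acoustic + lm_weight * self.language


def classify_error(ref: Scores, hyp: Scores, lm_weight: float = 1.0) -> str:
    """Assign a misrecognized utterance to one of four assumed error classes.

    ref: scores of the correct (reference) transcript.
    hyp: scores of the incorrect transcript the recognizer produced.
    lm_weight must be positive.
    """
    # The reference scores at least as well as the output, yet the decoder
    # did not return it: attribute the error to the search.
    if ref.total(lm_weight) >= hyp.total(lm_weight):
        return "decoding"

    # Otherwise the models themselves prefer the wrong transcript.
    acoustic_worse = ref.acoustic < hyp.acoustic
    language_worse = ref.language < hyp.language
    if acoustic_worse and language_worse:
        return "acoustic+language"
    if acoustic_worse:
        return "acoustic"
    # Total is lower but the acoustic score is not, so the language score must be.
    return "language"


# Tally classes over a few made-up (reference, hypothesis) score pairs.
examples = [
    (Scores(-100.0, -10.0), Scores(-105.0, -12.0)),
    (Scores(-110.0, -10.0), Scores(-100.0, -12.0)),
    (Scores(-100.0, -20.0), Scores(-101.0, -10.0)),
    (Scores(-110.0, -20.0), Scores(-100.0, -10.0)),
]
counts = Counter(classify_error(ref, hyp) for ref, hyp in examples)
```

With the made-up scores above, each of the four classes is counted once. On real data, a tally of this kind is one way to see which component accounts for the most errors.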

CLC Number: