计算机应用 ›› 2013, Vol. 33 ›› Issue (06): 1742-1745.DOI: 10.3724/SP.J.1087.2013.01742

• 多媒体技术 • 上一篇    下一篇

汉语语音识别中基于音节的声学模型改进算法

晁浩1,2,杨占磊2,刘文举2   

  1. 1. 河南理工大学 计算机科学与技术学院,河南 焦作 454000
    2. 中国科学院自动化研究所 模式识别国家重点实验室,北京100190
  • 收稿日期:2012-12-03 修回日期:2013-01-05 出版日期:2013-06-01 发布日期:2013-06-05
  • 通讯作者: 晁浩
  • 作者简介:晁浩(1981-),男,河南许昌人,讲师,博士,主要研究方向:语音识别;杨占磊(1984-),男,河北石家庄人,助理研究员,博士,主要研究方向:语音识别;刘文举(1960-),男,北京人,研究员,博士生导师,博士,主要研究方向:语音识别、语音增强、计算听觉场景分析。
  • 基金资助:

    国家自然科学基金资助项目(51175135);国家973计划项目(2004CB318105);国家863计划项目(20060101Z4073,2006AA01Z194)

Improved syllable-based acoustic modeling for continuous Chinese speech recognition

CHAO Hao1,2,YANG Zhanlei1,LIU Wenju1   

  1. 1. National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing 100190,China
    2. School of Computer Science and Technology, Henan Polytechnic University, Jiaozuo Henan 454000,China
  • Received:2012-12-03 Revised:2013-01-05 Online:2013-06-05 Published:2013-06-01
  • Contact: CHAO Hao

摘要: 针对汉语语音识别中协同发音现象引起的语音信号的易变性,提出一种基于音节的声学建模方法。首先建立基于音节的声学模型以解决音节内部声韵母之间的音变现象,并提出以音节内双音子模型来初始化基于音节声学模型的参数以缓解训练数据稀疏的问题;然后引入音节之间的过渡模型来处理音节之间的协同发音问题。在“863-test”测试集上进行的汉语连续语音识别实验显示汉语字的相对错误率下降了12.13%,表明了基于音节的声学模型和音节间过渡模型相结合在解决汉语协同发音问题上的有效性。

关键词: 语音识别, 协同发音, 音变, 声学建模, 音节模型

Abstract: Concerning the changeability of the speech signal caused by co-articulation phenomenon in Chinese speech recognition, a syllable-based acoustic modeling method was proposed. Firstly, context independent syllable-based acoustic models were trained, and the models were initialized by intra-syllable IFs based diphones to solve the problem of training data sparsity. Secondly, the inter-syllable co-articulation effect was captured by incorporating inter-syllable transition models into the recognition system. The experiments conducted on “863-test” dataset show that the relative character error rate is reduced by 12.13%. This proves that syllable-based acoustic model and inter-syllable transition model are effective in solving co-articulation effect.

中图分类号: