[1] TOLEDANO D T, GOMEZ L A H, GRANDE L V. Automatic phonetic segmentation[J]. IEEE Transactions on Speech and Audio Processing, 2003, 11(6): 617-625.
[2] WU Y J, KAWAI H, NI J, et al. Discriminative training and explicit duration modeling for HMM-based automatic segmentation[J]. Speech Communication, 2005, 47(3): 397-410.
[3] van HEMERT J P. Automatic segmentation of speech[J]. IEEE Transactions on Signal Processing, 1991, 39(4): 1008-1012.
[4] CHOU F C, TSENG C Y, LEE L S. A set of corpus-based text-to-speech synthesis technologies for Mandarin Chinese[J]. IEEE Transactions on Speech and Audio Processing, 2002, 10(7): 481-494.
[5] 杜守栓. 方言口音普通话语音自动切分算法研究[D]. 北京: 中国科学院, 2006: 15-26. (DU S S. Research on robust automatic segmentation of dialectal speech[D]. Beijing: University of Chinese Academy of Sciences, 2006: 15-26.)
[6] 何可嘉. 广播语言的自动标注系统[D]. 北京: 北京邮电大学, 2010: 22-47. (HE K J. An automatic labeling system for broadcast news[D]. Beijing: Beijing University of Posts and Telecommunications, 2010: 22-47.)
[7] 韩虎. 汉语连续语音的音节自动标注算法研究及实现[D]. 哈尔滨: 哈尔滨工业大学, 2008: 21-44. (HAN H. Research and realization of the automatic syllable marking algorithm for Chinese continuous speech[D]. Harbin: Harbin Institute of Technology, 2008: 21-44.)
[8] LEE K S. MLP-based phone boundary refining for a TTS database[J]. IEEE Transactions on Audio, Speech and Language Processing, 2006, 14(3): 981-989.
[9] BROGNAUX S, DRUGMAN T. HMM-based speech segmentation: improvements of fully automatic approaches[J]. IEEE Transactions on Audio, Speech and Language Processing, 2016, 24(1): 5-15.
[10] 廖文辉, 刘炎. 数据分析与SAS实验[M]. 北京: 经济科学出版社, 2010: 13-32. (LIAO W H, LIU Y. Data Analysis and SAS Experiment[M]. Beijing: Economic Science Press, 2010: 13-32.)
[11] 宋知用. Matlab在语音信号分析与合成中的应用[M]. 北京: 北京航空航天大学出版社, 2013: 117-129. (SONG Z Y. Application of Matlab in Speech Signal Analysis and Synthesis[M]. Beijing: Beihang University Press, 2013: 117-129.)
[12] 章森, 刘磊, 刁麓弘. 大规模语音语料库及其在TTS中应用的几个问题[J]. 计算机学报, 2010, 33(4): 667-696. (ZHANG S, LIU L, DIAO L H. Problems on large-scale speech corpus and the applications in TTS[J]. Chinese Journal of Computers, 2010, 33(4): 667-696.)

Background
ZHANG Yang, born in 1989, Ph.D. candidate. His research interests include speech signal processing.
ZHAO Xiaoqun, born in 1962, Ph.D., professor. His research interests include speech signal processing and coding theory.
WANG Digang, born in 1988, Ph.D. candidate. His research interests include coding theory.