Abstract: Letter-to-Phoneme Conversion (L2P) is an important component of English speech synthesis systems. The first task of L2P is grapheme segmentation. This paper presents a machine learning method, the Finite Generalization Algorithm (FGA), which is used to learn rules for English grapheme segmentation. The average accuracies on the training and testing sets were 99.84% and 97.88%, respectively, for instance segmentation, and 99.72% and 96.35%, respectively, for word segmentation. The average number of rules was 472, about 1 rule per 52 words.
王永生,柴佩琪. 英语语音合成中基于有限泛化法的字素切分规则的机器学习[J]. 计算机应用, 2005, 25(09): 2010-2014.
WANG Yong-sheng, CHAI Pei-qi. English grapheme segmentation rules learning based on the finite generalization algorithm in English speech synthesis. Journal of Computer Applications, 2005, 25(09): 2010-2014.