Uighur characters recognition based on locality preserving projection and hidden Markov model

doi:10.3724/SP.J.1087.2012.02309

Journal of Computer Applications ›› 2012, Vol. 32 ›› Issue (08): 2309-2312.DOI: 10.3724/SP.J.1087.2012.02309

• Graphics and image technology • Previous Articles Next Articles

Uighur characters recognition based on locality preserving projection and hidden Markov model

LIU Wei¹,LI He-cheng²

1. Department of Physics, Qinghai Normal University, Xining Qinghai 810008, China
2. Department of Mathematics, Qinghai Normal University, Xining Qinghai 810008, China

Received:2012-01-16 Revised:2012-03-22 Online:2012-08-28 Published:2012-08-01
Contact: LIU Wei

基于局部保持投影与隐马尔可夫模型的维文字符识别

刘卫¹,李和成²

1. 青海师范大学物理系，西宁 810008
2. 青海师范大学数学系，西宁 810008

通讯作者: 刘卫
作者简介:刘卫(1975-)，男，山东滕州人，副教授，主要研究方向：图像处理、模式识别、机器学习;
李和成(1973-)，男，青海乐都人，教授，博士，主要研究方向：进化计算、数据挖掘。
基金资助:
复杂双层规划问题的高性能可信进化算法研究

Abstract

Abstract: Concerning the shortcomings of classical Hidden Markov Model (HMM) in handwritten Uighur characters recognition, such as largly varied width of characters, slow convergent speed and premature convergence, a new Uighur characters recognition algorithm was proposed in combination with Locality Preserving Projection (LPP) and HMM. Firstly, the aspect ratio of original image was maintained by a highly-normalized method. Sub-images were obtained by using sliding window, and observation sequences were extracted from these windows. Secondly, the observation sequences were mapped into low-dimensional space based on LPP, and the scale of adjacency matrix was reduced via the random sampling technique. Finally, HMM was trained by adopting obtained observation sequences. The algorithm decreases dimension of observation vectors, accelerates the convergence, and prevents premature convergence effectively. The simulation results show the LPP-HMM algorithm is efficient and robust, which decrease average convergence steps as well as errors.

Key words: Hidden Markov Model (HMM), Locality Preserving Projection (LPP), Uighur characters recognition, normalization, convergence

摘要： 针对传统隐马尔可夫模型(HMM)在对手写维吾尔文字符建模时，字符宽度变化大，模型训练收敛缓慢，且易陷入局部极值的问题，提出一种基于保局投影(LPP)与HMM相结合的维吾尔字符识别方法。首先，通过高度归一化保持原图像的宽高比，用滑动窗获取子图像序列，形成观测向量序列；其次，采用局部保持投影将观测序列映射到低维空间，并用随机抽样方法降低邻接图矩阵的规模；最后，采用新观测序列训练HMM。该算法在降维的同时提高了HMM的收敛速度，降低了陷入局部极值的风险。实验结果显示，算法的平均收敛步数减少，错误率降低，表明算法是有效的。

关键词: 隐马尔可夫模型, 局部保持投影, 维文识别, 归一化, 收敛

CLC Number:

TP391.41

LIU Wei LI He-cheng. Uighur characters recognition based on locality preserving projection and hidden Markov model[J]. Journal of Computer Applications, 2012, 32(08): 2309-2312.

刘卫李和成. 基于局部保持投影与隐马尔可夫模型的维文字符识别[J]. 计算机应用, 2012, 32(08): 2309-2312.

References

[1]赵继印，郑蕊蕊，吴宝春,等. 脱机手写体汉字识别综述[J].电子学报, 2010, 38(2): 405-414. [2]袁保社，吾守尔·斯拉木. 一种手写维吾尔文字母识别算法[J].计算机工程,2010, 36(2): 186-188. [3]UBUL K, HAMDULL A, AYSA A, et al. Research on Uyghur off-line handwriting-based writer identification [C]// ICSP 2008: 9th International Conference on Signal Processing. Piscataway: IEEE, 2008: 1656-1659. [4]哈力木拉提，阿孜古丽. 多字体印刷维吾尔文字符识别系统的研究与开发[J].计算机学报,2004, 27(11): 1480-1484. [5]RABINER L R. A tutorial on hidden Markov models and selected applications in speech recognition[J].Proceedings of the IEEE， 1989，77(2): 257-286. [6]LORIGO L M, GOVINDARAJU V. Off-line Arabic handwriting recognition: a survey[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2006, 28(5): 712-724. [7]VINCIARELLI A. A survey on off-line cursive word recognition[J].Pattern Recognition, 2002, 35(7): 1433-1446. [8]XIANG DONG，YAN HUAHUA，CHEN XIANQIAO, et al. Offline Arabic handwriting recognition system based on HMM [C]// 2010 3rd IEEE International Conference on Computer Science and Information Technology. Piscataway: IEEE, 2010, 1: 526-529. [9]MOHAMAD R A-H, LIKFORMAN-SULEM L, MOKBEL C. Combining slanted-frame classifiers for improved HMM-based Arabic handwriting recognition[J].IEEE Transactions on Pattern Analysis and Machine Intelligence, 2009, 31(7): 1165-1177. [10]EL-HAJJ R, LIKFORMAN-SULEM L, MOKBEL C. Arabic handwriting recognition using baseline dependant features and hidden Markov modeling [C]// Proceedings of the Eighth International Conference on Document Analysis and Recognition. Piscataway: IEEE, 2005, 2: 893-897. [11]VINCIARELLI A, BENGIO S. Offline cursive word recognition using continuous density hidden Markov models trained with PCA or ICA features [C]// ICPR'02: Proceedings of 16th International Conference on Pattern Recognition. Washington, DC: IEEE Computer Society, 2002, 3: 81-84. [12]HE XIAOFEI, NIYOGI P. Locality preserving projection [C]// Advances in Neural Information Processing Systems. Cambridge: MIT Press, 2004: 153-160. [13]黄勇. 基于图像优化局部保留投影的人脸表情识别[J].计算机工程与应用, 2011, 47(27): 210-211. [14]刘敏，李晓东，王振海. 一种新的有监督保局投影人脸识别算法[J].JOCA, 2009, 29(5): 1416-1418.

[1]	Xiao CHEN, Yan CHANG, Danchen WANG, Shibin ZHANG. Low-cost adversarial example defense algorithm based on example preprocessing [J]. Journal of Computer Applications, 2024, 44(9): 2756-2762.
[2]	Weipeng JING, Qingxin XIAO, Hui LUO. Channel compensation algorithm for speaker recognition based on probabilistic spherical discriminant analysis [J]. Journal of Computer Applications, 2024, 44(2): 556-562.
[3]	Yongjian MA, Xuhua SHI, Peiyao WANG. Constrained multi-objective evolutionary algorithm based on two-stage search and dynamic resource allocation [J]. Journal of Computer Applications, 2024, 44(1): 269-277.
[4]	Yun OU, Kaiqing ZHOU, Pengfei YIN, Xuewei LIU. Improved grey wolf optimizer algorithm based on dual convergence factor strategy [J]. Journal of Computer Applications, 2023, 43(9): 2679-2685.
[5]	Saijuan XU, Zhenyu PEI, Jiawei LIN, Genggeng LIU. Constrained multi-objective evolutionary algorithm based on multi-stage search [J]. Journal of Computer Applications, 2023, 43(8): 2345-2351.
[6]	Xiang GUO, Wengang JIANG, Yuhang WANG. Encrypted traffic classification method based on improved Inception-ResNet [J]. Journal of Computer Applications, 2023, 43(8): 2471-2476.
[7]	Lei LI, Guofu ZHANG, Zhaopin SU, Feng YUE. Software testing resource allocation algorithm for dynamic changes in architecture [J]. Journal of Computer Applications, 2023, 43(7): 2261-2270.
[8]	Hao GAO, Qingke ZHANG, Xianglong BU, Junqing LI, Huaxiang ZHANG. Teaching-learning-based optimization algorithm based on cooperative mutation and Lévy flight strategy and its application [J]. Journal of Computer Applications, 2023, 43(5): 1355-1364.
[9]	Lin HUANG, Qiang FU, Nan TONG. Solving robot path planning problem by adaptively adjusted Harris hawk optimization algorithm [J]. Journal of Computer Applications, 2023, 43(12): 3840-3847.
[10]	LIU Yongmin, YANG Yujin, LUO Haoyi, HUANG Hao, XIE Tieqiang. Intrusion detection method for wireless sensor network based on bidirectional circulation generative adversarial network [J]. Journal of Computer Applications, 2023, 43(1): 160-168.
[11]	Linxiu SHA, Fan NIE, Qian GAO, Hao MENG. Alternately optimizing algorithm based on Brownian movement and gradient information [J]. Journal of Computer Applications, 2022, 42(7): 2139-2145.
[12]	Tingping ZHANG, Cong SHUAI, Jianxi YANG, Junzhi ZOU, Chaoshun YU, Lifang DU. Re-identification of vehicles based on joint stripe relations [J]. Journal of Computer Applications, 2022, 42(6): 1884-1891.
[13]	Qingqing NIE, Dingsheng WAN, Yuelong ZHU, Zhijia LI, Cheng YAO. Hydrological model based on temporal convolutional network [J]. Journal of Computer Applications, 2022, 42(6): 1756-1761.
[14]	Fangxin NIE, Yujia WANG, Xin JIA. Teaching and learning information interactive particle swarm optimization algorithm [J]. Journal of Computer Applications, 2022, 42(3): 874-882.
[15]	Yaoming MA, Yu ZHANG. Insulator detection algorithm based on improved Faster-RCNN [J]. Journal of Computer Applications, 2022, 42(2): 631-637.

Uighur characters recognition based on locality preserving projection and hidden Markov model

基于局部保持投影与隐马尔可夫模型的维文字符识别

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics