Journal of Computer Applications ›› 2012, Vol. 32 ›› Issue (11): 3198-3205.DOI: 10.3724/SP.J.1087.2012.03198

Previous Articles     Next Articles

Character segmentation method of check under complex background

YE Long-huan1,WANG Jun-Feng2,GAO Lin3,YUAN Jun1   

  1. 1. College of Computer Science, Sichuan University, Chengdu Sichuan 610064,China
    2. College of Computer Science, Sichuan University, Chengdu Sichuan 610065, China
    3. College of Computer Science and Technology, Southwest University of Science and Technology,Mianyang Sichuan 621010,China
  • Received:2012-05-11 Revised:2012-07-12 Online:2012-11-12 Published:2012-11-01
  • Contact: WANG Jun-Feng

复杂背景下的票据字符分割方法

叶龙欢1,王俊峰2,高琳3,袁军1   

  1. 1. 四川大学 计算机学院,成都 610064
    2. 四川大学 计算机学院,成都610065
    3. 西南科技大学 计算机科学与技术学院,四川 绵阳 621010
  • 通讯作者: 王俊峰
  • 作者简介:叶龙欢(1988-),女,四川成都人,硕士研究生,主要研究方向:数字图像处理;王俊峰(1976-),男,安徽芜湖人,教授,博士生导师,博士,主要研究方向:空间信息网、网络安全、数字图像处理;高琳(1976-),男,湖北襄阳人,讲师,主要研究方向:图像处理、模式识别;袁军(1989-),男,安徽阜阳人,硕士研究生,主要研究方向:数字图像处理。
  • 基金资助:
    国家自然科学基金资助项目(11102124);四川省应用基础研究项目(2010JY0013);教育部新世纪优秀人才支持计划项目(NCET-10-0604);教育部博士点基金资助项目(20090181110053)

Abstract: Complex background including shading, seal and some images has a bad effect on character recognition of check. Thus, in this paper, an effective method that extracts the significant texture characteristics of character from the image by fast lifting wavelet transform was proposed to solve this problem. A coarse-to-fine searching strategy was adopted to distinguish the characters from background at the level of block of pixels and single pixel. First, Support Vector Machine (SVM) was used to classify blocks according to texture characteristics, and during this process text region could be located. Then character segmentation was achieved by using K-means algorithm for clustering the pixels at the text region. The experimental results show the high accuracy and strong robustness of the proposed method at the situation of strong interference of complex texture and seal.

Key words: character segmentation, fast lifting wavelet transform, Support Vector Machine (SVM), K-means clustering

摘要: 针对票据字符识别中图像存在的底纹、印章和图案等复杂背景干扰问题,提出一种有效的字符分割方法。通过快速提升小波变换提取出图像中具有显著性的字符纹理特征。采用一种由粗到精的搜索策略,在图像区域和像素两个层次上逐步区分出文字和背景。首先根据区域纹理特征,利用支持向量机对区域进行分类,定位出包含文字的图像区域;然后采用K-means算法对文字区域内的像素进行聚类划分,从而实现文字分割。实验结果表明,方法具有较高的准确性,并且在背景纹理和印章干扰的情况下具有较好的鲁棒性。

关键词: 字符分割, 快速提升小波变换, 支持向量机, K-means聚类

CLC Number: