Journal of Computer Applications ›› 2010, Vol. 30 ›› Issue (05): 1259-1261.
• Pattern recognition • Previous Articles Next Articles
Received:
Revised:
Online:
Published:
Contact:
蔡锋1,刘立柱2
通讯作者:
Abstract: Locating the telephone number region is a very important technology in telephone number recognition of the fax images. After realizing a relative precise page analysis on the fax images by adopting the Connected Component Analysis (CCA) to form comparatively whole word regions, the features of horizontal traversing times and spatial distribution were abstracted to form feature vector of fifty-one dimensions. The multi-class Support Vector Machine (SVM) based on normal decision tree was introduced to achieve the key words location. The experimental results show that the method can realize the location quickly and effectively, and it is valuable in applications.
Key words: Connected Component Analysis (CCA), horizontal traversing times, spatial distribution feature, Support Vector Machine (SVM), key words location
摘要: 电话号码区域定位是传真图像电话号码识别中的关键技术之一。首先采用连通域分析对传真图像实现较为精确的版面分析,形成比较完整的单词连通域,提取单词连通域的水平穿越次数和空间分布特征,形成51维的特征向量。采用基于正态决策树的多分类支持向量机(SVM),来完成对传真图像电话号码区域关键词的定位。实验结果表明,算法能够快速有效地完成关键词的定位,具有较强的实用价值。
关键词: 连通域分析, 水平穿越次数, 空间分布特征, 支持向量机, 关键词定位
蔡锋 刘立柱. 基于连通域分析和支持向量机的传真图像关键词定位[J]. 计算机应用, 2010, 30(05): 1259-1261.
0 / Recommend
Add to citation manager EndNote|Ris|BibTeX
URL: https://www.joca.cn/EN/
https://www.joca.cn/EN/Y2010/V30/I05/1259