文档图像几何畸变快速校正的新方法

计算机应用 ›› 2010, Vol. 30 ›› Issue (12): 3317-3320.

文档图像几何畸变快速校正的新方法

宋丽丽¹,吴亚东²,孙波²

1. 西南科技大学计算机学院
2.

收稿日期:2010-06-08 修回日期:2010-07-20 发布日期:2010-12-22 出版日期:2010-12-01
通讯作者: 宋丽丽
基金资助:
数字图像自动修补理论与算法研究

New document image distortion correction method

Received:2010-06-08 Revised:2010-07-20 Online:2010-12-22 Published:2010-12-01

摘要/Abstract

摘要： 针对由照相机拍摄的文档图像可能存在倾斜或扭曲变形而导致光学字符识别（OCR）软件不能正确识别的情况，首先采用连通域标记方法进行单词及文本线的检测；然后根据单词中位点信息线性拟合得到其校正基线；最后根据校正基线和垂直位移距离分别对单词进行旋转和位移而得到校正后的图像。与传统方法相比，该方法得到的校正基线和垂直位移距离不受文档具体文字内容的影响，能更加准确地代表单词的倾斜走向，并保证校正后的单词在水平方向上对齐；同时表现出了很好的鲁棒性。经过分析算法的计算复杂度, 并与传统方法相比较, 该算法的效率和鲁棒性较高。

关键词: 文档图像几何畸变, 连通域标记, 中位点, 校正基线, 垂直位移距离

Abstract: Document image distortion often appears when captured by the camera, which may induce recognition mistakes by Optical Character Recognition (OCR) software. In this paper, the technology of connected components labeling was used to detect words and text lines, and then based on the information of the middle dots of the words， linear fitting was used to get the words baselines. Finally, according to the words baselines and the distance for vertical displace, words rotation and vertical displace were made to obtain the corrected image. Compared with the traditional method, the computation of the words baselines and the distance for vertical displace in this paper are independent of the documents content, so as to guarantee the precision of words slope and make all words be aligned with the same line. The computation complexity of the algorithm was discussed at the end of this paper, and comparative experiments with traditional method were made. The experimental results show the proposed method is of high efficiency and robustness.

Key words: document image distortion, connected components labeling, middle dot, correction baseline, vertical displace distance

宋丽丽吴亚东孙波. 文档图像几何畸变快速校正的新方法[J]. 计算机应用, 2010, 30(12): 3317-3320.