Journal of Computer Applications ›› 2023, Vol. 43 ›› Issue (5): 1416-1421.DOI: 10.11772/j.issn.1001-9081.2022040520
Special Issue: 人工智能
• Artificial intelligence • Previous Articles Next Articles
Jingchao CHEN1,2, Shugong XU1(), Youdong DING2
Received:
2022-04-15
Revised:
2022-06-09
Accepted:
2022-06-13
Online:
2022-07-01
Published:
2023-05-10
Contact:
Shugong XU
About author:
CHEN Jingchao, born in 1997, M. S. candidate. His research interests include text editing, font recognition.通讯作者:
徐树公
作者简介:
陈靖超(1997—),男,上海人,硕士研究生,主要研究方向:文本编辑、字体识别CLC Number:
Jingchao CHEN, Shugong XU, Youdong DING. Text image editing method based on font and character attribute guidance[J]. Journal of Computer Applications, 2023, 43(5): 1416-1421.
陈靖超, 徐树公, 丁友东. 基于字体字符属性引导的文本图像编辑方法[J]. 《计算机应用》唯一官方网站, 2023, 43(5): 1416-1421.
Add to citation manager EndNote|Ris|BibTeX
URL: https://www.joca.cn/EN/10.11772/j.issn.1001-9081.2022040520
区域 | PSNR/dB | SSIM |
---|---|---|
文本区域 | 17.30 | 0.70 |
背景区域 | 32.10 | 0.95 |
整体区域 | 22.91 | 0.79 |
Tab. 1 PSNR and SSIM of each area of edited results
区域 | PSNR/dB | SSIM |
---|---|---|
文本区域 | 17.30 | 0.70 |
背景区域 | 32.10 | 0.95 |
整体区域 | 22.91 | 0.79 |
SRNet | 字体分类器 | 字符分类器 | 端到端微调 | PSNR/dB | SSIM | MSE | |||
---|---|---|---|---|---|---|---|---|---|
结果 | Δ | 结果 | Δ | 结果 | Δ | ||||
√ | × | × | × | 22.91 | — | 0.787 | — | 0.007 4 | — |
√ | √ | × | × | 23.93 | 1.02 | 0.813 | 0.026 | 0.006 1 | -0.001 3 |
√ | √ | √ | × | 24.45 | 0.52 | 0.827 | 0.014 | 0.005 3 | -0.000 8 |
√ | √ | √ | √ | 25.48 | 1.03 | 0.842 | 0.015 | 0.004 3 | -0.001 0 |
Tab. 2 Quantitative evaluation results of ablation study
SRNet | 字体分类器 | 字符分类器 | 端到端微调 | PSNR/dB | SSIM | MSE | |||
---|---|---|---|---|---|---|---|---|---|
结果 | Δ | 结果 | Δ | 结果 | Δ | ||||
√ | × | × | × | 22.91 | — | 0.787 | — | 0.007 4 | — |
√ | √ | × | × | 23.93 | 1.02 | 0.813 | 0.026 | 0.006 1 | -0.001 3 |
√ | √ | √ | × | 24.45 | 0.52 | 0.827 | 0.014 | 0.005 3 | -0.000 8 |
√ | √ | √ | √ | 25.48 | 1.03 | 0.842 | 0.015 | 0.004 3 | -0.001 0 |
方法 | PSNR/dB | SSIM | MSE |
---|---|---|---|
SRNet | 22.91 | 0.787 | 0.007 4 |
SwapText | 23.37 | 0.796 | 0.006 7 |
本文方法 | 25.48 | 0.842 | 0.004 3 |
Tab. 3 Quantitative evaluation results of comparison experiments
方法 | PSNR/dB | SSIM | MSE |
---|---|---|---|
SRNet | 22.91 | 0.787 | 0.007 4 |
SwapText | 23.37 | 0.796 | 0.006 7 |
本文方法 | 25.48 | 0.842 | 0.004 3 |
1 | ZHOU X Y, YAO C, WEN H, et al. EAST: an efficient and accurate scene text detector[C]// Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE, 2017: 2642-2651. 10.1109/cvpr.2017.283 |
2 | LI Y, WU Z, ZHAO S, et al. PSENet: psoriasis severity evaluation network[C]// Proceedings of the 34th AAAI Conference on Artificial Intelligence. Palo Alto, CA: AAAI Press, 2020: 800-807. 10.1609/aaai.v34i01.5424 |
3 | WANG W H, XIE E Z, LI X, et al. PAN++: towards efficient and accurate end-to-end spotting of arbitrarily-shaped text[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2022, 44(9): 5349-5367. |
4 | LIAO M H, WAN Z Y, YAO C, et al. Real-time scene text detection with differentiable binarization[C]// Proceedings of the 34th AAAI Conference on Artificial Intelligence. Palo Alto, CA: AAAI Press, 2020: 11474-11481. 10.1609/aaai.v34i07.6812 |
5 | 师广琛,巫义锐. 像素聚合和特征增强的任意形状场景文本检测[J]. 中国图象图形学报, 2021, 26(7):1614-1624. 10.11834/jig.200522 |
SHI G C, WU Y R. Arbitrary shape scene-text detection based on pixel aggregation and feature enhancement[J]. Journal of Image and Graphics, 2021, 26(7): 1614-1624. 10.11834/jig.200522 | |
6 | SHI B G, BAI X, YAO C. An end-to-end trainable neural network for image-based sequence recognition and its application to scene text recognition[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39(11): 2298-2304. 10.1109/tpami.2016.2646371 |
7 | WANG T W, ZHU Y Z, JIN L W, et al. Decoupled attention network for text recognition[C]// Proceedings of the 34th AAAI Conference on Artificial Intelligence. Palo Alto, CA: AAAI Press, 2020: 12216-12224. 10.1609/aaai.v34i07.6903 |
8 | LI H, WANG P, SHEN C H, et al. Show, attend and read: a simple and strong baseline for irregular text recognition[C]// Proceedings of the 33rd AAAI Conference on Artificial Intelligence. Palo Alto, CA: AAAI Press, 2019: 8610-8617. 10.1609/aaai.v33i01.33018610 |
9 | WANG Y Z, LIAN Z H. Exploring font-independent features for scene text recognition[C]// Proceedings of the 28th ACM International Conference on Multimedia. New York: ACM, 2020: 1900-1920. 10.1145/3394171.3413592 |
10 | 朱莉,陈宏,景小荣. 任意方向自然场景文本识别[J]. 重庆邮电大学学报(自然科学版), 2022, 34(1):125-133. |
ZHU L, CHEN H, JING X R. Text recognition of natural scenes in any direction[J]. Journal of Chongqing University of Posts and Telecommunications (Natural Science Edition), 2022, 34(1): 125-133. | |
11 | WANG Y Z, GAO Y, LIAN Z H. Attribute2Font: creating fonts you want from attributes[J]. ACM Transactions on Graphics, 2020, 39(4): No.69. 10.1145/3386569.3392456 |
12 | XIE Y C, CHEN X Y, SUN L, et al. DG-Font: deformable generative networks for unsupervised font generation[C]// Proceedings of the 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE, 2021: 5126-5136. 10.1109/cvpr46437.2021.00509 |
13 | LIU Y T, LIAN Z H. FontRL: Chinese font synthesis via deep reinforcement learning[C]// Proceedings of the 35th AAAI Conference on Artificial Intelligence. Palo Alto, CA: AAAI Press, 2021: 2198-2206. 10.1609/aaai.v35i3.16318 |
14 | WU L, ZHANG C Q, LIU J M, et al. Editing text in the wild[C]// Proceedings of the 27th ACM International Conference on Multimedia. New York: ACM, 2019: 1500-1508. 10.1145/3343031.3350929 |
15 | YANG Q P, HUANG J, LIN W. SwapText: image based texts transfer in scenes[C]// Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE, 2020: 14688-14697. 10.1109/cvpr42600.2020.01471 |
16 | ROY P, BHATTACHARYA S, GHOSH S, et al. STEFANN: scene text editor using font adaptive neural network[C]// Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE, 2020: 13225-13234. 10.1109/cvpr42600.2020.01324 |
17 | SHIMODA W, HARAGUCHI D, UCHIDA S, et al. De-rendering stylized texts[C]// Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision. Piscataway: IEEE, 2021: 1056-1065. 10.1109/iccv48922.2021.00111 |
18 | WANG Z, BOVIK A C, SHEIKH H R, et al. Image quality assessment: from error visibility to structural similarity[J]. IEEE Transactions on Image Processing, 2004, 13(4): 600-612. 10.1109/tip.2003.819861 |
19 | ZHANG S T, LIU Y L, JIN L W, et al. EnsNet: ensconce text in the wild[C]// Proceedings of the 33rd AAAI Conference on Artificial Intelligence. Palo Alto, CA: AAAI Press, 2019: 801-808. 10.1609/aaai.v33i01.3301801 |
20 | LIU C Y, LIU Y L, JIN L W, et al. EraseNet: end-to-end text removal in the wild[J]. IEEE Transactions on Image Processing, 2020, 29: 8760-8775. 10.1109/tip.2020.3018859 |
21 | RONNEBERGER O, FISCHER P, BROX T. U-net: convolutional networks for biomedical image segmentation[C]// Proceedings of the 2015 International Conference on Medical Image Computing and Computer-Assisted Intervention, LNCS 9351. Cham: Springer, 2015: 234-241. |
22 | YU F, KOLTUN V. Multi-scale context aggregation by dilated convolutions[EB/OL]. [2022-04-07]. . 10.1109/cvpr.2017.75 |
23 | HE K M, ZHANG X Y, REN S Q, et al. Deep residual learning for image recognition[C]// Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE, 2016: 770-778. 10.1109/cvpr.2016.90 |
24 | BAEK J, KIM G, LEE J, et al. What is wrong with scene text recognition model comparisons? dataset and model analysis[C]// Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision. Piscataway: IEEE, 2019: 4714-4722. 10.1109/iccv.2019.00481 |
25 | GRAVES A, LIWICKI M, FERNÁNDEZ S, et al. A novel connectionist system for unconstrained handwriting recognition[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2009, 31(5): 855-868. 10.1109/tpami.2008.137 |
26 | ISOLA P, ZHU J Y, ZHOU T H, et al. Image-to-image translation with conditional adversarial networks[C]// Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE, 2017: 5967-5976. 10.1109/cvpr.2017.632 |
27 | KINGMA D P, BA J L. Adam: a method for stochastic optimization[EB/OL]. [2022-03-18]. . |
28 | KARATZAS D, SHAFAIT F, UCHIDA S, et al. ICDAR 2013 robust reading competition[C]// Proceedings of the 12th International Conference on Document Analysis and Recognition. Piscataway: IEEE, 2013: 1484-1493. 10.1109/icdar.2013.221 |
[1] | QIN Jun, LUO Yifan, TIE Jun, ZHENG Lu, LYU Weilong. Beijing Opera character recognition based on attention mechanism with HyperColumn [J]. Journal of Computer Applications, 2021, 41(4): 1027-1034. |
[2] | ZHENG Yanbin, HAN Mengyun, FAN Wenxin. Handwritten Chinese character recognition based on two dimensional principal component analysis and convolutional neural network [J]. Journal of Computer Applications, 2020, 40(8): 2465-2471. |
[3] | DONG Junfei, ZHENG Bochuan, YANG Zejing. Character recognition of license plate based on convolution neural network [J]. Journal of Computer Applications, 2017, 37(7): 2014-2018. |
[4] | KONG Yu, WANG Shuying. Information acquisition solution for automotive after-sales service based on Android platform [J]. Journal of Computer Applications, 2015, 35(12): 3586-3591. |
[5] | SU Chang HU Xiao-dong WANG Bin-fu SHANG Feng-jun. Video image character recognition based on stroke-related weight [J]. Journal of Computer Applications, 2012, 32(08): 2305-2312. |
[6] | . Vehicle license plate character recognition based on relative entropy function criterion [J]. Journal of Computer Applications, 2010, 30(4): 977-979. |
[7] | . Improved Chinese chessboard recognition method [J]. Journal of Computer Applications, 2010, 30(4): 980-981. |
[8] | Min WU. Algorithm of numerical attributes reduction based on similarity rough set [J]. Journal of Computer Applications, 2010, 30(1): 156-158. |
[9] | . Multi-class cluster support vector machines [J]. Journal of Computer Applications, 2010, 30(1): 143-145. |
[10] | . Handwritten Chinese character recognition based on double elastic mesh [J]. Journal of Computer Applications, 2009, 29(2): 395-397. |
[11] | . New HHT-based handwritten Chinese character recognition method [J]. Journal of Computer Applications, 2009, 29(12): 3363-3365. |
[12] | . Handwritten character recognition based on compressive sensing [J]. Journal of Computer Applications, 2009, 29(08): 2080-2082. |
[13] | . Effective strategy to prevent click fraud [J]. Journal of Computer Applications, 2009, 29(07): 1790-1792. |
[14] | . Extraction of the stroke plane for the handwritten Chinese character recognition based on elastic stroke length and recognition [J]. Journal of Computer Applications, 2007, 27(6): 1500-1501. |
[15] | . Handwritten Chinese Character Recognition Based on Local Gabor Filter Bank [J]. Journal of Computer Applications, 2007, 27(5): 1222-1224. |
Viewed | ||||||
Full text |
|
|||||
Abstract |
|
|||||