| 1 | ZHOU X Y, YAO C, WEN H, et al. EAST: an efficient and accurate scene text detector[C]// Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE, 2017: 2642-2651.  10.1109/cvpr.2017.283 | 
																													
																						| 2 | LI Y, WU Z, ZHAO S, et al. PSENet: psoriasis severity evaluation network[C]// Proceedings of the 34th AAAI Conference on Artificial Intelligence. Palo Alto, CA: AAAI Press, 2020: 800-807.  10.1609/aaai.v34i01.5424 | 
																													
																						| 3 | WANG W H, XIE E Z, LI X, et al. PAN++: towards efficient and accurate end-to-end spotting of arbitrarily-shaped text[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2022, 44(9): 5349-5367. | 
																													
																						| 4 | LIAO M H, WAN Z Y, YAO C, et al. Real-time scene text detection with differentiable binarization[C]// Proceedings of the 34th AAAI Conference on Artificial Intelligence. Palo Alto, CA: AAAI Press, 2020: 11474-11481.  10.1609/aaai.v34i07.6812 | 
																													
| 5 | SHI G C, WU Y R. Arbitrary shape scene-text detection based on pixel aggregation and feature enhancement[J]. Journal of Image and Graphics, 2021, 26(7): 1614-1624.  10.11834/jig.200522 | 
																													
																						| 6 | SHI B G, BAI X, YAO C. An end-to-end trainable neural network for image-based sequence recognition and its application to scene text recognition[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39(11): 2298-2304.  10.1109/tpami.2016.2646371 | 
																													
																						| 7 | WANG T W, ZHU Y Z, JIN L W, et al. Decoupled attention network for text recognition[C]// Proceedings of the 34th AAAI Conference on Artificial Intelligence. Palo Alto, CA: AAAI Press, 2020: 12216-12224.  10.1609/aaai.v34i07.6903 | 
																													
																						| 8 | LI H, WANG P, SHEN C H, et al. Show, attend and read: a simple and strong baseline for irregular text recognition[C]// Proceedings of the 33rd AAAI Conference on Artificial Intelligence. Palo Alto, CA: AAAI Press, 2019: 8610-8617.  10.1609/aaai.v33i01.33018610 | 
																													
																						| 9 | WANG Y Z, LIAN Z H. Exploring font-independent features for scene text recognition[C]// Proceedings of the 28th ACM International Conference on Multimedia. New York: ACM, 2020: 1900-1920.  10.1145/3394171.3413592 | 
																													
| 10 | ZHU L, CHEN H, JING X R. Text recognition of natural scenes in any direction[J]. Journal of Chongqing University of Posts and Telecommunications (Natural Science Edition), 2022, 34(1): 125-133. | 
																													
																						| 11 | WANG Y Z, GAO Y, LIAN Z H. Attribute2Font: creating fonts you want from attributes[J]. ACM Transactions on Graphics, 2020, 39(4): No.69.  10.1145/3386569.3392456 | 
																													
																						| 12 | XIE Y C, CHEN X Y, SUN L, et al. DG-Font: deformable generative networks for unsupervised font generation[C]// Proceedings of the 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE, 2021: 5126-5136.  10.1109/cvpr46437.2021.00509 | 
																													
																						| 13 | LIU Y T, LIAN Z H. FontRL: Chinese font synthesis via deep reinforcement learning[C]// Proceedings of the 35th AAAI Conference on Artificial Intelligence. Palo Alto, CA: AAAI Press, 2021: 2198-2206.  10.1609/aaai.v35i3.16318 | 
																													
																						| 14 | WU L, ZHANG C Q, LIU J M, et al. Editing text in the wild[C]// Proceedings of the 27th ACM International Conference on Multimedia. New York: ACM, 2019: 1500-1508.  10.1145/3343031.3350929 | 
																													
																						| 15 | YANG Q P, HUANG J, LIN W. SwapText: image based texts transfer in scenes[C]// Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE, 2020: 14688-14697.  10.1109/cvpr42600.2020.01471 | 
																													
																						| 16 | ROY P, BHATTACHARYA S, GHOSH S, et al. STEFANN: scene text editor using font adaptive neural network[C]// Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE, 2020: 13225-13234.  10.1109/cvpr42600.2020.01324 | 
																													
																						| 17 | SHIMODA W, HARAGUCHI D, UCHIDA S, et al. De-rendering stylized texts[C]// Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision. Piscataway: IEEE, 2021: 1056-1065.  10.1109/iccv48922.2021.00111 | 
																													
																						| 18 | WANG Z, BOVIK A C, SHEIKH H R, et al. Image quality assessment: from error visibility to structural similarity[J]. IEEE Transactions on Image Processing, 2004, 13(4): 600-612.  10.1109/tip.2003.819861 | 
																													
																						| 19 | ZHANG S T, LIU Y L, JIN L W, et al. EnsNet: ensconce text in the wild[C]// Proceedings of the 33rd AAAI Conference on Artificial Intelligence. Palo Alto, CA: AAAI Press, 2019: 801-808.  10.1609/aaai.v33i01.3301801 | 
																													
																						| 20 | LIU C Y, LIU Y L, JIN L W, et al. EraseNet: end-to-end text removal in the wild[J]. IEEE Transactions on Image Processing, 2020, 29: 8760-8775.  10.1109/tip.2020.3018859 | 
																													
| 21 | RONNEBERGER O, FISCHER P, BROX T. U-Net: convolutional networks for biomedical image segmentation[C]// Proceedings of the 2015 International Conference on Medical Image Computing and Computer-Assisted Intervention, LNCS 9351. Cham: Springer, 2015: 234-241. | 
																													
| 22 | YU F, KOLTUN V. Multi-scale context aggregation by dilated convolutions[EB/OL]. [2022-04-07]. . | 
																													
																						| 23 | HE K M, ZHANG X Y, REN S Q, et al. Deep residual learning for image recognition[C]// Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE, 2016: 770-778.  10.1109/cvpr.2016.90 | 
																													
																						| 24 | BAEK J, KIM G, LEE J, et al. What is wrong with scene text recognition model comparisons? dataset and model analysis[C]// Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision. Piscataway: IEEE, 2019: 4714-4722.  10.1109/iccv.2019.00481 | 
																													
																						| 25 | GRAVES A, LIWICKI M, FERNÁNDEZ S, et al. A novel connectionist system for unconstrained handwriting recognition[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2009, 31(5): 855-868.  10.1109/tpami.2008.137 | 
																													
																						| 26 | ISOLA P, ZHU J Y, ZHOU T H, et al. Image-to-image translation with conditional adversarial networks[C]// Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE, 2017: 5967-5976.  10.1109/cvpr.2017.632 | 
																													
																						| 27 | KINGMA D P, BA J L. Adam: a method for stochastic optimization[EB/OL]. [2022-03-18]. . | 
																													
																						| 28 | KARATZAS D, SHAFAIT F, UCHIDA S, et al. ICDAR 2013 robust reading competition[C]// Proceedings of the 12th International Conference on Document Analysis and Recognition. Piscataway: IEEE, 2013: 1484-1493.  10.1109/icdar.2013.221 |