《计算机应用》唯一官方网站

• •    下一篇

基于多尺度混合域注意力机制的笔迹鉴别方法

熊武1,曹从军2,宋雪芳1,邵云龙1,王旭升1   

  1. 1. 西安理工大学
    2. 西安理工大学印包学院
  • 收稿日期:2023-07-27 修回日期:2023-09-20 发布日期:2023-10-26 出版日期:2023-10-26
  • 通讯作者: 熊武

Handwriting identification method based on multi-scale mixed domain attention mechanism

  • Received:2023-07-27 Revised:2023-09-20 Online:2023-10-26 Published:2023-10-26

摘要: 针对笔迹鉴别任务中图像大面积是背景,笔迹信息稀疏,关键性信息难以捕捉,并且个人笔迹签名风格具有微小变化而刻意模仿的笔迹高度相似,以及公开的中文笔迹数据集的匮乏的问题,通过对注意力机制和孪生网络模型进行改进,提出了一种基于多尺度混合域注意力机制的笔迹鉴别网络(MMDANet)。首先,在有效通道注意力模块上并联一个最大池化层,对二维条带池化模块的通道数扩展到三维,将两者融合生成混合域模块(MDM),解决了笔迹图像大面积是背景,笔迹信息稀疏,细节特征难以提取的问题;其次,利用PANet特征金字塔进行多尺度提取特征,捕获真伪笔迹间的细微差异,采用孪生网络的对比损失与AM-Softmax损失加权融合进行训练,从而增加了类别间的区分度,解决了个人笔迹风格变化和真伪笔迹高度相似的问题。最后为促进中文笔迹数据集的发展,自制了总体样本为8000张的中文笔迹数据集(CHD)。所提方法在自制中文数据集CHD上的准确率达到了84.25%,且相较于次优的Two-stage SiamNet方法,所提方法在三个外文数据集Cedar、Bengla和Hindi上准确率分别提升了4.53%、1.02%和1.67%。实验结果表明,MMDANet可以更准确地捕获真伪笔迹的细微差异,能完成复杂的笔迹鉴别任务。

Abstract: In the task of handwriting identification, the large area of image is background, handwriting information is sparse, key information is difficult to capture, and personal handwriting signature style has slight changes and handwriting imitated is highly similar, as well as the lack of public Chinese handwriting datasets. By improving attention mechanism and Siamese network model, a handwriting identification Network based on the Multi-scale and Mixed Domain Attention mechanism (MMDANet) was proposed. Firstly, a maximum pooling layer was connected in parallel to the effective channel attention module, and the number of channels of two-dimensional strip pooling module was extended to three dimensions, and the two were fused to generate a Mixed Domain Module (MDM), thereby solving the problem that large area of handwriting image is background, handwriting information is sparse and detailed features are difficult to extract. Secondly, the Path Aggregation Network (PANet) feature pyramid was used to extract features at multiple scales to capture the subtle differences between true and false handwriting, and the comparison loss of Siamese network and the weighted fusion of Additive Margin Softmax (AM-Softmax) loss were used for training. It increased the degree of discrimination between categories, and solved the problem of personal handwriting style variation and high similarity between true and false handwriting. Finally, in order to promote the development of Chinese handwriting datasets, a Chinese Handwriting Dataset (CHD) with a total sample of 8,000 pieces was self-made. The accuracy rate of the proposed method on the Chinese dataset CHD reaches 84.25%, and compared with the suboptimal method Two-stage Siamese Network (Two-stage SiamNet), the proposed method is accurate on three foreign language datasets Cedar, Bengla and Hindi. The rates increased by 4.53%, 1.02% and 1.67% respectively.The experimental results show that the MMDANet can more accurately capture the subtle differences between true and false handwriting, and complete complex handwriting identification tasks.

中图分类号: