• •    

CCDM2022+65: 一种改进的基于多路径特征的胶囊网络

徐清海,丁世飞,孙统风,张健,郭丽丽   

  1. 中国矿业大学
  • 收稿日期:2022-03-22 修回日期:2022-04-20 发布日期:2022-06-29
  • 通讯作者: 丁世飞
  • 基金资助:
    国家自然科学基金

CCDM 2022+65: An improved capsule network based on multipath feature

  • Received:2022-03-22 Revised:2022-04-20 Online:2022-06-29
  • Supported by:
    the National Natural Science Foundation of China

摘要: 摘 要: 针对胶囊网络(CapsNet)在复杂数据集的分类任务上表现不佳,以及其在路由过程中存在巨大的计算开销等问题,提出了一种基于多路径特征的胶囊网络(MCNet)。MCNet包含新的胶囊特征提取器和新的胶囊池化方法。胶囊特征提取器从多个不同路径中并行地提取不同层次、不同位置的特征,然后将特征编码为包含更多语义信息的胶囊特征。胶囊池化方法在胶囊特征图的每个位置选取最活跃的胶囊,用少量的胶囊表示了有效的胶囊特征。本研究在4个数据集上(CIFAR10、SVHN、Fashion-MNIST、MNIST)进行了实验。其中在CIFAR10数据集上,MCNet的分类准确率为79.27%,可训练的参数数量为5.88M。与CapsNet相比,MCNet的分类准确率提升了8.7%,参数数量减少了39.8%。实验结果表明,MCNet能够有效提升分类准确率,同时减少可训练的参数数量。

关键词: 胶囊网络, 深度学习, 动态路由, 胶囊池化, 反卷积重构

Abstract: Abstract: In view of the problems that capsule network (CapsNet) does not perform well in the classification of complex datasets and thers is a large amount of computation in the routing process, the capsule network based on multipath feature (MCNet) was proposed. A novel capsule feature extractor and a novel capsule pooling method were included in MCNet. Tthe features of different layers and locations were extracted in parallel from different paths by the capsule feature extractor, and then the features were encoded into capsule features that contain more semantic information. In the capsule pooling method, the most active capsules at each position of the capsule feature map were selected, and the effective capsule features were represented by the small number of capsules, Experiments were carried out on multiple datasets (CIFAR10, SVHN, Fashion-MNIST, MNIST). On the CIFAR10 dataset, the classification accuracy of MCNet is 79.27%, and the number of trainable parameters is 5.88M. Compared with CapsNet, the classification accuracy of MCNet is improved by 8.7%, and the number of parameters is reduced by 39.8%. The experimental results show that, MCNet not only improves the classification accuracy but also reduces the number of trainable parameters.

Key words: Keywords: capsule network, deep learning, dynamic routing, capsule pooling, deconvolution reconstruction