Journal of Computer Applications ›› 2026, Vol. 46 ›› Issue (5): 1677-1683.DOI: 10.11772/j.issn.1001-9081.2025050632
• Frontier and comprehensive applications • Previous Articles
Xinyao LIU1, Jun LIANG1, Jiahao LONG1, Renliang YAN2(
)
Received:2025-06-09
Revised:2025-07-10
Accepted:2025-07-18
Online:2025-08-01
Published:2026-05-10
Contact:
Renliang YAN
About author:LIU Xinyao, born in 1998, M. S. candidate. Her research interests include image classification, pattern recognition.Supported by:通讯作者:
颜仁梁
作者简介:刘馨瑶(1998—),女,山西大同人,硕士研究生,主要研究方向:图像分类、模式识别基金资助:CLC Number:
Xinyao LIU, Jun LIANG, Jiahao LONG, Renliang YAN. Fine-grained Chinese herbal medicine image classification based on feature fusion and channel information compensation[J]. Journal of Computer Applications, 2026, 46(5): 1677-1683.
刘馨瑶, 梁军, 龙嘉濠, 颜仁梁. 基于特征融合和通道信息补偿的中草药细粒度图像分类[J]. 《计算机应用》唯一官方网站, 2026, 46(5): 1677-1683.
Add to citation manager EndNote|Ris|BibTeX
URL: https://www.joca.cn/EN/10.11772/j.issn.1001-9081.2025050632
| 名称 | 配置环境 |
|---|---|
| CPU | Intel Xeon Silver 4210R CPU @ 2.40 GHz |
| GPU | NVIDIA Quadro RTX A5000 * 2 |
| 操作系统 | Ubuntu-22.04.1 (64位) |
| 显存 | 24 GB |
| PyTorch版本 | 1.12.1 |
| torchvision版本 | 0.13.1 |
| CUDA版本 | 11.6 |
Tab. 1 Hardware and software environment
| 名称 | 配置环境 |
|---|---|
| CPU | Intel Xeon Silver 4210R CPU @ 2.40 GHz |
| GPU | NVIDIA Quadro RTX A5000 * 2 |
| 操作系统 | Ubuntu-22.04.1 (64位) |
| 显存 | 24 GB |
| PyTorch版本 | 1.12.1 |
| torchvision版本 | 0.13.1 |
| CUDA版本 | 11.6 |
| 模型 | 准确率/% | 召回率/% | F1分数/% |
|---|---|---|---|
| CMAL-Net[ | 90.041 | 78 | 75 |
| ConvNeXt[ | 88.814 | 82 | 79 |
| PIM[ | 93.498 | 86 | 84 |
| IELT[ | 92.922 | 93 | 89 |
| SR-GNN[ | 82.317 | 71 | 68 |
| I2-HOFI[ | 93.642 | 89 | 87 |
| SIM-OFE[ | 93.257 | 86 | 83 |
| CHMRN | 93.910 | 91 | 88 |
Tab. 2 Performance comparison of different models on Herb-150 dataset
| 模型 | 准确率/% | 召回率/% | F1分数/% |
|---|---|---|---|
| CMAL-Net[ | 90.041 | 78 | 75 |
| ConvNeXt[ | 88.814 | 82 | 79 |
| PIM[ | 93.498 | 86 | 84 |
| IELT[ | 92.922 | 93 | 89 |
| SR-GNN[ | 82.317 | 71 | 68 |
| I2-HOFI[ | 93.642 | 89 | 87 |
| SIM-OFE[ | 93.257 | 86 | 83 |
| CHMRN | 93.910 | 91 | 88 |
自顶向下的特征 融合模块 | 自底向上的通道 特征信息补偿模块 | 准确率/% |
|---|---|---|
| 79.59 | ||
| √ | 93.66 | |
| √ | √ | 93.91 |
Tab. 3 Ablation experimental results of CHMRN
自顶向下的特征 融合模块 | 自底向上的通道 特征信息补偿模块 | 准确率/% |
|---|---|---|
| 79.59 | ||
| √ | 93.66 | |
| √ | √ | 93.91 |
| [1] | LI X, HAN M, SONG X, et al. Characteristics and comparative study of medicinal materials between China and India based on data mining from literatures[J]. Journal of Ethnopharmacology, 2024, 333: No.118409. |
| [2] | 赵戈伟,许升全,谢娟英. DL-MAML:一种新的蝴蝶物种自动识别模型[J]. 计算机研究与发展, 2024, 61(3): 674-684. |
| ZHAO G W, XU S Q, XIE J Y. DL-MAML: an innovative model for automatically identifying butterfly species[J]. Journal of Computer Research and Development, 2024, 61(3): 674-684. | |
| [3] | 余鹰,危伟,汤洪,等. 多层次知识自蒸馏联合多步骤训练的细粒度图像识别[J]. 计算机研究与发展, 2023, 60(8): 1834-1845. |
| YU Y, WEI W, YANG H, et al. Multi-stage training with multi-level knowledge self-distillation for fine-grained image recognition[J]. Journal of Computer Research and Development, 2023, 60(8): 1834-1845. | |
| [4] | WEI X S, SONG Y Z, AODHA O MAC, et al. Fine-grained image analysis with deep learning: a survey[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2022, 44(12): 8927-8948. |
| [5] | VAN DER KLIS R, ALANIZ S, MANCINI M, et al. PDiscoNet: semantically consistent part discovery for fine-grained recognition[C]// Proceedings of the 2023 IEEE/CVF International Conference on Computer Vision. Piscataway: IEEE, 2023: 1866-1876. |
| [6] | WANG J, XU Q, JIANG B, et al. Multi-granularity part sampling attention for fine-grained visual classification[J]. IEEE Transactions on Image Processing, 2024, 33: 4529-4542. |
| [7] | YU X, ZHAO Y, GAO Y. SPARE: self-supervised part erasing for ultra-fine-grained visual categorization[J]. Pattern Recognition, 2022, 128: No.108691. |
| [8] | ZHU L, CHEN T, YIN J, et al. Learning Gabor texture features for fine-grained recognition[C]// Proceedings of the 2023 IEEE/CVF International Conference on Computer Vision. Piscataway: IEEE, 2023: 1621-1631. |
| [9] | MA Z, WU X, CHU A, et al. SwinFG: a fine-grained recognition scheme based on Swin Transformer[J]. Expert Systems with Applications, 2024, 244: No.123021. |
| [10] | 张文丽,宋威. 基于特征融合与集成学习的细粒度图像分类[J]. 激光与光电子学进展, 2024, 61(22): No.2237010. |
| ZHANG W L, SONG W. Fine-grained image classification based on feature fusion and ensemble learning[J]. Laser and Optoelectronics Progress, 2024, 61(22): No.2237010. | |
| [11] | BAI Q, SUN Z, WANG K, et al. MPSA: multi-position supervised soft attention-based convolutional neural network for histopathological image classification[J]. Expert Systems with Applications, 2024, 253: No.124336. |
| [12] | DOSOVITSKIY A, BEYER L, KOLESNIKOV A, et al. An image is worth 16x16 words: Transformers for image recognition at scale[EB/OL]. [2023-11-26]. . |
| [13] | ZHANG Z C, CHEN Z D, WANG Y, et al. A Vision Transformer for fine-grained classification by reducing noise and enhancing discriminative information[J]. Pattern Recognition, 2024, 145: No.109979. |
| [14] | XU Q, WANG J, JIANG B, et al. Fine-grained visual classification via internal ensemble learning transformer[J]. IEEE Transactions on Multimedia, 2023, 25: 9015-9028. |
| [15] | HE J, CHEN J N, LIU S, et al. TransFG: a Transformer architecture for fine-grained recognition[C]// Proceedings of the 36th AAAI Conference on Artificial Intelligence. Palo Alto: AAAI Press, 2022: 852-860. |
| [16] | XIA Z, PAN X, SONG S, et al. Vision Transformer with deformable attention[C]// Proceedings of the 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE, 2022: 4784-4793. |
| [17] | SHU Y, VAN DEN HENGEL A, LIU L. Learning common rationale to improve self-supervised representation for fine-grained visual recognition problems[C]// Proceedings of the 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE, 2023: 11392-11401. |
| [18] | SASTRY S, KHANAL S, DHAKAL A, et al. BirdSAT: cross-view contrastive masked autoencoders for bird species classification and mapping[C]// Proceedings of the 2024 IEEE/CVF Winter Conference on Applications of Computer Vision. Piscataway: IEEE, 2024: 7136-7145. |
| [19] | PAUL D, CHOWDHURY A, XIONG X, et al. A simple interpretable transformer for fine-grained image classification and analysis[EB/OL]. [2024-11-22]. . |
| [20] | LIU D. Progressive multi-task anti-noise learning and distilling frameworks for fine-grained vehicle recognition[J]. IEEE Transactions on Intelligent Transportation Systems, 2024, 25(9): 10667-10678 . |
| [21] | WANG S, WANG Z, LI H, et al. Accurate fine-grained object recognition with structure-driven relation graph networks[J]. International Journal of Computer Vision, 2024, 132(1): 137-160. |
| [22] | CHEN H, ZHANG H, LIU C, et al. FET-FGVC: feature-enhanced transformer for fine-grained visual classification[J]. Pattern Recognition, 2024, 149: No.110265. |
| [23] | LI T, YANG J, LI C, et al. Deep recognition of Chinese herbal medicines based on a Caputo fractional order convolutional neural network[C]// Proceedings of the 2023 International Workshop on Internet of Things of Big Data for Healthcare, CCIS 2019. Cham: Springer, 2024: 41-51. |
| [24] | CHE K, LIANG Y, ZENG Y, et al. Revolutionizing traditional Chinese medicine image classification and recognition with an improved YOLOv5[C]// Proceedings of the 2nd International Conference on Health Big Data and Intelligent Healthcare. Piscataway: IEEE, 2023: 1-5. |
| [25] | LIU Z, MAO H, WU C Y, et al. A ConvNet for the 2020s[C]// Proceedings of the 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE, 2022: 11966-11976. |
| [26] | WOO S, DEBNATH S, HU R, et al. ConvNeXt V2: co-designing and scaling ConvNets with masked autoencoders[C]// Proceedings of the 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE, 2023: 16133-16142. |
| [27] | LIN T Y, DOLLÁR P, GIRSHICK R, et al. Feature pyramid networks for object detection[C]// Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE, 2017: 936-944. |
| [28] | LIU D, ZHAO L, WANG Y, et al. Learn from each other to classify better: cross-layer mutual attention learning for fine-grained visual classification[J]. Pattern Recognition, 2023, 140: No.109550. |
| [29] | DEHGHAN A, MASOOD S Z, SHU G, et al. View independent vehicle make, model and color recognition using convolutional neural network[EB/OL]. [2024-03-25]. . |
| [30] | SIKDAR A, LIU Y, KEDARISETTY S, et al. Interweaving insights: high-order feature interaction for fine-grained visual recognition[J]. International Journal of Computer Vision, 2025, 133(4): 1755-1779. |
| [31] | SUN H, HE X, XU J, et al. SIM-OFE: structure information mining and object-aware feature enhancement for fine-grained visual categorization[J]. IEEE Transactions on Image Processing, 2024, 33: 5312-5326. |
| [32] | CHOU P Y, LIN C H, KAO W C. A novel plug-in module for fine-grained visual classification[EB/OL]. [2024-09-14]. . |
| [33] | WAH C, BRANSON S, WELINDER P, et al. The Caltech-UCSD Birds-200-2011 dataset[EB/OL]. [2024-09-14]. . |
| [34] | BERA A, WHARTON Z, LIU Y, et al. SR-GNN: spatial relation-aware graph neural network for fine-grained image categorization[J]. IEEE Transactions on Image Processing, 2022, 31: 6017-6031. |
| [1] | Yuqian HUANG, Hui HUANG, Yongbin QIN, Ruizhang HUANG, Yanping CHEN, Yulin ZHOU, Qian SUN. Judicial element extraction method by integrating global and local semantics [J]. Journal of Computer Applications, 2026, 46(5): 1460-1467. |
| [2] | Jiali ZHENG, Gang ZHOU, Jing CHEN, Shunhang LI. Adaptive multi-feature fusion detection method for AI-generated text [J]. Journal of Computer Applications, 2026, 46(5): 1433-1440. |
| [3] | Wenchao MING, Suzhen LIN, Zanxia JIN. Multi-band image captioning method based on scene concept-guided feature fusion [J]. Journal of Computer Applications, 2026, 46(5): 1560-1567. |
| [4] | Chi ZHANG, Xianjing MENG, Changhao DOU, Qian WANG, Leilei GENG, Xiaoming XI. MD-FVR: cascaded finger vein recognition network based on multi-domain feature fusion [J]. Journal of Computer Applications, 2026, 46(5): 1658-1666. |
| [5] | Huijie GUO, Tianfeng DOU, Zhenlin ZHANG, Kaiyuan QI, Dong WU, Zhijian QU, Zhao LI, Chongguang REN. Time-interdependency-aware dynamic Bayesian network for traffic prediction [J]. Journal of Computer Applications, 2026, 46(5): 1507-1517. |
| [6] | Xuechao LIAO, Rui CHEN. Prediction-evaluation framework for anomaly detection in electric vehicle lithium-ion battery [J]. Journal of Computer Applications, 2026, 46(5): 1614-1623. |
| [7] | Xing SHENG, Sunxian WENG, Kuosong CHEN, Zhongping WANG, Ruifeng REN, Yong LIU. Deep learning-based patent value evaluation for power grid enterprises [J]. Journal of Computer Applications, 2026, 46(5): 1468-1474. |
| [8] | Minqi WU, Yuanhua YANG, Hang LI, Yaqin HU, Zhihao TANG, Teng MEI. Lightweight underwater small object detection based on graph Transformer and RT-DETR [J]. Journal of Computer Applications, 2026, 46(5): 1586-1595. |
| [9] | Hongrui ZHANG, Weiming FENG, Luxia YANG, Yongjie MA. CSAF-YOLO: improved YOLO11 algorithm for underwater small object detection [J]. Journal of Computer Applications, 2026, 46(5): 1578-1585. |
| [10] | Xinyi YAN, Linglong ZHU, Yonghong ZHANG. CDC-DETR: multi-scale real-time human-vehicle detection method for complex traffic scenarios [J]. Journal of Computer Applications, 2026, 46(4): 1283-1291. |
| [11] | Shuai HE, Chunhua DENG. Object detection algorithm with few-shot learning based on YOLO-World [J]. Journal of Computer Applications, 2026, 46(4): 1275-1282. |
| [12] | Haoxuan CHEN, Peichang YE, Lei LIU, Chengming LIU, Wenhua HU. Survey of automated code edit suggestion [J]. Journal of Computer Applications, 2026, 46(4): 1227-1237. |
| [13] | Wenhao LI, Yinzhang GUO. Urban traffic flow prediction based on dual-layer multi-scale dynamic graph convolutional network model [J]. Journal of Computer Applications, 2026, 46(4): 1323-1333. |
| [14] | Huanxian LIU, Hongtao WANG, Xian’ao WANG, Hongmei WANG, Weifeng XU. Multimodal fact verification with cross-modal semantic association [J]. Journal of Computer Applications, 2026, 46(4): 1069-1076. |
| [15] | Jian ZHANG, Jianbo YU, Jian TANG. Municipal solid waste incineration state recognition method based on multilayer preprocessing [J]. Journal of Computer Applications, 2026, 46(3): 940-949. |
| Viewed | ||||||
|
Full text |
|
|||||
|
Abstract |
|
|||||