Journal of Computer Applications ›› 2023, Vol. 43 ›› Issue (5): 1557-1564.DOI: 10.11772/j.issn.1001-9081.2022040554
Special Issue: 多媒体计算与计算机仿真
• Multimedia computing and computer simulation • Previous Articles Next Articles
Hui LIU1,2, Linyu ZHANG1,2(
), Fugang WANG1,2, Rujin HE1,2
Received:2022-04-19
Revised:2022-06-20
Accepted:2022-06-22
Online:2022-07-11
Published:2023-05-10
Contact:
Linyu ZHANG
About author:LIU Hui, born in 1966, M. S., senior engineer. His research interests include computer vision, new technology of communication network, telecommunication system service.
刘辉1,2, 张琳玉1,2(
), 王复港1,2, 何如瑾1,2
通讯作者:
张琳玉
作者简介:刘辉(1966—),男,四川仪陇人,高级工程师,硕士,主要研究方向:计算机视觉、通信网络新技术、电信系统业务CLC Number:
Hui LIU, Linyu ZHANG, Fugang WANG, Rujin HE. Object detection algorithm based on attention mechanism and context information[J]. Journal of Computer Applications, 2023, 43(5): 1557-1564.
刘辉, 张琳玉, 王复港, 何如瑾. 基于注意力机制和上下文信息的目标检测算法[J]. 《计算机应用》唯一官方网站, 2023, 43(5): 1557-1564.
Add to citation manager EndNote|Ris|BibTeX
URL: https://www.joca.cn/EN/10.11772/j.issn.1001-9081.2022040554
| 配置项 | 训练 | 测试 |
|---|---|---|
| 编程语言 | Python | Python |
| 深度学习框架 | Pytorch1.8.0 | Pytorch1.8.0 |
| 操作系统 | Windows 10 | Windows 10 |
| CPU | Core i9-10980XE | Core i5-11400F |
| 内存 | 128 GB | 16 GB |
| GPU | Nvidia RTX 3080 | Nvidia RTX3060 |
| CUDA | 11.1 | 11.1 |
Tab. 1 Experimental environment configuration
| 配置项 | 训练 | 测试 |
|---|---|---|
| 编程语言 | Python | Python |
| 深度学习框架 | Pytorch1.8.0 | Pytorch1.8.0 |
| 操作系统 | Windows 10 | Windows 10 |
| CPU | Core i9-10980XE | Core i5-11400F |
| 内存 | 128 GB | 16 GB |
| GPU | Nvidia RTX 3080 | Nvidia RTX3060 |
| CUDA | 11.1 | 11.1 |
| 算法 | mAP/% | FPS |
|---|---|---|
| YOLOv5 | 79.10 | 108 |
| YOLOv5+MDSCM | 80.00 | 91 |
| YOLOv5+CA | 81.10 | 108 |
| YOLOv5+GCA | 81.40 | 108 |
| YOLOv5+Soft-NMS | 79.60 | 104 |
| YOLOv5+MDSCM+GCA | 81.70 | 91 |
| YOLOv5+MDSCM+Soft-NMS | 80.90 | 90 |
| YOLOv5+GCA+Soft-NMS | 82.00 | 106 |
| AC-YOLO | 82.80 | 90 |
Tab. 2 Ablation experimental results on PASCAL VOC dataset
| 算法 | mAP/% | FPS |
|---|---|---|
| YOLOv5 | 79.10 | 108 |
| YOLOv5+MDSCM | 80.00 | 91 |
| YOLOv5+CA | 81.10 | 108 |
| YOLOv5+GCA | 81.40 | 108 |
| YOLOv5+Soft-NMS | 79.60 | 104 |
| YOLOv5+MDSCM+GCA | 81.70 | 91 |
| YOLOv5+MDSCM+Soft-NMS | 80.90 | 90 |
| YOLOv5+GCA+Soft-NMS | 82.00 | 106 |
| AC-YOLO | 82.80 | 90 |
| 网络 | 尺寸大小 | mAP/% | FPS |
|---|---|---|---|
| Faster RCNN | 640×640 | 73.32 | 5 |
| SSD | 640×640 | 77.66 | 54 |
| YOLOv3 | 640×640 | 72.34 | 60 |
| Tiny-YOLOv3 | 640×640 | 73.28 | 91 |
| YOLOv5 | 640×640 | 79.10 | 108 |
| AC-YOLO | 640×640 | 82.80 | 90 |
Tab. 3 Performance comparison of different networks on PASCAL VOC dataset
| 网络 | 尺寸大小 | mAP/% | FPS |
|---|---|---|---|
| Faster RCNN | 640×640 | 73.32 | 5 |
| SSD | 640×640 | 77.66 | 54 |
| YOLOv3 | 640×640 | 72.34 | 60 |
| Tiny-YOLOv3 | 640×640 | 73.28 | 91 |
| YOLOv5 | 640×640 | 79.10 | 108 |
| AC-YOLO | 640×640 | 82.80 | 90 |
| 类别 | AP(IoU=0.5) | |||
|---|---|---|---|---|
| YOLOv3 | SSD | YOLOv5 | 本文算法 | |
| Aero | 81.20 | 75.50 | 87.70 | 89.20 |
| Bike | 80.30 | 80.20 | 89.30 | 91.00 |
| Bird | 74.00 | 72.30 | 74.30 | 80.80 |
| Boat | 65.50 | 66.30 | 70.80 | 73.90 |
| Bottle | 64.10 | 47.60 | 71.60 | 71.80 |
| Bus | 81.50 | 83.00 | 85.50 | 89.60 |
| Car | 82.20 | 84.20 | 91.70 | 92.00 |
| Cat | 83.10 | 86.10 | 83.20 | 89.70 |
| Chair | 61.23 | 54.70 | 61.90 | 67.70 |
| Cow | 77.30 | 78.30 | 82.00 | 85.90 |
| Table | 75.20 | 73.90 | 73.80 | 77.50 |
| Dog | 82.20 | 84.50 | 81.00 | 88.00 |
| Horse | 84.69 | 85.30 | 87.90 | 91.40 |
| Mbike | 81.29 | 82.60 | 86.60 | 89.10 |
| Person | 78.46 | 76.20 | 86.60 | 88.50 |
| Plant | 52.18 | 48.60 | 52.40 | 57.80 |
| Sheep | 77.52 | 73.90 | 81.70 | 84.70 |
| Sofa | 74.41 | 76.00 | 70.80 | 78.20 |
| Train | 81.66 | 83.40 | 83.50 | 87.30 |
| TV | 71.99 | 74.00 | 79.80 | 82.00 |
Tab. 4 Comparison of precisions under different network structures on each category of PASCAL VOC dataset
| 类别 | AP(IoU=0.5) | |||
|---|---|---|---|---|
| YOLOv3 | SSD | YOLOv5 | 本文算法 | |
| Aero | 81.20 | 75.50 | 87.70 | 89.20 |
| Bike | 80.30 | 80.20 | 89.30 | 91.00 |
| Bird | 74.00 | 72.30 | 74.30 | 80.80 |
| Boat | 65.50 | 66.30 | 70.80 | 73.90 |
| Bottle | 64.10 | 47.60 | 71.60 | 71.80 |
| Bus | 81.50 | 83.00 | 85.50 | 89.60 |
| Car | 82.20 | 84.20 | 91.70 | 92.00 |
| Cat | 83.10 | 86.10 | 83.20 | 89.70 |
| Chair | 61.23 | 54.70 | 61.90 | 67.70 |
| Cow | 77.30 | 78.30 | 82.00 | 85.90 |
| Table | 75.20 | 73.90 | 73.80 | 77.50 |
| Dog | 82.20 | 84.50 | 81.00 | 88.00 |
| Horse | 84.69 | 85.30 | 87.90 | 91.40 |
| Mbike | 81.29 | 82.60 | 86.60 | 89.10 |
| Person | 78.46 | 76.20 | 86.60 | 88.50 |
| Plant | 52.18 | 48.60 | 52.40 | 57.80 |
| Sheep | 77.52 | 73.90 | 81.70 | 84.70 |
| Sofa | 74.41 | 76.00 | 70.80 | 78.20 |
| Train | 81.66 | 83.40 | 83.50 | 87.30 |
| TV | 71.99 | 74.00 | 79.80 | 82.00 |
| 类别 | AP(IoU=0.5) | |||
|---|---|---|---|---|
| YOLOv3 | SSD | YOLOv5 | 本文算法 | |
| mAP | 67.97 | 41.98 | 70.25 | 71.74 |
| Small- vehicle | 66.80 | 10.05 | 66.30 | 67.80 |
| Large-vehicle | 81.70 | 50.20 | 83.60 | 85.90 |
| Plane | 86.20 | 64.70 | 90.80 | 91.80 |
| Storage-tank | 69.90 | 57.90 | 69.70 | 74.90 |
| Ship | 84.30 | 31.30 | 86.80 | 88.10 |
| Harbor | 80.50 | 80.50 | 84.00 | 82.20 |
| Ground track-field | 56.70 | 24.90 | 61.90 | 59.30 |
| Soccer ball field | 51.70 | 22.70 | 52.50 | 55.60 |
| Tennis-court | 89.40 | 85.50 | 94.00 | 93.40 |
| Swimming pool | 60.30 | 18.50 | 62.90 | 64.10 |
| Baseball diamond | 73.60 | 38.20 | 76.90 | 74.00 |
| Roundabout | 50.30 | 44.50 | 58.00 | 59.40 |
| Basketball court | 60.40 | 62.50 | 64.40 | 66.20 |
| Bridge | 46,10 | 26.20 | 47.80 | 50.40 |
| Helicopter | 51.80 | 12.10 | 54.20 | 63.00 |
Tab. 5 Comparison of precisions under different network structures on each category of DOTA dataset
| 类别 | AP(IoU=0.5) | |||
|---|---|---|---|---|
| YOLOv3 | SSD | YOLOv5 | 本文算法 | |
| mAP | 67.97 | 41.98 | 70.25 | 71.74 |
| Small- vehicle | 66.80 | 10.05 | 66.30 | 67.80 |
| Large-vehicle | 81.70 | 50.20 | 83.60 | 85.90 |
| Plane | 86.20 | 64.70 | 90.80 | 91.80 |
| Storage-tank | 69.90 | 57.90 | 69.70 | 74.90 |
| Ship | 84.30 | 31.30 | 86.80 | 88.10 |
| Harbor | 80.50 | 80.50 | 84.00 | 82.20 |
| Ground track-field | 56.70 | 24.90 | 61.90 | 59.30 |
| Soccer ball field | 51.70 | 22.70 | 52.50 | 55.60 |
| Tennis-court | 89.40 | 85.50 | 94.00 | 93.40 |
| Swimming pool | 60.30 | 18.50 | 62.90 | 64.10 |
| Baseball diamond | 73.60 | 38.20 | 76.90 | 74.00 |
| Roundabout | 50.30 | 44.50 | 58.00 | 59.40 |
| Basketball court | 60.40 | 62.50 | 64.40 | 66.20 |
| Bridge | 46,10 | 26.20 | 47.80 | 50.40 |
| Helicopter | 51.80 | 12.10 | 54.20 | 63.00 |
| 类别 | AP(IoU=0.5) | |||
|---|---|---|---|---|
| YOLOv3 | SSD | YOLOv5 | 本文算法 | |
| mAP | 58.63 | 51.58 | 74.63 | 77.11 |
| Airplane | 59.60 | 49.40 | 89.10 | 93.10 |
| Airport | 72.70 | 63.10 | 78.10 | 80.90 |
| Baseball field | 73.40 | 66.60 | 81.90 | 79.90 |
| Basketball court | 75.70 | 71.10 | 80.00 | 84.40 |
| Bridge | 29.70 | 26.50 | 69.20 | 76.00 |
| Chimney | 65.60 | 63.30 | 89.70 | 81.70 |
| Dam | 56.60 | 54.30 | 73.10 | 77.10 |
| Expressway service area | 63.50 | 62.70 | 70.50 | 67.60 |
| Expressway toll station | 53.10 | 46.60 | 58.50 | 70.00 |
| Golf course | 65.30 | 64.40 | 70.30 | 66.70 |
| Ground track field | 68.60 | 53.10 | 66.20 | 75.70 |
| Harbor | 49.40 | 44.20 | 69.30 | 75.50 |
| Overpass | 48.10 | 35.70 | 78.80 | 76.70 |
| Ship | 59.20 | 58.30 | 80.30 | 87.00 |
| Stadium | 61.00 | 41.10 | 60.90 | 65.80 |
| Storage tank | 46.60 | 72.60 | 70.40 | 70.10 |
| Tennis court | 76.30 | 37.50 | 85.20 | 88.70 |
| Train station | 55.10 | 22.70 | 66.80 | 63.50 |
| Vehicle | 27.40 | 47.10 | 76.70 | 81.20 |
| Wind mill | 65.70 | 51.20 | 77.50 | 80.50 |
Tab. 6 Comparison of precisionsunder different network structures on each category of DIOR dataset
| 类别 | AP(IoU=0.5) | |||
|---|---|---|---|---|
| YOLOv3 | SSD | YOLOv5 | 本文算法 | |
| mAP | 58.63 | 51.58 | 74.63 | 77.11 |
| Airplane | 59.60 | 49.40 | 89.10 | 93.10 |
| Airport | 72.70 | 63.10 | 78.10 | 80.90 |
| Baseball field | 73.40 | 66.60 | 81.90 | 79.90 |
| Basketball court | 75.70 | 71.10 | 80.00 | 84.40 |
| Bridge | 29.70 | 26.50 | 69.20 | 76.00 |
| Chimney | 65.60 | 63.30 | 89.70 | 81.70 |
| Dam | 56.60 | 54.30 | 73.10 | 77.10 |
| Expressway service area | 63.50 | 62.70 | 70.50 | 67.60 |
| Expressway toll station | 53.10 | 46.60 | 58.50 | 70.00 |
| Golf course | 65.30 | 64.40 | 70.30 | 66.70 |
| Ground track field | 68.60 | 53.10 | 66.20 | 75.70 |
| Harbor | 49.40 | 44.20 | 69.30 | 75.50 |
| Overpass | 48.10 | 35.70 | 78.80 | 76.70 |
| Ship | 59.20 | 58.30 | 80.30 | 87.00 |
| Stadium | 61.00 | 41.10 | 60.90 | 65.80 |
| Storage tank | 46.60 | 72.60 | 70.40 | 70.10 |
| Tennis court | 76.30 | 37.50 | 85.20 | 88.70 |
| Train station | 55.10 | 22.70 | 66.80 | 63.50 |
| Vehicle | 27.40 | 47.10 | 76.70 | 81.20 |
| Wind mill | 65.70 | 51.20 | 77.50 | 80.50 |
| 1 | GIRSHICK R, DONAHUE J, DARRELL T, et al. Rich feature hierarchies for accurate object detection and semantic segmentation[C]// Proceeding of 2014 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE, 2014:580-587. 10.1109/cvpr.2014.81 |
| 2 | GIRSHICK R. Fast R-CNN[C]// Proceeding of the 2015 IEEE International Conference on Computer Vision. Piscataway: IEEE, 2015:1440-1448. 10.1109/iccv.2015.169 |
| 3 | REDMON J, DIVVALA S, GIRSHICK R, et al. You only look once: unified, real-time object detection[C]// Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE, 2016: 779-788. 10.1109/cvpr.2016.91 |
| 4 | REDMON J, FARHADI A. YOLO9000: better, faster, stronger[C]// Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE, 2017: 6517-6525. 10.1109/cvpr.2017.690 |
| 5 | REDMON J, FARHADI A. YOLOv3: an incremental improvement[EB/OL]. (2018-04-08) [2022-04-18].. 10.1109/cvpr.2017.690 |
| 6 | LIU W, ANGUELOV D, ERHAN D, et al. SSD: single shot multiBox detector[C]// Proceedings of the 2016 European Conference on Computer Vision, LNCS 9905. Cham: Springer, 2016: 21-37. |
| 7 | ZHANG S, WEN L, BIAN X, et al. Single-shot refinement neural network for object detection[C]// Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE, 2018: 4203-4212. 10.1109/cvpr.2018.00442 |
| 8 | LIM J S, ASTRID M, YOON H J, et al. Small object detection using context and attention[C]// Proceedings of the 2021 International Conference on Artificial Intelligence in Information and Communication. Piscataway: IEEE, 2021: 181-186. 10.1109/icaiic51459.2021.9415217 |
| 9 | 许腾,唐贵进,刘清萍,等. 基于空洞卷积和Focal Loss的改进YOLOv3算法[J]. 南京邮电大学学报(自然科学版), 2020, 40(6):100-108. 10.14132/j.cnki.1673-5439.2020.06.015 |
| XU T, TANG G J, LIU Q P, et al. Improved YOLOv3 based on dilated convolution and Focal Loss[J]. Journal of Nanjing University of Posts and Telecommunications (Natural Science Edition), 2020, 40(6):100-108. 10.14132/j.cnki.1673-5439.2020.06.015 | |
| 10 | CAO G M, XIE X M, YANG W Z, et al. Feature-fused SSD: fast detection for small objects[C]// Proceedings of the 9th International Conference on Graphic and Image Processing. Bellingham, WA: SPIE, 2018: No.106151E. 10.1117/12.2304811 |
| 11 | 董如婵,焦李成,赵进,等. 一种深度融合机制的遥感图像目标检测技术[J]. 西安电子科技大学学报, 2021, 48(5): 128-138. |
| DONG R C, JIAO L C, ZHAO J, et al. Application of the deep fusion mechanism in object detection of remote sensing images[J]. Journal of Xidian University, 2021, 48(5):128-138. | |
| 12 | 马巧梅,王明俊,梁昊然. 复杂场景下基于改进YOLOv3的车牌定位检测算法[J]. 计算机工程与应用, 2021, 57(7):198-208. 10.3778/j.issn.1002-8331.2008-0137 |
| MA Q M, WANG M J, LIANG H R. License plate location detection algorithm based on improved YOLOv3 in complex scenes[J]. Computer Engineering and Applications, 2021, 57(7):198-208. 10.3778/j.issn.1002-8331.2008-0137 | |
| 13 | ZHAO H S, SHI J P, QI X J, et al. Pyramid scene parsing network[C]// Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE, 2017: 6230-6239. 10.1109/cvpr.2017.660 |
| 14 | ZHANG F, JIAO L C, LI L L, et al. Multiresolution attention extractor for small object detection[EB/OL]. (2020-06-10) [2022-04-18].. 10.48550/arXiv.2006.05941 |
| 15 | Ultralytics. YOLOv5[EB/OL]. [2022-01-18].. 10.1117/1.jei.31.3.033033 |
| 16 | XU B Q, JIANG G W, LIU J H, et al. Aircraft rotated boxes detection method based on YOLOv5[C]// Proceedings of the 4th International Conference on Pattern Recognition and Artificial Intelligence. Piscataway: IEEE, 2021: 390-394. 10.1109/prai53619.2021.9551072 |
| 17 | WANG C Y, LIAO H Y M, WU Y H, et al. CSPNet: a new backbone that can enhance learning capability of CNN[C]// Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops. Piscataway: IEEE, 2020: 1571-1580. 10.1109/cvprw50498.2020.00203 |
| 18 | HE K M, ZHANG X Y, REN S Q, et al. Spatial pyramid pooling in deep convolutional networks for visual recognition[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2015, 37(9): 1904-1916. 10.1109/tpami.2015.2389824 |
| 19 | LIN T Y, DOLLÁR P, GIRSHISICK R, et al. Feature pyramid networks for object detection[C]// Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE, 2017: 936-944. 10.1109/cvpr.2017.106 |
| 20 | LIU S, QI L, QIN H, et al. Path aggregation network for instance segmentation[C]// Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE, 2018: 8759-8768. 10.1109/cvpr.2018.00913 |
| 21 | CHEN Q, WANG Y M, YANG T, et al. You only look one-level feature[C]// Proceedings of the 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE, 2021: 13034-13043. 10.1109/cvpr46437.2021.01284 |
| 22 | CHEN L C, PAPANDREOU G, SCHROFF F, et al. Rethinking atrous convolution for semantic image segmentation[EB/OL]. (2017-12-05) [2022-04-06].. 10.1007/978-3-030-01234-2_49 |
| 23 | HU J, SHEN L, SUN G. Squeeze-and-excitation networks[C]// Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE, 2018: 7132-7141. 10.1109/cvpr.2018.00745 |
| 24 | WOO S, PARK J, LEE J Y, et al. CBAM: convolutional block attention module[C]// Proceedings of the 2018 European Conference on Computer Vision, LNCS 11211. Cham: Springer, 2018: 3-19. |
| 25 | HOU Q, ZHOU D, FENG J. Coordinate attention for efficient mobile network design[C]// Proceedings of the 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE, 2021: 13713-13722. 10.1109/cvpr46437.2021.01350 |
| 26 | JIANG B R, LUO R X, MAO J Y, et al. Acquisition of localization confidence for accurate object detection[C]// Proceedings of the 2018 European Conference on Computer Vision, LNCS 11218. Cham: Springer, 2018: 816-832. |
| 27 | NING C C, ZHOU H J, SONG Y, et al. Inception single shot multiBox detector for object detection[C]// Proceedings of the 2017 IEEE International Conference on Multimedia and Expo Workshops. Piscataway: IEEE, 2017: 549-554. 10.1109/icmew.2017.8026312 |
| 28 | BODLA N, SINGH B, CHELLAPPA R, et al. Soft-NMS - improving object detection with one line of code[C]// Proceedings of the 2017 IEEE International Conference on Computer Vision. Piscataway: IEEE, 2017: 5562-5570. 10.1109/iccv.2017.593 |
| 29 | WANG P Q, CHEN P F, YUAN Y, et al. Understanding convolution for semantic segmentation[C]// Proceedings of the 2018 IEEE Winter Conference on Applications of Computer Vision. Piscataway: IEEE, 2018: 1451-1460. 10.1109/wacv.2018.00163 |
| 30 | EVERINGHAM M, ESLAMI S M A, van GOOL L, et al. The PASCAL Visual Object Classes challenge: a retrospective[J]. International Journal of Computer Vision, 2015, 111(1):98-136. 10.1007/s11263-014-0733-5 |
| 31 | XIA G S, BAI X, DING J, et al. DOTA: a large-scale dataset for object detection in aerial images[C]// Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE, 2018: 3974-3983. 10.1109/cvpr.2018.00418 |
| 32 | LI K, WAN G, CHENG G, et al. Object detection in optical remote sensing images: a survey and a new benchmark[J]. ISPRS Journal of Photogrammetry and Remote Sensing, 2020, 159: 296-307. 10.1016/j.isprsjprs.2019.11.023 |
| [1] | Jing QIN, Zhiguang QIN, Fali LI, Yueheng PENG. Diagnosis of major depressive disorder based on probabilistic sparse self-attention neural network [J]. Journal of Computer Applications, 2024, 44(9): 2970-2974. |
| [2] | Liting LI, Bei HUA, Ruozhou HE, Kuang XU. Multivariate time series prediction model based on decoupled attention mechanism [J]. Journal of Computer Applications, 2024, 44(9): 2732-2738. |
| [3] | Yexin PAN, Zhe YANG. Optimization model for small object detection based on multi-level feature bidirectional fusion [J]. Journal of Computer Applications, 2024, 44(9): 2871-2877. |
| [4] | Zhiqiang ZHAO, Peihong MA, Xinhong HEI. Crowd counting method based on dual attention mechanism [J]. Journal of Computer Applications, 2024, 44(9): 2886-2892. |
| [5] | Yeheng LI, Guangsheng LUO, Qianmin SU. Logo detection algorithm based on improved YOLOv5 [J]. Journal of Computer Applications, 2024, 44(8): 2580-2587. |
| [6] | Yingjun ZHANG, Niuniu LI, Binhong XIE, Rui ZHANG, Wangdong LU. Semi-supervised object detection framework guided by curriculum learning [J]. Journal of Computer Applications, 2024, 44(8): 2326-2333. |
| [7] | Kaipeng XUE, Tao XU, Chunjie LIAO. Multimodal sentiment analysis network with self-supervision and multi-layer cross attention [J]. Journal of Computer Applications, 2024, 44(8): 2387-2392. |
| [8] | Pengqi GAO, Heming HUANG, Yonghong FAN. Fusion of coordinate and multi-head attention mechanisms for interactive speech emotion recognition [J]. Journal of Computer Applications, 2024, 44(8): 2400-2406. |
| [9] | Zhonghua LI, Yunqi BAI, Xuejin WANG, Leilei HUANG, Chujun LIN, Shiyu LIAO. Low illumination face detection based on image enhancement [J]. Journal of Computer Applications, 2024, 44(8): 2588-2594. |
| [10] | Shangbin MO, Wenjun WANG, Ling DONG, Shengxiang GAO, Zhengtao YU. Single-channel speech enhancement based on multi-channel information aggregation and collaborative decoding [J]. Journal of Computer Applications, 2024, 44(8): 2611-2617. |
| [11] | Wu XIONG, Congjun CAO, Xuefang SONG, Yunlong SHAO, Xusheng WANG. Handwriting identification method based on multi-scale mixed domain attention mechanism [J]. Journal of Computer Applications, 2024, 44(7): 2225-2232. |
| [12] | Huanhuan LI, Tianqiang HUANG, Xuemei DING, Haifeng LUO, Liqing HUANG. Public traffic demand prediction based on multi-scale spatial-temporal graph convolutional network [J]. Journal of Computer Applications, 2024, 44(7): 2065-2072. |
| [13] | Dianhui MAO, Xuebo LI, Junling LIU, Denghui ZHANG, Wenjing YAN. Chinese entity and relation extraction model based on parallel heterogeneous graph and sequential attention mechanism [J]. Journal of Computer Applications, 2024, 44(7): 2018-2025. |
| [14] | Li LIU, Haijin HOU, Anhong WANG, Tao ZHANG. Generative data hiding algorithm based on multi-scale attention [J]. Journal of Computer Applications, 2024, 44(7): 2102-2109. |
| [15] | Song XU, Wenbo ZHANG, Yifan WANG. Lightweight video salient object detection network based on spatiotemporal information [J]. Journal of Computer Applications, 2024, 44(7): 2192-2199. |
| Viewed | ||||||
|
Full text |
|
|||||
|
Abstract |
|
|||||