Review of end-to-end person search algorithms based on images

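Cui WANG¹, Miaolei DENG², Dexian ZHANG³, Lei LI¹, Xiaoyan YANG¹

1. Henan University of Technology
2. College of Information Science and Engineering, Henan University of Technology
3. College of Information Science and Engineering, Henan University of Technology, Zhengzhou 450001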

Received: 2023-09-05 Revised: 2023-10-24 Online: 2023-12-18
Corresponding author: Miaolei DENG
Funding: Major Public Welfare Project of Henan Province; National Natural Science Foundation of China

Abstract

Abstract: Person search is an important research direction in computer vision; its goal is to detect and identify persons in galleries of uncropped images. Although many person search algorithms have been proposed, survey work that summarizes them remains insufficient. To gain a deeper understanding of person search algorithms, a large body of related literature is summarized and analyzed. First, according to network structure, person search methods are divided into two categories: traditional two-step methods and end-to-end one-step methods. The key technologies of one-step methods, feature learning and metric learning, are analyzed and introduced in detail. Next, the datasets and evaluation metrics used in person search are introduced, and the performance of mainstream algorithms is compared and analyzed. The experimental results show that although two-step methods achieve good performance, most of them are computationally expensive and time-consuming, whereas one-step methods solve the two subtasks jointly within a more efficient learning framework and achieve better results. Finally, person search algorithms are summarized and future research directions are discussed.

Key words: person search, one-step method, end-to-end, computer vision, transformer

CLC Number: TP391

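Cui WANG, Miaolei DENG, Dexian ZHANG, Lei LI, Xiaoyan YANG. Review of end-to-end person search algorithms based on images[J]. Journal of Computer Applications.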
