Survey on BEV 3D target detection algorithm system

doi:10.11772/j.issn.1001-9081.2025040419

Journal of Computer Applications (《计算机应用》), official website

郭阳^1,2, 王海亮^1,2, 高需^1,2*, 王海涛^1,2, 王翌博^1,2

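1. School of Computer and Artificial Intelligence, Zhengzhou University, Zhengzhou 450001; 2. National Supercomputing Center in Zhengzhou, Zhengzhou University, Zhengzhou 450001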

Received: 2025-04-18 Revised: 2025-07-24 Accepted: 2025-07-25 Online: 2025-07-30 Published: 2025-07-30
Corresponding author: 高需
Supported by:
Zhengzhou City Major Science and Technology Innovation Project; Postgraduate Education Reform and Quality Improvement Project of Henan Province

Abstract

Visual perception, one of the core technologies for environmental understanding, provides accurate environmental information to intelligent mobile systems such as autonomous vehicles and is an important prerequisite for safe decision-making. Owing to its efficiency and accuracy, three-dimensional (3D) object detection based on Bird's Eye View (BEV) has become the mainstream paradigm in environmental perception. To further promote research on BEV-based 3D object detection algorithms, the covered algorithms were first classified systematically by input data modality into camera-only, LiDAR-only, and camera-LiDAR fusion algorithms. Second, the role of pre-training algorithms in improving detection performance was explored. Third, the advantages of algorithms that fuse temporal features in dynamic scenes and the performance of algorithms that fuse height features in complex environments were analyzed. Fourth, the breakthrough progress that large models working together with BEV object detection have made in detection accuracy and scene understanding was reviewed. Finally, the core conclusions were summarized and future research directions were discussed, with the aim of providing new ideas for research in this field.

Key words: Bird's Eye View (BEV), 3D object detection, pre-training, temporal features, height features, large model

CLC number: TP301.6

郭阳, 王海亮, 高需, 王海涛, 王翌博. Survey on BEV 3D target detection algorithm system[J]. Journal of Computer Applications, DOI: 10.11772/j.issn.1001-9081.2025040419.
