Journal of Computer Applications, 2024, Vol. 44, Issue (11): 3556-3564. DOI: 10.11772/j.issn.1001-9081.2023111661

• Multimedia computing and computer simulation •

Multi-view stereo method based on quadtree prior assistance

Lihua HU1, Xiaoping LI1, Jianhua HU2, Sulan ZHANG1

  1. College of Computer Science and Technology, Taiyuan University of Science and Technology, Taiyuan, Shanxi 030024, China
  2. Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China
  • Received: 2023-12-04 Revised: 2024-05-25 Accepted: 2024-05-30 Online: 2024-07-25 Published: 2024-11-10
  • Contact: Lihua HU
  • About the authors: LI Xiaoping, born in 1998, M.S. candidate. Her research interests include computer vision.
    HU Jianhua, born in 1987, Ph.D., associate research fellow. His research interests include intelligent robots and 3D vision.
    ZHANG Sulan, born in 1971, Ph.D., professor. Her research interests include computer vision and machine learning.
  • Supported by:
    National Natural Science Foundation of China (62273248); Natural Science Foundation of Shanxi Province (202103021224285); Science and Technology Service Network Initiative Program of Chinese Academy of Sciences (STS-HP-202202)

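Multi-view stereo method based on quadtree prior assistance

HU Lihua1, LI Xiaoping1, HU Jianhua2, ZHANG Sulan1

  1. College of Computer Science and Technology, Taiyuan University of Science and Technology, Taiyuan 030024
  2. Institute of Automation, Chinese Academy of Sciences, Beijing 100190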
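  • Corresponding author: HU Lihua
  • About the authors: LI Xiaoping (born 1998), female, from Qingdao, Shandong, M.S. candidate; main research interest: computer vision.
    HU Jianhua (born 1987), male, from Xinzhou, Shanxi, associate research fellow, Ph.D.; main research interests: intelligent robots, 3D vision.
    ZHANG Sulan (born 1971), female, from Changzhi, Shanxi, professor, Ph.D., CCF member; main research interests: computer vision, machine learning.
  • Funding:
    National Natural Science Foundation of China (62273248); Natural Science Foundation of Shanxi Province (202103021224285); Science and Technology Service Network Initiative Program of Chinese Academy of Sciences (STS-HP-202202)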

Abstract:

PatchMatch-based Multi-View Stereo (MVS) methods estimate the depth of a scene from multiple input images and are currently applied in large-scale 3D scene reconstruction. However, existing methods achieve low accuracy and completeness in depth estimation for low-texture regions, because feature matching is unstable there and relying on photometric consistency alone is unreliable. To address these problems, an MVS method based on quadtree prior assistance was proposed. Firstly, the image pixel values were used to obtain local textures. Secondly, a coarse depth map was obtained by Adaptive Checkerboard sampling and Multi-Hypothesis joint view selection (ACMH), and, combined with the structural information in the low-texture regions, quadtree segmentation was used to generate prior plane hypotheses. Thirdly, by integrating the above information, a new multi-view matching cost function was designed to guide the low-texture regions toward the best depth hypothesis, thereby improving the accuracy of stereo matching. Finally, comparison experiments were conducted against several existing traditional MVS methods on the ETH3D, Tanks and Temples, and Chinese Academy of Sciences ancient architecture datasets. The results demonstrate that the proposed method performs better: in particular, on the ETH3D test dataset with an error threshold of 2 cm, its F1 score and completeness are improved by 1.29 and 2.38 percentage points, respectively, compared with the current state-of-the-art multi-scale geometric consistency guided and planar prior assisted multi-view stereo method (ACMMP).

Key words: Multi-View Stereo (MVS), depth estimation, matching cost, low-texture region, quadtree prior
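
The quadtree segmentation step mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the splitting criterion (intensity variance), the threshold `var_thresh`, the minimum block size `min_size`, and the function name `quadtree_leaves` are all assumptions chosen for illustration. The idea shown is only that a block is subdivided while it looks textured, so low-texture areas remain as large leaves, each of which could then be given a single prior plane hypothesis.

```python
from statistics import pvariance


def quadtree_leaves(img, x, y, w, h, var_thresh=25.0, min_size=2):
    """Return leaf blocks (x, y, w, h) of a variance-driven quadtree.

    Assumption for illustration: a block counts as low-texture when the
    population variance of its gray values is at most var_thresh.
    """
    values = [img[r][c] for r in range(y, y + h) for c in range(x, x + w)]
    low_texture = pvariance(values) <= var_thresh
    smallest = max(min_size, 1)  # never split a block of width/height 1
    if low_texture or w <= smallest or h <= smallest:
        return [(x, y, w, h)]

    # Split into four children; odd sizes give the remainder to the
    # right/bottom children so the leaves always tile the parent exactly.
    hw, hh = w // 2, h // 2
    children = (
        (x, y, hw, hh),
        (x + hw, y, w - hw, hh),
        (x, y + hh, hw, h - hh),
        (x + hw, y + hh, w - hw, h - hh),
    )
    leaves = []
    for cx, cy, cw, ch in children:
        leaves.extend(quadtree_leaves(img, cx, cy, cw, ch, var_thresh, min_size))
    return leaves


if __name__ == "__main__":
    # Toy 8x8 image: flat left half, checkerboard (textured) right half.
    img = [[100] * 4 + [255 * ((r + c) % 2) for c in range(4, 8)]
           for r in range(8)]
    for leaf in quadtree_leaves(img, 0, 0, 8, 8):
        print(leaf)
```

On the toy image, the flat half stays in large blocks while the checkerboard half is split down to the minimum block size. In a real pipeline the large leaves would be paired with the coarse ACMH depth map to fit planes; that step is outside this sketch.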

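Abstract:

PatchMatch-based Multi-View Stereo (MVS) methods estimate scene depth from multiple input images and are now applied to large-scale 3D scene reconstruction. However, because feature matching is unstable and relying on photometric consistency alone is unreliable, existing methods achieve low accuracy and completeness in depth estimation for low-texture regions. To address these problems, an MVS method based on quadtree prior assistance is proposed. First, local textures are obtained from the image pixel values. Second, a coarse depth map is obtained with the PatchMatch multi-view stereo method based on adaptive checkerboard sampling (ACMH), and, combined with the structural information in low-texture regions, quadtree segmentation is used to generate prior plane hypotheses. Third, the above information is fused to design a new multi-view matching cost function that guides low-texture regions toward the optimal depth hypothesis, thereby improving the accuracy of stereo matching. Finally, comparison experiments against several existing traditional MVS methods are carried out on the ETH3D, Tanks and Temples, and Chinese Academy of Sciences ancient architecture datasets. The results show that the proposed method performs better; in particular, on the ETH3D test dataset with an error threshold of 2 cm, its F1 score and completeness are 1.29 and 2.38 percentage points higher, respectively, than those of the current advanced multi-scale planar prior assisted method (ACMMP).

Key words: multi-view stereo, depth estimation, matching cost, low-texture region, quadtree prior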

CLC Number: