《计算机应用》唯一官方网站

• •    下一篇

联合球面对齐与自适应几何校正的全景视频超分辨率网络

陈晓雷,郑芷薇,黄雪,曲振彬   

  1. 兰州理工大学 电气工程与信息工程学院
  • 收稿日期:2025-03-25 修回日期:2025-04-21 发布日期:2025-05-07 出版日期:2025-05-07
  • 通讯作者: 陈晓雷
  • 作者简介:陈晓雷(1979—),男,河南灵宝人,教授,博士,CCF会员,主要研究方向:人工智能、计算机视觉;郑芷薇(1998—),女,陕西汉中人,硕士研究生,主要研究方向:计算机视觉;黄雪(2000—),女,江苏徐州人,硕士研究生,主要研究方向:计算机视觉;曲振彬(2001—),男,山西忻州人,硕士研究生,主要研究方向:计算机视觉。
  • 基金资助:
    国家自然科学基金资助项目(61967012)

Panoramic video super-resolution network with joint spherical alignment and adaptive geometric correction

CHEN Xiaolei, ZHENG Zhiwei, HUANG Xue, QU Zhenbin   

  1. College of Electrical and Information Engineering, Lanzhou University of Technology
  • Received:2025-03-25 Revised:2025-04-21 Online:2025-05-07 Published:2025-05-07
  • About author:CHEN Xiaolei, born in 1979, Ph.D., professor. His research interests include artificial intelligence, computer vision. ZHENG Zhiwei, born in 1998, M.S. candidate. Her research interests include computer vision. HUANG Xue, born in 2000, M.S. candidate. Her research interests include computer vision. QU Zhenbin, born in 2001, M.S. candidate. His research interests include computer vision.
  • Supported by:
    National Natural Science Foundation of China (61967012)

摘要: 现有的传统视频超分辨率方法在处理全景视频时,难以有效解决等距矩形投影带来的几何畸变问题,且在帧间对齐和特征融合方面存在不足,导致重建效果不佳。为进一步提升全景视频的超分辨率重建质量,提出了一种联合球面对齐与自适应几何校正的全景视频超分辨率网络——360GeoVSR。该网络通过球面对齐模块(SAM)和几何融合块(GFB)实现帧间特征的精确对齐与高效融合。SAM结合空间变换和可变形卷积,处理全局与局部几何畸变;GFB通过嵌入的自适应几何校正子模块(AGC)动态校正特征对齐,并融合多帧信息以捕捉复杂的帧间关系。在扩展的ODV360Extended全景视频数据集上的主客观对比实验结果表明,360GeoVSR在客观指标和主观视觉效果上均优于BasicVSR++、VRT(Video Restoration Transformer)等5种代表性超分辨率方法,验证了它的有效性。

关键词: 全景视频, 超分辨率, 几何校正, 球面对齐, 深度学习

Abstract: Traditional video super-resolution methods were found to be ineffective in addressing the geometric distortion problems caused by equirectangular projection when processing panoramic videos. Deficiencies were also observed in inter-frame alignment and feature fusion, which resulted in poor reconstruction quality. To further improve the super-resolution reconstruction quality of panoramic videos, a panoramic video super-resolution network combining spherical alignment and adaptive geometric correction, named 360GeoVSR, was proposed. Precise alignment and efficient fusion of inter-frame features were achieved through a Spherical Alignment Module (SAM) and a Geometric Fusion Block (GFB). In SAM, spatial transformation and deformable convolution were combined to address global and local geometric distortions. In GFB, feature alignment was dynamically corrected using an embedded Adaptive Geometric Correction submodule (AGC), and multi-frame information was fused to capture complex inter-frame relationships. The results of subjective and objective comparison experiments on the extended ODV360Extended panoramic video dataset showed that 360GeoVSR outperformed five representative super-resolution methods, including BasicVSR++ and VRT (Video Restoration Transformer), in both objective metrics and subjective visual effects, validating its effectiveness. 

Key words: panoramic video, super-resolution, geometric correction, spherical alignment, deep learning

中图分类号: