Journal of Computer Applications ›› 2026, Vol. 46 ›› Issue (2): 528-535.DOI: 10.11772/j.issn.1001-9081.2025030311

• Multimedia computing and computer simulation • Previous Articles    

Panoramic video super-resolution network combining spherical alignment and adaptive geometric correction

Xiaolei CHEN(), Zhiwei ZHENG, Xue HUANG, Zhenbin QU   

  1. College of Electrical and Information Engineering,Lanzhou University of Technology,Lanzhou Gansu 730050,China
  • Received:2025-03-25 Revised:2025-04-25 Accepted:2025-04-27 Online:2025-05-07 Published:2026-02-10
  • Contact: Xiaolei CHEN
  • About author:CHEN Xiaolei, born in 1979, Ph. D., professor. His research interests include artificial intelligence, computer vision. Email:chenxl703@lut.edu.cn
    ZHENG Zhiwei, born in 1998, M. S. candidate. Her research interests include computer vision.
    HUANG Xue, born in 2000, M. S. candidate. Her research interests include computer vision.
    QU Zhenbin, born in 2001, M. S. candidate. His research interests include computer vision.
  • Supported by:
    National Natural Science Foundation of China(61967012)

结合球面对齐与自适应几何校正的全景视频超分辨率网络

陈晓雷(), 郑芷薇, 黄雪, 曲振彬   

  1. 兰州理工大学 电气工程与信息工程学院,兰州 730050
  • 通讯作者: 陈晓雷
  • 作者简介:陈晓雷(1979—),男,河南灵宝人,教授,博士,CCF会员,主要研究方向:人工智能、计算机视觉 Email:chenxl703@lut.edu.cn
    郑芷薇(1998—),女,陕西汉中人,硕士研究生,主要研究方向:计算机视觉
    黄雪(2000—),女,江苏徐州人,硕士研究生,主要研究方向:计算机视觉
    曲振彬(2001—),男,山西忻州人,硕士研究生,主要研究方向:计算机视觉。
  • 基金资助:
    国家自然科学基金资助项目(61967012)

Abstract:

Traditional Video Super-Resolution (VSR) methods are ineffective in solving geometric distortion problems caused by equirectangular projection when processing panoramic videos, and have deficiencies in inter-frame alignment and feature fusion, which results in poor reconstruction quality. To further improve the super-resolution reconstruction quality of panoramic videos, a panoramic video super-resolution network combining spherical alignment and adaptive geometric correction, named 360GeoVSR, was proposed. In the network, accurate alignment and efficient fusion of inter-frame features were achieved through a Spherical Alignment Module (SAM) and a Geometric Fusion Block (GFB). In SAM, spatial transformation and deformable convolution were combined to address global and local geometric distortions. In GFB, feature alignment was corrected dynamically using an embedded Adaptive Geometric Correction (AGC) submodule, and multi-frame information was fused to capture complex inter-frame relationships. The results of subjective and objective comparison experiments on the extended ODV360Extended panoramic video dataset show that 360GeoVSR outperforms five representative super-resolution methods, including BasicVSR++ and VRT (Video Restoration Transformer), in both objective metrics and subjective visual effects, verifying its effectiveness.

Key words: panoramic video, super-resolution, geometric correction, spherical alignment, deep learning

摘要:

现有的传统视频超分辨率(VSR)方法在处理全景视频时难以有效解决等距矩形投影带来的几何畸变问题,且在帧间对齐和特征融合方面存在不足,这些会导致重建效果不佳。为了进一步提升全景视频的超分辨率重建质量,提出一种结合球面对齐与自适应几何校正的全景视频超分辨率网络——360GeoVSR。该网络通过球面对齐模块(SAM)和几何融合块(GFB)实现帧间特征的精确对齐与高效融合。SAM结合空间变换和可变形卷积处理全局与局部几何畸变;GFB通过嵌入的自适应几何校正(AGC)子模块动态校正特征对齐,并融合多帧信息以捕捉复杂的帧间关系。在扩展的ODV360Extended全景视频数据集上的主客观对比实验结果表明,360GeoVSR在客观指标和主观视觉效果上均优于BasicVSR++和VRT(Video Restoration Transformer)等5种代表性超分辨率方法,验证了它的有效性。

关键词: 全景视频, 超分辨率, 几何校正, 球面对齐, 深度学习

CLC Number: