Journals
  Publication Years
  Keywords
Search within results Open Search
Please wait a minute...
For Selected: Toggle Thumbnails
Panoramic video super-resolution network combining spherical alignment and adaptive geometric correction
Xiaolei CHEN, Zhiwei ZHENG, Xue HUANG, Zhenbin QU
Journal of Computer Applications    2026, 46 (2): 528-535.   DOI: 10.11772/j.issn.1001-9081.2025030311
Abstract86)   HTML0)    PDF (1058KB)(172)       Save

Traditional Video Super-Resolution (VSR) methods are ineffective in solving geometric distortion problems caused by equirectangular projection when processing panoramic videos, and have deficiencies in inter-frame alignment and feature fusion, which results in poor reconstruction quality. To further improve the super-resolution reconstruction quality of panoramic videos, a panoramic video super-resolution network combining spherical alignment and adaptive geometric correction, named 360GeoVSR, was proposed. In the network, accurate alignment and efficient fusion of inter-frame features were achieved through a Spherical Alignment Module (SAM) and a Geometric Fusion Block (GFB). In SAM, spatial transformation and deformable convolution were combined to address global and local geometric distortions. In GFB, feature alignment was corrected dynamically using an embedded Adaptive Geometric Correction (AGC) submodule, and multi-frame information was fused to capture complex inter-frame relationships. The results of subjective and objective comparison experiments on the extended ODV360Extended panoramic video dataset show that 360GeoVSR outperforms five representative super-resolution methods, including BasicVSR++ and VRT (Video Restoration Transformer), in both objective metrics and subjective visual effects, verifying its effectiveness.

Table and Figures | Reference | Related Articles | Metrics
Fast segmentation of sign language video based on cellular neural network
ZHANG Aihua LEI Xiaoya CHEN Xiaolei CHEN Lili
Journal of Computer Applications    2013, 33 (02): 503-506.   DOI: 10.3724/SP.J.1087.2013.00503
Abstract1071)      PDF (564KB)(430)       Save
To achieve sign language video coding of region of interest, and improve call efficiency, a fast segmentation methodology of sign language video based on Cellular Neural Network (CNN) was proposed. Firstly, the skin regions of sign language video were detected through corresponding CNN templates by using the skin color information characteristics. Secondly, CNN based motion detection was carried out on the skin detection results by using inter-frame difference algorithm, and then the initial gesture region could be obtained. Finally, morphological processing methods were employed to fill small holes and smooth the boundaries of regions, and eventually the segmentation of the face and hands regions of sign language video image sequence was realized. The results show that the method can rapidly and accurately segment sign language video.
Related Articles | Metrics