Search Result

Journals

Publication Years

Keywords

Please wait a minute...

For Selected:

Download Citations
EndNote Ris BibTeX

Toggle Thumbnails

Select

Panoramic video super-resolution network combining spherical alignment and adaptive geometric correction

Xiaolei CHEN, Zhiwei ZHENG, Xue HUANG, Zhenbin QU

Journal of Computer Applications 2026, 46 (2): 528-535. DOI: 10.11772/j.issn.1001-9081.2025030311

Abstract （86）

HTML （0）

PDF （1058KB）（172）

Save

Traditional Video Super-Resolution （VSR） methods are ineffective in solving geometric distortion problems caused by equirectangular projection when processing panoramic videos， and have deficiencies in inter-frame alignment and feature fusion， which results in poor reconstruction quality. To further improve the super-resolution reconstruction quality of panoramic videos， a panoramic video super-resolution network combining spherical alignment and adaptive geometric correction， named 360GeoVSR， was proposed. In the network， accurate alignment and efficient fusion of inter-frame features were achieved through a Spherical Alignment Module （SAM） and a Geometric Fusion Block （GFB）. In SAM， spatial transformation and deformable convolution were combined to address global and local geometric distortions. In GFB， feature alignment was corrected dynamically using an embedded Adaptive Geometric Correction （AGC） submodule， and multi-frame information was fused to capture complex inter-frame relationships. The results of subjective and objective comparison experiments on the extended ODV360Extended panoramic video dataset show that 360GeoVSR outperforms five representative super-resolution methods， including BasicVSR++ and VRT （Video Restoration Transformer）， in both objective metrics and subjective visual effects， verifying its effectiveness.

Table and Figures | Reference | Related Articles | Metrics

Select

Fast segmentation of sign language video based on cellular neural network

ZHANG Aihua LEI Xiaoya CHEN Xiaolei CHEN Lili

Journal of Computer Applications 2013, 33 (02): 503-506. DOI: 10.3724/SP.J.1087.2013.00503

Abstract （1071）

PDF （564KB）（430）

Save

To achieve sign language video coding of region of interest, and improve call efficiency, a fast segmentation methodology of sign language video based on Cellular Neural Network (CNN) was proposed. Firstly, the skin regions of sign language video were detected through corresponding CNN templates by using the skin color information characteristics. Secondly, CNN based motion detection was carried out on the skin detection results by using inter-frame difference algorithm, and then the initial gesture region could be obtained. Finally, morphological processing methods were employed to fill small holes and smooth the boundaries of regions, and eventually the segmentation of the face and hands regions of sign language video image sequence was realized. The results show that the method can rapidly and accurately segment sign language video.