Search Result

Journals

Publication Years

Keywords

Please wait a minute...

For Selected:

Download Citations
EndNote Ris BibTeX

Toggle Thumbnails

Select

Local and long-range temporal complementary modeling for video action recognition

Zuxi ZHANG, Zhancheng ZHANG, Fuyuan HU

Journal of Computer Applications 2026, 46 (3): 758-766. DOI: 10.11772/j.issn.1001-9081.2025040509

Abstract （1）

HTML （0）

PDF （980KB）（0）

Save

Due to the diversity and complexity of spatio-temporal features in videos， as well as the wide variability of actions across different speeds and scales， problems of insufficient capture of local motion details and inadequate mining of long-range temporal dependencies are commonly encountered in the existing action recognition methods. Therefore， a video action recognition network based on complementary modeling of local and long-range temporal information was proposed. The network is composed of the Two-level Fusion Motion Excitation （TFME） and the Temporal Aggregation Channel Excitation （TACE） modules. In the TFME module， the first-order and second-order differences between adjacent feature maps were computed and fused， and the fused weights were used to excite channels of the original feature maps， so as to enhance the fine-grained extraction capability of multi-level motion features， thereby modeling local temporal information. In the TACE module， a hierarchical residual pyramid structure was constructed using a channel grouping strategy， which expanded the temporal receptive field and enhanced the learning ability of multi-scale features. Meanwhile， a Temporal Channel Attention （TCA） mechanism was designed to adjust the aggregated feature maps dynamically and optimize the weight allocation among temporal channels， thereby modeling long-range temporal information. Finally， the above complementary modules were integrated and embedded into a 2D residual network to realize end-to-end action recognition. Experimental results on the Something-SomethingV1 and V2 validation sets show that using only RGB frames with a random 8-frame sampling strategy， the proposed network achieves the Top-1 accuracies of 50.6% and 61.9%， respectively； with a 16-frame sampling strategy， the accuracies are 54.1% and 65.6%， respectively. It can be seen that the proposed network models both local motion details and long-range temporal dependencies efficiently， offering a new way of thinking for action recognition tasks in complex temporal scenarios.

Table and Figures | Reference | Related Articles | Metrics

Select

Formal verification of railway interlocking system based on ladder logic

YU Lizhen XU Zhongwei CHEN Zuxi ZHANG Shuqin

Journal of Computer Applications 2013, 33 (12): 3419-3422.

Abstract （822）

PDF （748KB）（434）

Save

Ladder logic was used to model the railway interlocking system. In order to achieve the purpose of formal verification of the railway interlocking system, the model of the railway interlocking system expressed by ladder logic was converted to NuSMV language, which was a temporal logic model checker. Then Computational Tree Logic (CTL) specification representing the safety requirements of the railway interlocking system was verified. Finally, the formal verification of computer design model of the railway interlocking system was implemented based on NuSMV.