Skip to main navigation Skip to search Skip to main content

3D Enhanced Residual CNN for Video Super-Resolution Network

  • Weiqiang XIN
  • , Zheng WANG
  • , Xi CHEN
  • , Yufeng TANG
  • , Bing LI
  • , Chunwei TIAN*
  • *Corresponding author for this work

Research output: Journal PublicationsJournal Article (refereed)peer-review

Abstract

Deep convolutional neural networks (CNNs) have demonstrated remarkable performance in video super-resolution (VSR). However, the ability of most existing methods to recover fine details in complex scenes is often hindered by the loss of shallow texture information during feature extraction. To address this limitation, we propose a 3D Convolutional Enhanced Residual Video Super-Resolution Network (3D-ERVSNet). This network employs a forward and backward bidirectional propagation module (FBBPM) that aligns features across frames using explicit optical flow through lightweight SPyNet. By incorporating an enhanced residual structure (ERS) with skip connections, shallow and deep features are effectively integrated, enhancing texture restoration capabilities. Furthermore, 3D convolution module (3DCM) is applied after the backward propagation module to implicitly capture spatio-temporal dependencies. The architecture synergizes these components where FBBPM extracts aligned features, ERS fuses hierarchical representations, and 3DCM refines temporal coherence. Finally, a deep feature aggregation module (DFAM) fuses the processed features, and a pixel-upsampling module (PUM) reconstructs the high-resolution (HR) video frames. Comprehensive evaluations on REDS, Vid4, UDM10, and Vim4 benchmarks demonstrate well performance including 30.95 dB PSNR/0.8822 SSIM on REDS and 32.78 dB/0.8987 on Vim4. 3D-ERVSNet achieves significant gains over baselines while maintaining high efficiency with only 6.3M parameters and 77 ms/frame runtime (i.e., 20× faster than RBPN). The network’s effectiveness stems from its task-specific asymmetric design that balances explicit alignment and implicit fusion.
Original languageEnglish
Pages (from-to)2837-2849
Number of pages13
JournalComputers, Materials and Continua
Volume85
Issue number2
DOIs
Publication statusPublished - 23 Sept 2025
Externally publishedYes

Bibliographical note

Publisher Copyright:
Copyright © 2025 The Authors.

Funding

Funding Statement: This project was supported in part by the Basic and Applied Basic Research Foundation of Guangdong Province [2025A1515011566]; in part by the State Key Laboratory for Novel Software Technology, Nanjing University [KFKT2024B08]; in part by Leading Talents in Gusu Innovation and Entrepreneurship [ZXL2023170]; and in part by the Basic Research Programs of Taicang 2024, [TC2024JC32].

Keywords

  • 3D convolution
  • enhanced residual CNN
  • spatio-temporal feature extraction
  • Video super-resolution

Fingerprint

Dive into the research topics of '3D Enhanced Residual CNN for Video Super-Resolution Network'. Together they form a unique fingerprint.

Cite this