G2HFNet: GeoGran-Aware Hierarchical Feature Fusion Network for Salient Object Detection in Optical Remote Sensing Images

  • Bin WAN
  • , Runmin CONG*
  • , Xiaofei ZHOU
  • , Hao FANG
  • , Chengtao LV
  • , Sam KWONG
  • *Corresponding author for this work

Research output: Journal PublicationsJournal Article (refereed)peer-review

Abstract

Remote sensing images captured from aerial perspectives often exhibit significant scale variations and complex backgrounds, posing challenges for salient object detection (SOD). Existing methods typically extract multi-level features at a single scale using uniform attention mechanisms, leading to suboptimal representations and incomplete detection results. To address these issues, we propose a GeoGran-Aware Hierarchical Feature Fusion Network (G2HFNet) that fully exploits geometric and granular cues in optical remote sensing images. Specifically, G2HFNet adopts Swin Transformer as the backbone to extract multi-level features and integrates three key modules: the multi-scale detail enhancement (MDE) module to handle object scale variations and enrich fine details, the dual-branch geo-gran complementary (DGC) module to jointly capture fine-grained details and positional information in mid-level features, and the deep semantic perception (DSP) module to refine high-level positional cues via self-attention. Additionally, a local-global guidance fusion (LGF) module is introduced to replace traditional convolutions for effective multi-level feature integration. Extensive experiments demonstrate that G2HFNet achieves high-quality saliency maps and significantly improves detection performance in challenging remote sensing scenarios.
Original languageEnglish
Number of pages12
JournalIEEE Transactions on Circuits and Systems for Video Technology
DOIs
Publication statusE-pub ahead of print - 12 Jan 2026

Bibliographical note

Publisher Copyright:
© 1991-2012 IEEE.

Funding

This work was supported in part by the opening project of State Key Laboratory of Autonomous Intelligent Unmanned Systems under Grant ZZKF2025- 2-8, in part by the National Natural Science Foundation of China under Grant 62471278 and 62271180, in part by the Hong Kong GRF-RGC General Research Fund under Grant 13200425, and in part by the Research Grants Council of the Hong Kong Special Administrative Region, China under Grant STG5/E-103/24-R.

Keywords

  • Salient object detection
  • optical remote sensing image
  • multi-scale detail enhancement
  • dual-branch geo-gran complementary
  • local-global guidance fusion

Fingerprint

Dive into the research topics of 'G2HFNet: GeoGran-Aware Hierarchical Feature Fusion Network for Salient Object Detection in Optical Remote Sensing Images'. Together they form a unique fingerprint.

Cite this