Multi-Projection Fusion and Refinement Network for Salient Object Detection in 360° Omnidirectional Image

Runmin CONG, Ke HUANG, Jianjun LEI, Yao ZHAO, Qingming HUANG, Sam KWONG

Research output: Journal PublicationsJournal Article (refereed)peer-review

6 Citations (Scopus)

Abstract

Salient object detection (SOD) aims to determine the most visually attractive objects in an image. With the development of virtual reality (VR) technology, 360 ∘ omnidirectional image has been widely used, but the SOD task in 360 ∘ omnidirectional image is seldom studied due to its severe distortions and complex scenes. In this article, we propose a multi-projection fusion and refinement network (MPFR-Net) to detect the salient objects in 360 ∘ omnidirectional image. Different from the existing methods, the equirectangular projection (EP) image and four corresponding cube-unfolding (CU) images are embedded into the network simultaneously as inputs, where the CU images not only provide supplementary information for EP image but also ensure the object integrity of cube-map projection. In order to make full use of these two projection modes, a dynamic weighting fusion (DWF) module is designed to adaptively integrate the features of different projections in a complementary and dynamic manner from the perspective of inter and intrafeatures. Furthermore, in order to fully explore the way of interaction between encoder and decoder features, a filtration and refinement (FR) module is designed to suppress the redundant information of the feature itself and between the features. Experimental results on two omnidirectional datasets demonstrate that the proposed approach outperforms the state-of-the-art methods both qualitatively and quantitatively. The code and results can be found from the link of https://rmcong.github.io/proj_MPFRNet.html.
Original languageEnglish
Pages (from-to)1-13
Number of pages13
JournalIEEE Transactions on Neural Networks and Learning Systems
DOIs
Publication statusE-pub ahead of print - 9 Jan 2023
Externally publishedYes

Bibliographical note

Publisher Copyright:
IEEE

Funding

This work was supported in part by the National Key Research and Development Program of China under Grant 2021ZD0112100; in part by the Beijing Nova Program under Grant Z201100006820016; in part by the National Natural Science Foundation of China under Grant 62002014, Grant U1936212, Grant 62120106009, Grant 62236008, Grant U21B2038, and Grant 61931008; in part by the Hong Kong Innovation and Technology Commission [InnoHK Project the Centre for Intelligent Multidimensional Data Analysis (CIMDA)]; in part by the Hong Kong GRF-Research Grants Council (RGC) General Research Fund under Grant 11209819 (CityU 9042816) and Grant 11203820 (CityU 9042598); in part by the Young Elite Scientist Sponsorship Program by the China Association for Science and Technology under Grant 2020QNRC001; and in part by the Chinese Association for Artificial Intelligence (CAAI)-Huawei Fund.

Keywords

  • 360∘ omnidirectional image
  • cube-unfolding (CU)
  • dynamic weighting
  • filtration and refinement (FR)
  • salient object detection (SOD)

Fingerprint

Dive into the research topics of 'Multi-Projection Fusion and Refinement Network for Salient Object Detection in 360° Omnidirectional Image'. Together they form a unique fingerprint.

Cite this