Abstract
While salient object detection (SOD) on 2-D images has been extensively studied, there is very little SOD work on 3-D measurement surfaces. We propose an effective point transformer-based SOD network for 3-D measurement point clouds, termed PSOD-Net. PSOD-Net is an encoder-decoder network that takes full advantage of transformers to model the contextual information in both multiscale point- and scenewise manners. In the encoder, we develop a point context transformer (PCT) module to capture region contextual features at the point level; PCT contains two different transformers to excavate the relationship among points. In the decoder, we develop a scene context transformer (SCT) module to learn context representations at the scene level; SCT contains both upsampling-and-transformer (UT) blocks and multicontext aggregation (MCA) units to integrate the global semantic and multilevel features from the encoder into the global scene context. Experiments show clear improvements of PSOD-Net over its competitors and validate that PSOD-Net is more robust to challenging cases such as small objects, multiple objects, and objects with complex structures. Code is available at: https://github.com/ZeyongWei/PSOD-Net.
| Original language | English |
|---|---|
| Article number | 5701511 |
| Pages (from-to) | 1-11 |
| Number of pages | 11 |
| Journal | IEEE Transactions on Geoscience and Remote Sensing |
| Volume | 62 |
| Early online date | 19 Jan 2024 |
| DOIs | |
| Publication status | Published - 2024 |
| Externally published | Yes |
Bibliographical note
Publisher Copyright:© 2024 IEEE.
Funding
This work was supported in part by the National Natural Science Foundation of China under Grant T2322012, Grant 62172218, and Grant 62032011 in part by the Shenzhen Science and Technology Program under Grant JCYJ20220818103401003 and Grant JCYJ20220530172403007; in part by the Guangdong Basic and Applied Basic Research Foundation under Grant 2022A1515010170; in part by the Research Grants Council of the Hong Kong Special Administrative Region, China under Grant UGC/FDS16/E14/21; and in part by the Hong Kong Metropolitan University Research under Grant RD/2022/2.13.
Keywords
- 3-D measurement point cloud
- 3-D salient object detection (SOD)
- point transformer
- PSOD-Net