Foreground-Aware Geometry Compression With Hybrid Attention for Large-Scale Point Clouds

  • Liang XIE
  • , Haoran Li
  • , Baoliang CHEN
  • , Ge LI
  • , Sam KWONG
  • , Wei GAO

Research output: Journal PublicationsJournal Article (refereed)peer-review

Abstract

Regions of Interest (ROI) play a crucial role in point cloud compression, especially in applications such as autonomous driving and robot navigation, where foreground regions often contain key information such as obstacles and object boundaries. However, traditional point cloud compression methods typically fail to optimize for these critical areas, instead applying uniform processing across the whole point clouds. The paper aims to optimize the point cloud compression process by allocating more bitstream resources to the foreground regions, thereby preserving important information in the point clouds. To achieve this, we propose a separation-based Foreground-Background Network (FB-Net) for compressing point cloud. The framework first identifies and separates the foreground and background regions, then designs an attention-based compression network, which includes multi-stage Occupancy Probability Estimation (OPE) module. The OPE module consist of an Attention-based Feature Extraction Layer (AFEL) and an Occupancy Probability Generation (AOPG) module. By controlling the number of OPE modules, we can allocate more bitstream resources to critical regions in the scene point cloud, such as tables and chairs, thus improving the performance of downstream detection tasks. Furthermore, to compensate for perceptual distortions in human vision, we design a large-scale receptive field-based Point Cloud Upsampling Network (PCU-Net), to enhance the objective quality. Through extensive experiments on point cloud datasets such as ScanNet and SUN RGB-D, we demonstrate that allocating more bitstream resources to the foreground regions benefits the accuracy of detection tasks. Compared to G-PCC and many state-of-the-art learning-based point cloud compression methods, our approach shows superior performance in detection tasks than compression-then-detection process methods.
Original languageEnglish
JournalIEEE Transactions on Broadcasting
DOIs
Publication statusE-pub ahead of print - 23 Jan 2026

Bibliographical note

Publisher Copyright:
© 1963-12012 IEEE.

Funding

Rdve 8 September 2025; visedre 24 Nrvo 2025; accepted 11 December 2025. This orkw asw supported in part by the Major Kye Project of Peng Cheng Laboratory (PCL) under Grant PCL2024A02; in part by the Natural Science Foundation of China under Grant 62271013 and Grant 62031013; in part by Guangdong Provincial Kye Laboratory of Ultra High Definition Ievm Media Ty under Grant 2024B1212010006; in part by Guangdong Province Pearl Rrvi Tt Program under Grant 2021QN020708; in part by Guangdong Basic and Applied Basic Research Foundation under Grant 2024A1515010155; in part by Shenzhen Science and Ty Program under Grant JCYJ20240813160202004, Grant JCYJ20230807120808017, SYSPG20241211173440004; and in part by Shenzhen Fundamental Research Program under Grant GXWD20201231165807007-20200806163656003. (Corresponding author: Wei Gao.) Liang Xie is with the School of Computer Science and T,e Guangdong Uyvn of T,e Guangzhou 510006, China, and also with the Peking Uyvn Shenzhen Graduate School, Shenzhen 518055, China (e-mail: [email protected]). Haoran Li is with the School of Electronics and Communication Engineer- ing, Sun Y-n U,evn Shenzhen Campus, Shenzhen 518107, China, and also with the Peng Cheng L,a Shenzhen, Guangdong 518000, China (e-mail: [email protected]). Baoliang Chen is with the School of Computer Science, South China Normal U,evn Guangzhou 510631, China (e-mail: blchen6- c .m ..). Ge Li is with Guangdong Provincial Kye Laboratory of Ultra High Definition Ievm Media T,e School of Electronic and Com- puter Engineering, Peking U,evn Shenzhen 518055, China (e-mail: [email protected]). Sam Kwong is with the School of Data Science, Lingnan U,evn Hong K, China (e-mail: [email protected]).samkw Wi Gao is with Guangdong Provincial Kye Laboratory of Ultra High Definition Ievm Media T,e School of Electronic and Computer Engineering, Peking U,evn Shenzhen 518055, China, and also with the Peng Cheng L,a Shenzhen 518000, China (e-mail: 2aog 62@..). Digital Object Identifier 10.1109

Keywords

  • Foreground-background
  • ROI region
  • object detection
  • point cloud compression
  • upsampling

Fingerprint

Dive into the research topics of 'Foreground-Aware Geometry Compression With Hybrid Attention for Large-Scale Point Clouds'. Together they form a unique fingerprint.

Cite this