Diffusion-based dynamic super-dense candidate boxes with random center points for 3D object detection

  • Si Heng HE
  • , Zi Jia WANG*
  • , Yuan Gen WANG
  • , Yicong ZHOU
  • , Sam KWONG
  • *Corresponding author for this work

Research output: Journal PublicationsJournal Article (refereed)peer-review

1 Citation (Scopus)

Abstract

Diffusion models have achieved promising results in image generation, but their applications in 3D object detection still need further exploration. In this paper, we design a novel model DiffCandiDet based on dense heads with Gaussian distributed center points for 3D object detection, which effectively integrates the anchor-based method and the Gaussian random noise-based method to leverage the powerful denoising and reconstruction capabilities of the diffusion model. To achieve the learning balance for multi-class 3D object detection, we propose a Dynamic Super-dense Candidate Boxes (DSCB) strategy. Notably, DiffCandiDet addresses the issue of traditional models struggling to detect pedestrians walking side by side. In addition to Gaussian distribution, we also propose a DSCB strategy based on discrete uniform distribution (DUCandiDet) and continuous uniform distribution (CUCandiDet), to reduce the runtime consumption and enhance the robustness of the model. Extensive experiments show that DiffCandiDet achieves competitive results on both KITTI and Waymo Open Datasets. DiffCandiDet ranks 1st on the KITTI validation set in the Car and Pedestrian detection leaderboard. Code is available at https://github.com/SiHengHeHSH/DiffCandiDet.

Original languageEnglish
Article number113181
JournalApplied Soft Computing
Volume178
Early online date22 May 2025
DOIs
Publication statusPublished - Jun 2025

Bibliographical note

Publisher Copyright:
© 2025 Elsevier B.V.

Funding

This work was supported in part by the National Natural Science Foundations of China (NSFC) under Grants 62106055 and 62176094 , in part by the Guangdong Natural Science Foundation, China under Grants 2025A1515010256 , and in part by the Guangzhou Science and Technology Planning Project under Grants 2023A04J0388 and 2023A03J0662 .

Keywords

  • 3D object detection
  • Diffusion model
  • Point cloud

Fingerprint

Dive into the research topics of 'Diffusion-based dynamic super-dense candidate boxes with random center points for 3D object detection'. Together they form a unique fingerprint.

Cite this