Abstract
Diffusion models have achieved promising results in image generation, but their applications in 3D object detection still need further exploration. In this paper, we design a novel model DiffCandiDet based on dense heads with Gaussian distributed center points for 3D object detection, which effectively integrates the anchor-based method and the Gaussian random noise-based method to leverage the powerful denoising and reconstruction capabilities of the diffusion model. To achieve the learning balance for multi-class 3D object detection, we propose a Dynamic Super-dense Candidate Boxes (DSCB) strategy. Notably, DiffCandiDet addresses the issue of traditional models struggling to detect pedestrians walking side by side. In addition to Gaussian distribution, we also propose a DSCB strategy based on discrete uniform distribution (DUCandiDet) and continuous uniform distribution (CUCandiDet), to reduce the runtime consumption and enhance the robustness of the model. Extensive experiments show that DiffCandiDet achieves competitive results on both KITTI and Waymo Open Datasets. DiffCandiDet ranks 1st on the KITTI validation set in the Car and Pedestrian detection leaderboard. Code is available at https://github.com/SiHengHeHSH/DiffCandiDet.
| Original language | English |
|---|---|
| Article number | 113181 |
| Journal | Applied Soft Computing |
| Volume | 178 |
| Early online date | 22 May 2025 |
| DOIs | |
| Publication status | Published - Jun 2025 |
Bibliographical note
Publisher Copyright:© 2025 Elsevier B.V.
Funding
This work was supported in part by the National Natural Science Foundations of China (NSFC) under Grants 62106055 and 62176094 , in part by the Guangdong Natural Science Foundation, China under Grants 2025A1515010256 , and in part by the Guangzhou Science and Technology Planning Project under Grants 2023A04J0388 and 2023A03J0662 .
Keywords
- 3D object detection
- Diffusion model
- Point cloud