Abstract
This paper investigates crowd counting in the frequency domain, which is a novel direction compared to the traditional view in the spatial domain. By transforming the density map into the frequency domain and using the properties of the characteristic function, we propose a novel method that is simple, effective, and efficient. The solid theoretical analysis ends up as an implementation-friendly loss function, which requires only standard tensor operations in the training process. We prove that our loss function is an upper bound of the pseudo sup norm metric between the ground truth and the prediction density map (over all of their sub-regions), and demonstrate its efficacy and efficiency versus other loss functions. The experimental results also show its competitiveness to the state-of-the-art on five benchmark data sets: ShanghaiTech A & B, UCF-QNRF, JHU++, and NWPU.
Original language | English |
---|---|
Title of host publication | Proceedings of the 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition |
Publisher | IEEE |
Pages | 19586-19595 |
Number of pages | 10 |
ISBN (Electronic) | 9781665469463 |
ISBN (Print) | 9781665469470 |
DOIs | |
Publication status | Published - 2022 |
Externally published | Yes |
Event | 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR - New Orleans, United States Duration: 19 Jun 2022 → 24 Jun 2022 |
Publication series
Name | Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition |
---|---|
Volume | 2022-June |
ISSN (Print) | 1063-6919 |
Conference
Conference | 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR |
---|---|
Country/Territory | United States |
City | New Orleans |
Period | 19/06/22 → 24/06/22 |
Bibliographical note
Publisher Copyright:© 2022 IEEE.
Funding
This work was supported by a grant from the Research Grants Council of the Hong Kong Special Administrative Region, China (Proj. No. CityU 11212518), and a Strategic Research Grant from City University of Hong Kong (Proj. No. 7005665).
Keywords
- Scene analysis and understanding
- Vision applications and systems