Skip to main navigation Skip to search Skip to main content

FusionU10: enhancing pedestrian detection in low-light complex tourist scenes through multimodal fusion

  • Xuefan ZHOU*
  • , Jiapeng LI
  • , Yingzheng LI
  • *Corresponding author for this work

Research output: Journal PublicationsJournal Article (refereed)peer-review

Abstract

With the rapid development of tourism, the concentration of visitor flows poses significant challenges for public safety management, especially in low-light and highly occluded environments, where existing pedestrian detection technologies often struggle to achieve satisfactory accuracy. Although infrared images perform well under low-light conditions, they lack color and detail, making them susceptible to background noise interference, particularly in complex outdoor environments where the similarity between heat sources and pedestrian features further reduces detection accuracy. To address these issues, this paper proposes the FusionU10 model, which combines information from both infrared and visible light images. The model first incorporates an Attention Gate mechanism (AGUNet) into an improved UNet architecture to focus on key features and generate pseudo-color images, followed by pedestrian detection using YOLOv10. During the prediction phase, the model optimizes the loss function with Complete Intersection over Union (CIoU), objectness loss (obj loss), and classification loss (cls loss), thereby enhancing the performance of the detection network and improving the quality and feature extraction capabilities of the pseudo-color images through a feedback mechanism. Experimental results demonstrate that FusionU10 significantly improves detection accuracy and robustness in complex scenes on the FLIR, M3FD, and LLVIP datasets, showing great potential for application in challenging environments.
Original languageEnglish
Article number1504070
Number of pages15
JournalFrontiers in Neurorobotics
Volume18
Early online date27 Nov 2024
DOIs
Publication statusPublished - 10 Jan 2025
Externally publishedYes

Bibliographical note

Publisher Copyright:
Copyright © 2025 Zhou, Li and Li.

Funding

The author(s) declare financial support was received for the research, authorship, and/or publication of this article. This study was funded by the Guizhou Business School Research and Interpretation of the Spirit of the Third Plenary Session of the 20th Central Committee of the Communist Party of China Special Project Funding: “Coupling Mechanism and Collaborative Enhancement Path of Urban Digital Economy and High-Quality Development of Exhibition” (No. 2024XJFSDZD10).

Keywords

  • AGUNet
  • FusionU10 model
  • infrared and visible light
  • pedestrian detection
  • YOLOv10

Fingerprint

Dive into the research topics of 'FusionU10: enhancing pedestrian detection in low-light complex tourist scenes through multimodal fusion'. Together they form a unique fingerprint.

Cite this