Toward Full-Scene Domain Generalization in Multi-Agent Collaborative Bird’s Eye View Segmentation for Connected and Autonomous Driving

Senkang HU, Zhengru FANG, Yiqin DENG, Xianhao CHEN, Yuguang FANG, Sam KWONG

Research output: Journal PublicationsJournal Article (refereed)peer-review

Abstract

Collaborative perception has recently gained significant attention in autonomous driving, improving perception quality by enabling the exchange of additional information among vehicles. However, deploying collaborative perception systems can lead to domain shifts due to diverse environmental conditions and data heterogeneity among connected and autonomous vehicles (CAVs). To address these challenges, we propose a unified domain generalization framework to be utilized during the training and inference stages of collaborative perception. In the training phase, we introduce an Amplitude Augmentation (AmpAug) method to augment low-frequency image variations, broadening the model’s ability to learn across multiple domains. We also employ a meta-consistency training scheme to simulate domain shifts, optimizing the model with a carefully designed consistency loss to acquire domain-invariant representations. In the inference phase, we introduce an intra-system domain alignment mechanism to reduce or potentially eliminate the domain discrepancy among CAVs prior to inference. Extensive experiments substantiate the effectiveness of our method in comparison with the existing state-of-the-art works.
Original languageEnglish
Number of pages14
JournalIEEE Transactions on Intelligent Transportation Systems
DOIs
Publication statusE-pub ahead of print - 5 Dec 2024

Bibliographical note

Publisher Copyright:
© 2024 IEEE.

Keywords

  • Domain generalization
  • autonomous driving
  • bird’s eye view segmentation
  • vehicle-to-vehicle collaborative perception

Fingerprint

Dive into the research topics of 'Toward Full-Scene Domain Generalization in Multi-Agent Collaborative Bird’s Eye View Segmentation for Connected and Autonomous Driving'. Together they form a unique fingerprint.

Cite this