Skip to main navigation Skip to search Skip to main content

Cross-Modal Cross-Domain Dual Alignment Network for RGB-Infrared Person Re-Identification

  • Xiaowei FU
  • , Fuxiang HUANG
  • , Yuhang ZHOU
  • , Huimin MA
  • , Xin XU
  • , Lei ZHANG*
  • *Corresponding author for this work

Research output: Journal PublicationsJournal Article (refereed)peer-review

Abstract

RGB-Infrared cross-modal person re-identification (Re-ID) has drawn increasing attention due to its application value in practice. Most of the current works rely on a supervised training manner. However, in real-world applications, manual collection of pair-wise RGB-Infrared (IR) person data is labor-intensive and time-consuming. Moreover, when a trained model is directly used in another domain, there is usually a significant performance drop. To overcome the above problems, we make the first attempt to transfer the learned model to a new RGB-IR domain which is unlabeled. The practical problem covers two kinds of challenges, i.e., cross-modal (RGB-Infrared) and cross-domain (different dataset) person Re-ID. Previous works have often considered only one of them either cross-modal or cross-domain. In this work, we propose a dual alignment network (DAN) to solve the RGB-Infrared cross-modal cross-domain person Re-ID problem. This network consists of three parts: Domain Adversarial Alignment component (DAA), Pseudo Label Generation module for target domain (PLG), and Cross-Modal Alignment component (CMA). These three modules complement and promote the model to learn domain-invariant and modality-invariant person representations. Further, we propose a protocol of cross-modal cross-domain person Re-ID by synthesizing target domains by adding random noise, adjusting the lighting intensity, and changing the background color, respectively. Experiments on real and synthetic datasets under the same cross-modalities across domains demonstrate the effectiveness of our method.
Original languageEnglish
Pages (from-to)6874-6887
Number of pages14
JournalIEEE Transactions on Circuits and Systems for Video Technology
Volume32
Issue number10
Early online date9 May 2022
DOIs
Publication statusPublished - Oct 2022
Externally publishedYes

Bibliographical note

Publisher Copyright:
© 1991-2012 IEEE.

Funding

This work was supported in part by the Chongqing Natural Science Fund under Grant cstc2021jcyj-jqX0023, in part by the National Key Research and Development Program of China under Grant 2021YFB3100800, in part by CCF Hikvision Open Fund, in part by CAAI-Huawei MindSpore Open Fund, in part by the Beijing Academy of Artificial Intelligence (BAAI), and in part by the Fundamental Research Funds for Central Universities under Grant 2022CDJKYJH018.

Keywords

  • cross-modal learning
  • domain adaptation
  • Person Re-ID
  • transfer learning

Fingerprint

Dive into the research topics of 'Cross-Modal Cross-Domain Dual Alignment Network for RGB-Infrared Person Re-Identification'. Together they form a unique fingerprint.

Cite this