Abstract
RGB-Infrared cross-modal person re-identification (Re-ID) has drawn increasing attention due to its application value in practice. Most of the current works rely on a supervised training manner. However, in real-world applications, manual collection of pair-wise RGB-Infrared (IR) person data is labor-intensive and time-consuming. Moreover, when a trained model is directly used in another domain, there is usually a significant performance drop. To overcome the above problems, we make the first attempt to transfer the learned model to a new RGB-IR domain which is unlabeled. The practical problem covers two kinds of challenges, i.e., cross-modal (RGB-Infrared) and cross-domain (different dataset) person Re-ID. Previous works have often considered only one of them either cross-modal or cross-domain. In this work, we propose a dual alignment network (DAN) to solve the RGB-Infrared cross-modal cross-domain person Re-ID problem. This network consists of three parts: Domain Adversarial Alignment component (DAA), Pseudo Label Generation module for target domain (PLG), and Cross-Modal Alignment component (CMA). These three modules complement and promote the model to learn domain-invariant and modality-invariant person representations. Further, we propose a protocol of cross-modal cross-domain person Re-ID by synthesizing target domains by adding random noise, adjusting the lighting intensity, and changing the background color, respectively. Experiments on real and synthetic datasets under the same cross-modalities across domains demonstrate the effectiveness of our method.
| Original language | English |
|---|---|
| Pages (from-to) | 6874-6887 |
| Number of pages | 14 |
| Journal | IEEE Transactions on Circuits and Systems for Video Technology |
| Volume | 32 |
| Issue number | 10 |
| Early online date | 9 May 2022 |
| DOIs | |
| Publication status | Published - Oct 2022 |
| Externally published | Yes |
Bibliographical note
Publisher Copyright:© 1991-2012 IEEE.
Funding
This work was supported in part by the Chongqing Natural Science Fund under Grant cstc2021jcyj-jqX0023, in part by the National Key Research and Development Program of China under Grant 2021YFB3100800, in part by CCF Hikvision Open Fund, in part by CAAI-Huawei MindSpore Open Fund, in part by the Beijing Academy of Artificial Intelligence (BAAI), and in part by the Fundamental Research Funds for Central Universities under Grant 2022CDJKYJH018.
Keywords
- cross-modal learning
- domain adaptation
- Person Re-ID
- transfer learning
Fingerprint
Dive into the research topics of 'Cross-Modal Cross-Domain Dual Alignment Network for RGB-Infrared Person Re-Identification'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver