DrFuse: Learning Disentangled Representation for Clinical Multi-Modal Fusion with Missing Modality and Modal Inconsistency

  • Wenfang YAO
  • , Kejing YIN
  • , William K. CHEUNG
  • , Jia LIU
  • , Jing QIN

Research output: Book Chapters | Papers in Conference ProceedingsConference paper (refereed)Researchpeer-review

42 Citations (Scopus)

Abstract

The combination of electronic health records (EHR) and medical images is crucial for clinicians in making diagnoses and forecasting prognoses. Strategically fusing these two data modalities has great potential to improve the accuracy of machine learning models in clinical prediction tasks. However, the asynchronous and complementary nature of EHR and medical images presents unique challenges. Missing modalities due to clinical and administrative factors are inevitable in practice, and the significance of each data modality varies depending on the patient and the prediction target, resulting in inconsistent predictions and suboptimal model performance. To address these challenges, we propose DrFuse to achieve effective clinical multi-modal fusion. It tackles the missing modality issue by disentangling the features shared across modalities and those unique within each modality. Furthermore, we address the modal inconsistency issue via a disease-wise attention layer that produces the patient- and disease-wise weighting for each modality to make the final prediction. We validate the proposed method using real-world large-scale datasets, MIMIC-IV and MIMIC-CXR. Experimental results show that the proposed method significantly outperforms the state-of-the-art models.

Original languageEnglish
Title of host publicationProceedings of the 38th AAAI Conference on Artificial Intelligence
EditorsJennifer DY, Sriraam NATARAJAN, Michael WOOLDRIDGE
PublisherAAAI press
Pages16416-16424
Number of pages9
ISBN (Electronic)9781577358879
DOIs
Publication statusPublished - 25 Mar 2024
Externally publishedYes
EventThe 38th Annual AAAI Conference on Artificial Intelligence - Vancouver, Canada
Duration: 20 Feb 202427 Feb 2024

Publication series

NameProceedings of the AAAI Conference on Artificial Intelligence
PublisherAssociation for the Advancement of Artificial Intelligence
Number15
Volume38
ISSN (Print)2374-3468
ISSN (Electronic)2159-5399

Conference

ConferenceThe 38th Annual AAAI Conference on Artificial Intelligence
Abbreviated titleAAAI-24
Country/TerritoryCanada
CityVancouver
Period20/02/2427/02/24

Bibliographical note

Publisher Copyright:
Copyright © 2024, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved.

Funding

The work described in this paper is supported by a grant of Hong Kong RGC Theme-based Research Scheme (project no. T45-401/22-N), an Innovation and Technology Fund-Midstream Research Programme for Universities (ITF-MRP) (project no. MRP/022/20X), and General Research Fund RGC/HKBU12201219 from the Research Grant Council.

Fingerprint

Dive into the research topics of 'DrFuse: Learning Disentangled Representation for Clinical Multi-Modal Fusion with Missing Modality and Modal Inconsistency'. Together they form a unique fingerprint.

Cite this