Bilinear-experts network with self-adaptive sampler for long-tailed visual recognition

Qin WANG, Sam KWONG, Xizhao WANG*

*Corresponding author for this work

Research output: Journal PublicationsJournal Article (refereed)peer-review

Abstract

Long-tail distributed data hinders the practical application of state-of-the-art deep models in computer vision. Consequently, exclusive methodologies for handling the long-tailed problem are proposed, focusing on different hierarchies. For embedding hierarchy, existing works manually augment the diversity of tail-class features for specific datasets. However, prior knowledge about datasets is not always available for practical use, which brings unsatisfactory generalization ability in human fine-turned augmentation under such circumstances. To figure out this problem, we introduce a novel model named Bilinear-Experts Network (BENet) with Self-Adaptive Sampler (SAS). This model leverages model-driven perturbations to tail-class embeddings while preserving generalization capability on head classes through a designed bilinear experts system. The designed perturbations adaptively augment tail-class space and shift the class boundary away from the tail-class centers. Moreover, we find that SAS automatically assigns more significant perturbations to specific tail classes with relatively fewer training samples, which indicates SAS is capable of filtering tail classes with lower quality and enhancing them. Also, experiments conducted across various long-tailed benchmarks validate the comparable performance of the proposed BENet.
Original languageEnglish
Article number129832
Number of pages12
JournalNeurocomputing
Volume633
Early online date3 Mar 2025
DOIs
Publication statusE-pub ahead of print - 3 Mar 2025

Bibliographical note

Publisher Copyright:
© 2025

Funding

This work was supported in part by the National Natural Science Foundation of China (Grants 62376161 and U24A20322), in part by the Stable Support Project of Shenzhen City (No. 20231122124602001), and in part by the Research Grants Council of the Hong Kong Special Administrative Region, China (Project Reference Number: STG5/E-103/24-R).

Keywords

  • Class-imbalance
  • Deep learning
  • Long-tailed problem
  • Self-adaptive attention

Fingerprint

Dive into the research topics of 'Bilinear-experts network with self-adaptive sampler for long-tailed visual recognition'. Together they form a unique fingerprint.

Cite this