TY - JOUR
T1 - Cross-domain few-shot learning via adaptive transformer networks
AU - Paeedeh, Naeem
AU - Pratama, Mahardhika
AU - Ma'sum, Muhammad Anwar
AU - Mayer, Wolfgang
AU - Cao, Zehong
AU - Kowlczyk, Ryszard
N1 - Publisher Copyright:
© 2024 The Author(s)
PY - 2024/3/15
Y1 - 2024/3/15
N2 - Most few-shot learning works rely on the same domain assumption between the base and the target tasks, hindering their practical applications. This paper proposes an adaptive transformer network (ADAPTER), a simple but effective solution for cross-domain few-shot learning where there exist large domain shifts between the base task and the target task. ADAPTER is built upon the idea of bidirectional cross-attention to learn transferable features between the two domains. The proposed architecture is trained with DINO to produce diverse, and less biased features to avoid the supervision collapse problem. Furthermore, the label smoothing approach is proposed to improve the consistency and reliability of the predictions by also considering the predicted labels of the close samples in the embedding space. The performance of ADAPTER is rigorously evaluated in the BSCD-FSL benchmarks in which it outperforms prior arts with significant margins.
AB - Most few-shot learning works rely on the same domain assumption between the base and the target tasks, hindering their practical applications. This paper proposes an adaptive transformer network (ADAPTER), a simple but effective solution for cross-domain few-shot learning where there exist large domain shifts between the base task and the target task. ADAPTER is built upon the idea of bidirectional cross-attention to learn transferable features between the two domains. The proposed architecture is trained with DINO to produce diverse, and less biased features to avoid the supervision collapse problem. Furthermore, the label smoothing approach is proposed to improve the consistency and reliability of the predictions by also considering the predicted labels of the close samples in the embedding space. The performance of ADAPTER is rigorously evaluated in the BSCD-FSL benchmarks in which it outperforms prior arts with significant margins.
KW - Cross-domain few-shot learning
KW - Domain adaptation
KW - Few-shot learning
UR - http://www.scopus.com/inward/record.url?scp=85184742982&partnerID=8YFLogxK
U2 - 10.1016/j.knosys.2024.111458
DO - 10.1016/j.knosys.2024.111458
M3 - Article
AN - SCOPUS:85184742982
SN - 0950-7051
VL - 288
JO - Knowledge-Based Systems
JF - Knowledge-Based Systems
M1 - 111458
ER -