References & Citations
Computer Science > Computer Vision and Pattern Recognition
Title: Dual Expert Distillation Network for Generalized Zero-Shot Learning
(Submitted on 25 Apr 2024 (v1), last revised 29 Apr 2024 (this version, v2))
Abstract: Zero-shot learning has consistently yielded remarkable progress via modeling nuanced one-to-one visual-attribute correlation. Existing studies resort to refining a uniform mapping function to align and correlate the sample regions and subattributes, ignoring two crucial issues: 1) the inherent asymmetry of attributes; and 2) the unutilized channel information. This paper addresses these issues by introducing a simple yet effective approach, dubbed Dual Expert Distillation Network (DEDN), where two experts are dedicated to coarse- and fine-grained visual-attribute modeling, respectively. Concretely, one coarse expert, namely cExp, has a complete perceptual scope to coordinate visual-attribute similarity metrics across dimensions, and moreover, another fine expert, namely fExp, consists of multiple specialized subnetworks, each corresponds to an exclusive set of attributes. Two experts cooperatively distill from each other to reach a mutual agreement during training. Meanwhile, we further equip DEDN with a newly designed backbone network, i.e., Dual Attention Network (DAN), which incorporates both region and channel attention information to fully exploit and leverage visual semantic knowledge. Experiments on various benchmark datasets indicate a new state-of-the-art.
Submission history
From: Jingcai Guo [view email][v1] Thu, 25 Apr 2024 05:59:42 GMT (8746kb,D)
[v2] Mon, 29 Apr 2024 14:12:49 GMT (8743kb,D)
Link back to: arXiv, form interface, contact.