ME-D2N：用于跨域的多expert域分解网络几乎没有学习

论文标题

ME-D2N：用于跨域的多expert域分解网络几乎没有学习

ME-D2N: Multi-Expert Domain Decompositional Network for Cross-Domain Few-Shot Learning

论文作者

Fu, Yuqian, Xie, Yu, Fu, Yanwei, Chen, Jingjing, Jiang, Yu-Gang

论文摘要

最近，跨域几乎没有学习（CD-FSL），旨在解决不同领域的少量学习（FSL）问题，引起了人们的关注。 CD-FSL的核心挑战在于源和新型目标数据集之间的域间隙。尽管在模型培训期间，已经为CD-FSL进行了许多尝试，但巨大的域差距使现有的CD-FSL方法仍然难以实现非常令人满意的结果。另外，在先前的工作〜\ cite {fu2021meta}中倡导了更现实且有希望的学习CD-FSL模型。因此，在本文中，我们坚持这种设置，并从技术上贡献了一种新型的多专家域分解网络（ME-D2N）。具体而言，为了解决源数据之间的数据不平衡问题，并以有限的示例解决了辅助目标数据，我们在多型专家学习的保护下构建了模型。首先在源头和辅助目标集中对两个可以被认为是其相应领域的专家的教师模型。然后，引入了知识蒸馏技术，以将知识从两个教师转移到统一的学生模型。进一步迈出一步，为了帮助我们的学生模型同时从不同的领域教师学习知识，我们进一步提出了一个新颖的域分解模块，该模块学会将学生模型分解为两个与域相关的子部分。这是通过一个新颖的域特异性门来实现的，该门学会以可学习的方式将每个过滤器分配给一个特定的域。广泛的实验证明了我们方法的有效性。代码和模型可在https://github.com/lovelyqian/me-d2n_for_ffsl上找到。

Recently, Cross-Domain Few-Shot Learning (CD-FSL) which aims at addressing the Few-Shot Learning (FSL) problem across different domains has attracted rising attention. The core challenge of CD-FSL lies in the domain gap between the source and novel target datasets. Though many attempts have been made for CD-FSL without any target data during model training, the huge domain gap makes it still hard for existing CD-FSL methods to achieve very satisfactory results. Alternatively, learning CD-FSL models with few labeled target domain data which is more realistic and promising is advocated in previous work~\cite{fu2021meta}. Thus, in this paper, we stick to this setting and technically contribute a novel Multi-Expert Domain Decompositional Network (ME-D2N). Concretely, to solve the data imbalance problem between the source data with sufficient examples and the auxiliary target data with limited examples, we build our model under the umbrella of multi-expert learning. Two teacher models which can be considered to be experts in their corresponding domain are first trained on the source and the auxiliary target sets, respectively. Then, the knowledge distillation technique is introduced to transfer the knowledge from two teachers to a unified student model. Taking a step further, to help our student model learn knowledge from different domain teachers simultaneously, we further present a novel domain decomposition module that learns to decompose the student model into two domain-related sub parts. This is achieved by a novel domain-specific gate that learns to assign each filter to only one specific domain in a learnable way. Extensive experiments demonstrate the effectiveness of our method. Codes and models are available at https://github.com/lovelyqian/ME-D2N_for_CDFSL.

下载PDF全文

下载文献需遵守相关版权规定

论文标题