论文标题
师生的异步学习以及多源的面部标志性检测一致性
Teacher-Student Asynchronous Learning with Multi-Source Consistency for Facial Landmark Detection
论文作者
论文摘要
由于研究人员提出,由于视频中大规模面部标志性检测任务的高注释成本,研究人员提出了一种半监督的范式,该半监督范式使用自我培训用于采矿高质量的伪标签来参加培训。但是,基于自我训练的方法通常会逐渐增加样本数量,其性能因添加的伪标记样本的数量而有很大差异。 在本文中,我们提出了一个基于多源监督信号一致性标准的教师学生的异步学习〜(TSAL)框架,该框架通过一致性约束隐含地挖掘伪标记。具体而言,TSAL框架包含两个具有完全相同结构的模型。激进的学生使用来自同一任务的多源监督信号来更新参数,而平静的老师使用单源监督信号来更新参数。为了合理地吸收学生的建议,通过递归平均过滤再次更新教师的参数。实验结果证明,异步学习框架可以有效地过滤多源监督信号中的噪声,从而挖掘出对网络参数更新更为重要的伪标签。 300W,AFLW和300VW基准测试的大量实验表明,TSAL框架实现了最先进的性能。
Due to the high annotation cost of large-scale facial landmark detection tasks in videos, a semi-supervised paradigm that uses self-training for mining high-quality pseudo-labels to participate in training has been proposed by researchers. However, self-training based methods often train with a gradually increasing number of samples, whose performances vary a lot depending on the number of pseudo-labeled samples added. In this paper, we propose a teacher-student asynchronous learning~(TSAL) framework based on the multi-source supervision signal consistency criterion, which implicitly mines pseudo-labels through consistency constraints. Specifically, the TSAL framework contains two models with exactly the same structure. The radical student uses multi-source supervision signals from the same task to update parameters, while the calm teacher uses a single-source supervision signal to update parameters. In order to reasonably absorb student's suggestions, teacher's parameters are updated again through recursive average filtering. The experimental results prove that asynchronous-learning framework can effectively filter noise in multi-source supervision signals, thereby mining the pseudo-labels which are more significant for network parameter updating. And extensive experiments on 300W, AFLW, and 300VW benchmarks show that the TSAL framework achieves state-of-the-art performance.