Paper Title


Improving Out-of-Distribution Detection via Epistemic Uncertainty Adversarial Training

Authors

Derek Everett, Andre T. Nguyen, Luke E. Richards, Edward Raff

Abstract


The quantification of uncertainty is important for the adoption of machine learning, especially to reject out-of-distribution (OOD) data back to human experts for review. Yet progress has been slow, as a balance must be struck between computational efficiency and the quality of uncertainty estimates. For this reason many use deep ensembles of neural networks or Monte Carlo dropout for reasonable uncertainty estimates at relatively minimal compute and memory. Surprisingly, when we focus on the real-world applicable constraint of $\leq 1\%$ false positive rate (FPR), prior methods fail to reliably detect OOD samples as such. Notably, even Gaussian random noise fails to trigger these popular OOD techniques. We help to alleviate this problem by devising a simple adversarial training scheme that incorporates an attack of the epistemic uncertainty predicted by the dropout ensemble. We demonstrate this method improves OOD detection performance on standard data (i.e., not adversarially crafted), and improves the standardized partial AUC from near-random guessing performance to $\geq 0.75$.
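The core idea in the abstract, estimating epistemic uncertainty with a Monte Carlo dropout ensemble and then crafting adversarial inputs that attack that uncertainty, can be sketched in miniature. The following is a toy NumPy illustration under stated assumptions, not the authors' implementation: the two-layer network, the use of total predictive variance as the uncertainty score, the finite-difference gradient, and all parameter values (`p`, `T`, `eps`) are hypothetical stand-ins.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical stand-in weights for a tiny 2-layer classifier (4 -> 16 -> 3).
W1 = rng.normal(size=(4, 16))
W2 = rng.normal(size=(16, 3))

def mc_dropout_probs(x, rng, p=0.5, T=32):
    """T stochastic forward passes with inverted dropout -> (T, classes) probs."""
    probs = []
    for _ in range(T):
        h = np.maximum(x @ W1, 0.0)                 # ReLU hidden layer
        mask = (rng.random(h.shape) > p) / (1 - p)  # inverted dropout mask
        logits = (h * mask) @ W2
        e = np.exp(logits - logits.max())           # stable softmax
        probs.append(e / e.sum())
    return np.stack(probs)

def epistemic_uncertainty(x, seed=0, T=32):
    """Uncertainty score: total variance of the MC-dropout ensemble's
    class probabilities. Fixed seed makes the score deterministic in x."""
    probs = mc_dropout_probs(x, np.random.default_rng(seed), T=T)
    return probs.var(axis=0).sum()

def uncertainty_attack(x, eps=0.1, h=1e-3, seed=0):
    """FGSM-style step in the direction that *increases* epistemic
    uncertainty; gradient via central finite differences (a sketch)."""
    g = np.zeros_like(x)
    for i in range(x.size):
        d = np.zeros_like(x)
        d[i] = h
        g[i] = (epistemic_uncertainty(x + d, seed)
                - epistemic_uncertainty(x - d, seed)) / (2 * h)
    return x + eps * np.sign(g)

x = rng.normal(size=4)
x_adv = uncertainty_attack(x)  # candidate sample for adversarial training
```

In the paper's scheme such uncertainty-attacked samples would be folded back into training; here the attack is shown in isolation, and the finite-difference gradient merely substitutes for backpropagation to keep the sketch dependency-free.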
