使用贝叶斯神经网络进行乳腺癌筛查的有效置信度措施评估度量

论文标题

使用贝叶斯神经网络进行乳腺癌筛查的有效置信度措施评估度量

An Efficient Confidence Measure-Based Evaluation Metric for Breast Cancer Screening Using Bayesian Neural Networks

论文作者

Tabassum, Anika, Khan, Naimul

论文摘要

筛查乳房X线照片是早期检测乳腺癌的黄金标准。虽然在乳房X线摄影图像分类（尤其是在深层神经网络上）进行了大量工作，但对分类的信心或不确定性测量并没有太多探索。在本文中，我们提出了一个基于置信度措施的评估度量，以供乳腺癌筛查。我们提出了一个模块化网络体系结构，其中传统的神经网络被用作传输学习的功能提取器，然后是简单的贝叶斯神经网络。使用两阶段方法有助于降低计算复杂性，从而使所提出的框架对更广泛的部署有吸引力。我们表明，通过为医生提供一个工具来调整贝叶斯神经网络的两个超参数，即，采样数量的网络数量和最小概率的一部分，域专家可以根据需要对框架进行调整。最后，我们认为，不仅可以将元组（准确性，覆盖范围，采样网络数量和最小概率）用作我们框架的评估度量标准，而不仅仅是单个数字（准确性，覆盖率，采样数量和最小概率）。我们在CBIS-DDSM数据集上提供了实验结果，在此，我们在调整两个超参数时显示了准确覆盖折衷方案的趋势。我们还表明，与基线转移学习相比，我们的置信度调整以高度置信度降低，从而提高了准确性。为了使所提出的框架容易部署，我们在https://git.io/jvrqe上提供（匿名）源代码，并具有可重复的结果。

Screening mammograms is the gold standard for detecting breast cancer early. While a good amount of work has been performed on mammography image classification, especially with deep neural networks, there has not been much exploration into the confidence or uncertainty measurement of the classification. In this paper, we propose a confidence measure-based evaluation metric for breast cancer screening. We propose a modular network architecture, where a traditional neural network is used as a feature extractor with transfer learning, followed by a simple Bayesian neural network. Utilizing a two-stage approach helps reducing the computational complexity, making the proposed framework attractive for wider deployment. We show that by providing the medical practitioners with a tool to tune two hyperparameters of the Bayesian neural network, namely, fraction of sampled number of networks and minimum probability, the framework can be adapted as needed by the domain expert. Finally, we argue that instead of just a single number such as accuracy, a tuple (accuracy, coverage, sampled number of networks, and minimum probability) can be utilized as an evaluation metric of our framework. We provide experimental results on the CBIS-DDSM dataset, where we show the trends in accuracy-coverage tradeoff while tuning the two hyperparameters. We also show that our confidence tuning results in increased accuracy with a reduced set of images with high confidence when compared to the baseline transfer learning. To make the proposed framework readily deployable, we provide (anonymized) source code with reproducible results at https://git.io/JvRqE.

下载PDF全文

下载文献需遵守相关版权规定

论文标题