论文标题
小儿肺炎诊断的卷积神经网络模型的合奏
Ensembles of Convolutional Neural Networks models for pediatric pneumonia diagnosis
论文作者
论文摘要
肺炎是一种肺部感染,导致15%的儿童死亡率,每年超过80万名5岁以下儿童。该病理主要由病毒或细菌引起。 X射线成像分析是肺炎诊断最常用的方法之一。可以使用机器学习方法(例如卷积神经网络(CNN))来分析这些临床图像,这些方法学会了为分类提取关键特征。但是,由于缺乏解释性,这些系统的可用性受到医学的限制,因为这些模型不能用于产生可理解的解释(从基于人类的角度来看),即它们如何达到这些结果。困难该技术影响的另一个问题是许多医学领域的标记数据有限。这项工作的主要贡献是两个方面:第一个是设计新的可解释人工智能(XAI)技术的设计,该技术基于合并从集合中的每个模型获得的单个热图。这允许克服CNN“黑匣子”的解释性和可解释性问题,突出了图像的那些区域,这些区域与生成分类更相关。第二个是开发新的集合深度学习模型,以对胸部X射线进行分类,从而可以使用小型数据集进行训练。我们使用质量低下和解剖学变异性(代表最大的挑战之一)的小儿X射线(950个样本)的小数据集测试了我们的合奏模型。我们还测试了其他策略,例如使用Chexnet训练的单个CNN和转移学习。我们的结果表明,我们的合奏模型优于这些策略获得高度竞争成果。最后,我们使用另一个肺炎诊断数据集证实了方法的鲁棒性[1]。
Pneumonia is a lung infection that causes 15% of childhood mortality, over 800,000 children under five every year, all over the world. This pathology is mainly caused by viruses or bacteria. X-rays imaging analysis is one of the most used methods for pneumonia diagnosis. These clinical images can be analyzed using machine learning methods such as convolutional neural networks (CNN), which learn to extract critical features for the classification. However, the usability of these systems is limited in medicine due to the lack of interpretability, because of these models cannot be used to generate an understandable explanation (from a human-based perspective), about how they have reached those results. Another problem that difficults the impact of this technology is the limited amount of labeled data in many medicine domains. The main contributions of this work are two fold: the first one is the design of a new explainable artificial intelligence (XAI) technique based on combining the individual heatmaps obtained from each model in the ensemble. This allows to overcome the explainability and interpretability problems of the CNN "black boxes", highlighting those areas of the image which are more relevant to generate the classification. The second one is the development of new ensemble deep learning models to classify chest X-rays that allow highly competitive results using small datasets for training. We tested our ensemble model using a small dataset of pediatric X-rays (950 samples) with low quality and anatomical variability (which represents one of the biggest challenges). We also tested other strategies such as single CNNs trained from scratch and transfer learning using CheXNet. Our results show that our ensemble model outperforms these strategies obtaining highly competitive results. Finally, we confirmed the robustness of our approach using another pneumonia diagnosis dataset [1].