论文标题
地球观察中的半监督语义分割:微型套件,数据集分析和多任务网络研究
Semi-Supervised Semantic Segmentation in Earth Observation: The MiniFrance Suite, Dataset Analysis and Multi-task Network Study
论文作者
论文摘要
半监督学习技术的发展对于增强机器学习算法的概括能力至关重要。实际上,在标签稀缺时,原始图像数据很丰富,因此利用未标记的输入来构建更好的模型至关重要。大型数据库的可用性一直是开发具有高级性能的学习算法的关键。 尽管机器学习在地球观察中的主要作用是推导诸如土地覆盖地图之类的产品,但现场数据集仍受到限制,要么是由于表面覆盖量适中,缺乏各种场景或限制类别以识别。我们介绍了一个新型的大型数据集,用于地球观察中的半监督语义分割,即Minifrance Suite。 Minifrance具有几种前所未有的特性:它是大规模的,包含2000多个非常高分辨率的航空图像,占超过2000亿个样品(像素);它各不相同,涵盖了法国的16个Conurbations,具有各种气候,不同的景观,城市以及乡村场景;考虑到具有高级语义的土地使用课程,这是充满挑战的。然而,最独特的小型质量是该领域中唯一专门为半监视学习而设计的数据集:它在其培训分区中包含标记和未标记的图像,它重现了类似生活的场景。除此数据集外,我们还提供了有关外观相似性和对Minifrance数据的全面研究的数据代表性分析的工具,表明它适合于学习并在半监督的环境中良好地概括。最后,我们基于多任务学习和有关Minifrance的第一个实验,介绍了半监督的深度体系结构。
The development of semi-supervised learning techniques is essential to enhance the generalization capacities of machine learning algorithms. Indeed, raw image data are abundant while labels are scarce, therefore it is crucial to leverage unlabeled inputs to build better models. The availability of large databases have been key for the development of learning algorithms with high level performance. Despite the major role of machine learning in Earth Observation to derive products such as land cover maps, datasets in the field are still limited, either because of modest surface coverage, lack of variety of scenes or restricted classes to identify. We introduce a novel large-scale dataset for semi-supervised semantic segmentation in Earth Observation, the MiniFrance suite. MiniFrance has several unprecedented properties: it is large-scale, containing over 2000 very high resolution aerial images, accounting for more than 200 billions samples (pixels); it is varied, covering 16 conurbations in France, with various climates, different landscapes, and urban as well as countryside scenes; and it is challenging, considering land use classes with high-level semantics. Nevertheless, the most distinctive quality of MiniFrance is being the only dataset in the field especially designed for semi-supervised learning: it contains labeled and unlabeled images in its training partition, which reproduces a life-like scenario. Along with this dataset, we present tools for data representativeness analysis in terms of appearance similarity and a thorough study of MiniFrance data, demonstrating that it is suitable for learning and generalizes well in a semi-supervised setting. Finally, we present semi-supervised deep architectures based on multi-task learning and the first experiments on MiniFrance.