论文标题

使用场景图检测视觉关系:调查

Visual Relationship Detection using Scene Graphs: A Survey

论文作者

Agarwal, Aniket, Mangal, Ayush, Vipul

论文摘要

通过解码图像中描述的视觉关系了解场景一直是一个长期研究的问题。尽管深度学习和深度神经网络的使用的最新进展已经在许多任务上取得了几乎人为的准确性,但在各种视觉关系检测任务方面,人类和机器水平的性能之间仍然存在很大的差距。开发着诸如对象识别,细分和字幕之类的早期任务,该任务集中在相对较粗的图像理解上,最近引入了更新的任务,以处理更优质的图像理解。场景图是一种更好地代表场景及其中存在的各种关系的技术。它在各种任务中进行了广泛的应用,例如视觉问题回答,语义图像检索,图像生成等等,它已被证明是一种有用的工具,可用于深入,更好的视觉关系理解。在本文中,我们介绍了有关场景图生成的各种技术的详细调查,它们代表视觉关系的功效以及如何用于解决各种下游任务。我们还试图分析该领域将来可能发展的各种未来方向。作为对该主题进行详细调查的首批论文之一,我们还希望在为其应用程序开发方法时简要介绍场景图和指导从业者。

Understanding a scene by decoding the visual relationships depicted in an image has been a long studied problem. While the recent advances in deep learning and the usage of deep neural networks have achieved near human accuracy on many tasks, there still exists a pretty big gap between human and machine level performance when it comes to various visual relationship detection tasks. Developing on earlier tasks like object recognition, segmentation and captioning which focused on a relatively coarser image understanding, newer tasks have been introduced recently to deal with a finer level of image understanding. A Scene Graph is one such technique to better represent a scene and the various relationships present in it. With its wide number of applications in various tasks like Visual Question Answering, Semantic Image Retrieval, Image Generation, among many others, it has proved to be a useful tool for deeper and better visual relationship understanding. In this paper, we present a detailed survey on the various techniques for scene graph generation, their efficacy to represent visual relationships and how it has been used to solve various downstream tasks. We also attempt to analyze the various future directions in which the field might advance in the future. Being one of the first papers to give a detailed survey on this topic, we also hope to give a succinct introduction to scene graphs, and guide practitioners while developing approaches for their applications.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源