SRVIO：用于动态环境的超强视觉惯性进程和具有挑战性的循环闭合条件

论文标题

SRVIO：用于动态环境的超强视觉惯性进程和具有挑战性的循环闭合条件

SRVIO: Super Robust Visual Inertial Odometry for dynamic environments and challenging Loop-closure conditions

论文作者

Samadzadeh, Ali, Nickabadi, Ahmad

论文摘要

在过去的几十年中，关于自动机器人和虚拟现实的视觉定位和探光量进行了广泛的研究。传统上，借助昂贵的传感器（例如LiDars）解决了这个问题。如今，该领域的主要研究重点是使用更多经济传感器（例如相机和IMU）进行稳健的本地化。因此，几何视觉定位方法已经变得更加准确。但是，这些方法在充满挑战的环境中，例如充满动人的人的房间中仍然遭受了巨大的损失和差异的困扰。科学家开始使用深层神经网络（DNN）来减轻此问题。使用DNN的主要思想是更好地理解数据的挑战性方面并克服复杂的条件，例如在相机前的动态物体移动，涵盖了相机的完整视图，极端的照明条件和相机的高速。先前的端到端DNN方法已经克服了其中一些挑战。但是，没有一般和健壮的框架可以一起克服所有挑战。在本文中，我们将基于几何和DNN的方法结合在一起，以具有几何大满贯框架的一般性和速度，并在DNN的帮助下克服了大多数这些具有挑战性的条件，并提供了迄今为止最强大的框架。为此，我们设计了一个基于VINS-MONO的框架，并表明它能够与基于DNN的几何和端到端DNN的SLAM相比，在TUM-DYNAGIC，TUM-VI，ADVIO和EUROC数据集中实现最先进的结果。我们提出的框架也可以在类似于上述挑战的极端模拟案件上取得出色的结果。

There has been extensive research on visual localization and odometry for autonomous robots and virtual reality during the past decades. Traditionally, this problem has been solved with the help of expensive sensors, such as lidars. Nowadays, the focus of the leading research in this field is on robust localization using more economic sensors, such as cameras and IMUs. Consequently, geometric visual localization methods have become more accurate in time. However, these methods still suffer from significant loss and divergence in challenging environments, such as a room full of moving people. Scientists started using deep neural networks (DNNs) to mitigate this problem. The main idea behind using DNNs is to better understand challenging aspects of the data and overcome complex conditions such as the movement of a dynamic object in front of the camera that covers the full view of the camera, extreme lighting conditions, and high speed of the camera. Prior end-to-end DNN methods have overcome some of these challenges. However, no general and robust framework is available to overcome all challenges together. In this paper, we have combined geometric and DNN-based methods to have the generality and speed of geometric SLAM frameworks and overcome most of these challenging conditions with the help of DNNs and deliver the most robust framework so far. To do so, we have designed a framework based on Vins-Mono, and show that it is able to achieve state-of-the-art results on TUM-Dynamic, TUM-VI, ADVIO, and EuRoC datasets compared to geometric and end-to-end DNN based SLAMs. Our proposed framework could also achieve outstanding results on extreme simulated cases resembling the aforementioned challenges.

下载PDF全文

下载文献需遵守相关版权规定

论文标题