论文标题

通过稀疏门控学习强大的动态

Learning Robust Dynamics through Variational Sparse Gating

论文作者

Jain, Arnav Kumar, Sujit, Shivakanth, Joshi, Shruti, Michalski, Vincent, Hafner, Danijar, Ebrahimi-Kahou, Samira

论文摘要

从他们的感觉投入中学习世界模型,使代理可以通过想象他们的未来结果来计划行动。以前已显示世界模型可以改善几乎没有对象的模拟环境中的样本效率,但尚未成功应用于许多对象的环境。在具有许多对象的环境中,通常只有少数在同一时间移动或交互。在本文中,我们研究了将稀疏相互作用的这种感应偏置整合到从像素训练的世界模型的潜在动力学中。首先,我们引入了稀疏门控(VSG),这是一种潜在动力学模型,通过随机二进制门更新其特征尺寸。此外,我们提出了一个简化的体系结构简单的变分稀疏门控(SVSG),该缩小门控删除了先前模型的确定性途径,从而产生了利用VSG机制的完全随机过渡函数。我们评估了带有大量移动对象和部分可观察性的Briseback -ShapeS(BBS)环境中的两个模型架构,并证明了对先前模型的明显改进。

Learning world models from their sensory inputs enables agents to plan for actions by imagining their future outcomes. World models have previously been shown to improve sample-efficiency in simulated environments with few objects, but have not yet been applied successfully to environments with many objects. In environments with many objects, often only a small number of them are moving or interacting at the same time. In this paper, we investigate integrating this inductive bias of sparse interactions into the latent dynamics of world models trained from pixels. First, we introduce Variational Sparse Gating (VSG), a latent dynamics model that updates its feature dimensions sparsely through stochastic binary gates. Moreover, we propose a simplified architecture Simple Variational Sparse Gating (SVSG) that removes the deterministic pathway of previous models, resulting in a fully stochastic transition function that leverages the VSG mechanism. We evaluate the two model architectures in the BringBackShapes (BBS) environment that features a large number of moving objects and partial observability, demonstrating clear improvements over prior models.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源