解剖神经氧

论文标题

Dissecting Neural ODEs

论文作者

Massaroli, Stefano, Poli, Michael, Park, Jinkyoo, Yamashita, Atsushi, Asama, Hajime

论文摘要

连续深度学习架构最近重新出现为神经普通微分方程（神经odes）。这种无限的深度方法理论上弥合了深度学习和动态系统之间的差距，从而提供了一种新颖的视角。但是，对这些模型的内部工作进行解密仍然是一个开放的挑战，因为大多数应用程序将它们用作通用黑盒模块。在这项工作中，我们“打开盒子”，进一步开发连续的深度配方，目的是阐明几种设计选择对基础动力学的影响。

Continuous deep learning architectures have recently re-emerged as Neural Ordinary Differential Equations (Neural ODEs). This infinite-depth approach theoretically bridges the gap between deep learning and dynamical systems, offering a novel perspective. However, deciphering the inner working of these models is still an open challenge, as most applications apply them as generic black-box modules. In this work we "open the box", further developing the continuous-depth formulation with the aim of clarifying the influence of several design choices on the underlying dynamics.

下载PDF全文

下载文献需遵守相关版权规定

论文标题