论文标题

解剖神经氧

Dissecting Neural ODEs

论文作者

Massaroli, Stefano, Poli, Michael, Park, Jinkyoo, Yamashita, Atsushi, Asama, Hajime

论文摘要

连续深度学习架构最近重新出现为神经普通微分方程(神经odes)。这种无限的深度方法理论上弥合了深度学习和动态系统之间的差距,从而提供了一种新颖的视角。但是,对这些模型的内部工作进行解密仍然是一个开放的挑战,因为大多数应用程序将它们用作通用黑盒模块。在这项工作中,我们“打开盒子”,进一步开发连续的深度配方,目的是阐明几种设计选择对基础动力学的影响。

Continuous deep learning architectures have recently re-emerged as Neural Ordinary Differential Equations (Neural ODEs). This infinite-depth approach theoretically bridges the gap between deep learning and dynamical systems, offering a novel perspective. However, deciphering the inner working of these models is still an open challenge, as most applications apply them as generic black-box modules. In this work we "open the box", further developing the continuous-depth formulation with the aim of clarifying the influence of several design choices on the underlying dynamics.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源