论文标题
HPC系统上计算流体动力学的深入增强学习
Deep Reinforcement Learning for Computational Fluid Dynamics on HPC Systems
论文作者
论文摘要
强化学习(RL)非常适合在动态系统的背景下制定控制策略。这种动态系统的一个突出实例是管理流体动力学的方程式系统。最近的研究结果表明,RL增强的计算流体动力学(CFD)求解器可以超过当前的最新水平,例如在湍流建模领域。但是,在监督学习中,可以以离线方式生成培训数据,但RL需要在培训期间与CFD求解器进行持续的运行时交互和数据交换。为了利用RL增强CFD的潜力,必须在高性能计算(HPC)硬件上有效地实现CFD求解器与RL算法之间的相互作用。为此,我们将Rrexi作为可扩展的RL框架呈现,它在HPC系统上弥合了机器学习工作流程与现代CFD求解器之间的差距,从而为这两个组件提供了其专用硬件。 REREXI是考虑到模块化的,可以通过SmartSIM库提供的内存数据传输轻松整合各种HPC求解器。在这里,我们证明了Rrexi框架可以在数千个内核上扩展多达数百个平行环境。这允许现代HPC资源实现更大的问题或更快的周转时间。最后,我们通过在大型涡流模拟中找到最佳涡流粘度选择的控制策略来证明RL-augment的CFD求解器的潜力。
Reinforcement learning (RL) is highly suitable for devising control strategies in the context of dynamical systems. A prominent instance of such a dynamical system is the system of equations governing fluid dynamics. Recent research results indicate that RL-augmented computational fluid dynamics (CFD) solvers can exceed the current state of the art, for example in the field of turbulence modeling. However, while in supervised learning, the training data can be generated a priori in an offline manner, RL requires constant run-time interaction and data exchange with the CFD solver during training. In order to leverage the potential of RL-enhanced CFD, the interaction between the CFD solver and the RL algorithm thus have to be implemented efficiently on high-performance computing (HPC) hardware. To this end, we present Relexi as a scalable RL framework that bridges the gap between machine learning workflows and modern CFD solvers on HPC systems providing both components with its specialized hardware. Relexi is built with modularity in mind and allows easy integration of various HPC solvers by means of the in-memory data transfer provided by the SmartSim library. Here, we demonstrate that the Relexi framework can scale up to hundreds of parallel environment on thousands of cores. This allows to leverage modern HPC resources to either enable larger problems or faster turnaround times. Finally, we demonstrate the potential of an RL-augmented CFD solver by finding a control strategy for optimal eddy viscosity selection in large eddy simulations.