论文标题

现场可编程门阵列上的高性能光谱元素方法

High-Performance Spectral Element Methods on Field-Programmable Gate Arrays

论文作者

Karp, Martin, Podobas, Artur, Jansson, Niclas, Kenter, Tobias, Plessl, Christian, Schlatter, Philipp, Markidis, Stefano

论文摘要

历史上,计算机系统的改进依赖于两个众所周知的观察结果:摩尔定律和丹纳德的扩展。如今,这两个观察结果都在结束,迫使计算机用户,研究人员和从业人员放弃通用体系结构的舒适感,而倾向于新兴的潮汐后系统。在这些后期后系统中,最突出的是现场编程的门阵列(FPGA),它在复杂性和性能之间达到了方便的平衡。在本文中,我们研究了现代FPGA在将光谱元素方法(SEM)核心加速到许多计算流体动力学(CFD)应用中的适用性。我们设计了一个以双重精神运行的自定义SEM硬件加速器,我们在最新的Stratix 10 GX系列FPGAS上进行了经验评估,并将其性能(以及发电效率)定位为ARM Thunderx2,Nvidia Pascal/vidia pascal/volta/volta/ampere tesla-series cards等最先进的系统,并将其定位。最后,我们为我们的SEM-Accelerator开发了一个性能模型,我们用它来投影FPGAS的性能和加速CFD应用程序的作用,最终回答了一个问题:哪些特征是CFD应用程序具有什么特征?

Improvements in computer systems have historically relied on two well-known observations: Moore's law and Dennard's scaling. Today, both these observations are ending, forcing computer users, researchers, and practitioners to abandon the general-purpose architectures' comforts in favor of emerging post-Moore systems. Among the most salient of these post-Moore systems is the Field-Programmable Gate Array (FPGA), which strikes a convenient balance between complexity and performance. In this paper, we study modern FPGAs' applicability in accelerating the Spectral Element Method (SEM) core to many computational fluid dynamics (CFD) applications. We design a custom SEM hardware accelerator operating in double-precision that we empirically evaluate on the latest Stratix 10 GX-series FPGAs and position its performance (and power-efficiency) against state-of-the-art systems such as ARM ThunderX2, NVIDIA Pascal/Volta/Ampere Tesla-series cards, and general-purpose manycore CPUs. Finally, we develop a performance model for our SEM-accelerator, which we use to project future FPGAs' performance and role to accelerate CFD applications, ultimately answering the question: what characteristics would a perfect FPGA for CFD applications have?

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源