论文标题

计算并行化的加速和效率:一种统一的方法和渐近分析

Speedup and efficiency of computational parallelization: A unifying approach and asymptotic analysis

论文作者

Schryen, Guido

论文摘要

在高性能计算环境中,我们观察到可用核心数量的持续增加。这种发展要求重新强调绩效(可伸缩性)分析和加速法律,如文献(例如Amdahl的定律和Gustafson定律),重点是渐近性能。了解算法并行性的加速和效率问题可用于多种目的,包括系统操作的优化,对程序执行的时间预测以及渐近属性的分析以及确定加速范围。但是,文献是分散的,并显示了加速模型和法律的大量多样性和异质性。这些现象使得获得模型及其关系的概述,以确定在给定算法和计算环境中性能的决定因素,最后确定性能模型和法律对特定平行计算设置的适用性。在这项工作中,我们为均质计算环境提供了通用的加速(以及效率)模型。我们的方法概括了文献中建议的许多突出模型,并允许表明它们可以被视为统一方法的特殊情况。统一加速模型的通用性是通过参数化实现的。考虑到参数范围的组合,我们确定了六个不同的渐近加速病例和八个不同的渐近效率病例。共同应用这些加速和效率案例,我们得出了11个可伸缩性案例,从中我们构建了可扩展性类型。研究人员可以利用我们的类型学来分类其加速模型,并在平行处理单元数量增加时确定渐近行为。此外,我们的结果可用于解决我们设置的各种扩展。

In high performance computing environments, we observe an ongoing increase in the available numbers of cores. This development calls for re-emphasizing performance (scalability) analysis and speedup laws as suggested in the literature (e.g., Amdahl's law and Gustafson's law), with a focus on asymptotic performance. Understanding speedup and efficiency issues of algorithmic parallelism is useful for several purposes, including the optimization of system operations, temporal predictions on the execution of a program, and the analysis of asymptotic properties and the determination of speedup bounds. However, the literature is fragmented and shows a large diversity and heterogeneity of speedup models and laws. These phenomena make it challenging to obtain an overview of the models and their relationships, to identify the determinants of performance in a given algorithmic and computational context, and, finally, to determine the applicability of performance models and laws to a particular parallel computing setting. In this work, we provide a generic speedup (and thus also efficiency) model for homogeneous computing environments. Our approach generalizes many prominent models suggested in the literature and allows showing that they can be considered special cases of a unifying approach. The genericity of the unifying speedup model is achieved through parameterization. Considering combinations of parameter ranges, we identify six different asymptotic speedup cases and eight different asymptotic efficiency cases. Jointly applying these speedup and efficiency cases, we derive eleven scalability cases, from which we build a scalability typology. Researchers can draw upon our typology to classify their speedup model and to determine the asymptotic behavior when the number of parallel processing units increases. In addition, our results may be used to address various extensions of our setting.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源