论文标题
有效的浮点旋转单元
Efficient Floating-Point Givens Rotation Unit
论文作者
论文摘要
高通量QR分解是许多高级信号处理和通信应用中的关键操作。对于其中一些应用程序,使用浮点计算几乎是强制性的。但是,在用于嵌入式系统的浮点QR分解的硬件实现中,存在稀缺作品。在本文中,我们提出了一个非常有效的高通量浮点旋转单元,以进行QR分解。此外,通过使用新的半单位偏见格式,可以增强常规数字格式的最初提议的设计。提供的错误分析显示了我们的提案的有效性和不同实施参数的权衡。还提出了FPGA实施结果,并在两种方法之间进行了详尽的比较。这些实施结果还显示了与其他以前的类似设计相比,在面积,延迟和吞吐量方面进行了出色的改进。
High-throughput QR decomposition is a key operation in many advanced signal processing and communication applications. For some of these applications, using floating-point computation is becoming almost compulsory. However, there are scarce works in hardware implementations of floating-point QR decomposition for embedded systems. In this paper, we propose a very efficient high-throughput floating-point Givens rotation unit for QR decomposition. Moreover, the initial proposed design for conventional number formats is enhanced by using the new Half-Unit Biased format. The provided error analysis shows the effectiveness of our proposals and the trade-off of different implementation parameters. FPGA implementation results are also presented and a thorough comparison between both approaches. These implementation results also reveal outstanding improvements compared to other previous similar designs in terms of area, latency, and throughput.