一个完全管道的FPGA加速器，用于比例不变特征转换关键点描述符匹配，

论文标题

一个完全管道的FPGA加速器，用于比例不变特征转换关键点描述符匹配，

A fully pipelined FPGA accelerator for scale invariant feature transform keypoint descriptor matching,

论文作者

Daoud, Luka, Latif, Muhammad Kamran, Jacinto, H S., Rafla, Nader

论文摘要

比例不变特征变换（SIFT）算法被认为是计算机视觉领域内的经典特征提取算法。由于消耗的数据量，SIFT KEYPOINT描述符匹配是一个计算密集的过程。在这项工作中，我们设计了一个全新的管道上的硬件加速器架构，用于筛分键盘描述符匹配。在现场可编程门阵列（FPGA）上实现并测试了加速器核心。所提出的硬件体系结构能够正确处理完整的实现实现所需的内存带宽，并达到了车顶线的性能模型，从而实现了潜在的最大吞吐量。完全管道的匹配体系结构是基于Consine角距离方法设计的。我们的体系结构已针对16位定点操作进行了优化，并使用基于Xilinx Zynq的FPGA开发板在硬件上实施。我们提出的架构显示，与文献中的相比，区域资源显着减少，同时通过减轻记忆带宽限制来保持高通量。结果表明，在LUTS和79％的Brams中，消耗的设备资源最多减少了91％。我们的硬件实现比可比的软件方法快15.7倍。

The scale invariant feature transform (SIFT) algorithm is considered a classical feature extraction algorithm within the field of computer vision. SIFT keypoint descriptor matching is a computationally intensive process due to the amount of data consumed. In this work, we designed a novel fully pipelined hardware accelerator architecture for SIFT keypoint descriptor matching. The accelerator core was implemented and tested on a field programmable gate array (FPGA). The proposed hardware architecture is able to properly handle the memory bandwidth necessary for a fully-pipelined implementation and hits the roofline performance model, achieving the potential maximum throughput. The fully pipelined matching architecture was designed based on the consine angle distance method. Our architecture was optimized for 16-bit fixed-point operations and implemented on hardware using a Xilinx Zynq-based FPGA development board. Our proposed architecture shows a noticeable reduction of area resources compared with its counterparts in literature, while maintaining high throughput by alleviating memory bandwidth restrictions. The results show a reduction in consumed device resources of up to 91 percent in LUTs and 79 percent of BRAMs. Our hardware implementation is 15.7 times faster than the comparable software approach.

下载PDF全文

下载文献需遵守相关版权规定

论文标题