Abstract:Visual feature extraction is a key technology of computer vision for intelligent video processing. Efficient feature extraction is a fundamental problem in computer vision applications. Scale-Invariant Feature Transform (SIFT) is one of the most popular feature extraction algorithms because SIFT features are invariant to image scale and rotation and robust to changes in illumination and noise. However, SIFT is a computationally-intensive and power-hungry algorithm, which needs to be accelerated by efficient hardware design to achieve both high-speed feature extraction and high energy efficiency for many high frame-rate video applications at Artificial-intelligent Internet of Things edges. In this work, an energy-efficient SIFT based feature extraction accelerator is proposed. In the Gaussian pyramid and Differences of Gaussian (DoG) pyramid construction process, three design methods are proposed to reduce power consumption and improve information fidelity: a fast and slow dual clock domain design method with a reconfigurable design strategy is proposed to reduce the computation resources; a partial sum reuse design method is proposed to further reduce the computation resources and the amount of computation; a dynamic padding design method is proposed to solve the problem of information loss at image edges and corners after convolution operation. In the keypoint descriptor generation process, an optimized algorithm using circular region and polar coordinates is proposed to parallelize the main orientation assignment and descriptor generation to achieve high-speed processing, while maintaining a comparable matching accuracy with the state-of-the-art designs. The experiment results show that the proposed SIFT hardware accelerator is able to extract features by up to 162 frames per second ( $640\times 480$ pixels) under 100 MHz, with the power consumption of 364.26 mW and energy efficiency of 2.25 mJ/frame based on 180 nm technology, which is suitable for many high frame-rate AIoT applications including autonomous driving cars and unmanned aerial vehicles.

An Energy-Efficient SIFT Based Feature Extraction Accelerator for High Frame-Rate Video Applications.

Implementation of Parallel Acceleration for Real-time Extraction of Visual Features.

A Fast and Power-Efficient Hardware Architecture for Visual Feature Detection in Affine-SIFT.

An Efficient Hardware Architecture of the Optimised SIFT Descriptor Generation.

An Ultra-Fast and Low-Power Design of Analog Circuit Network for DoG Pyramid Construction of SIFT Algorithm

A Hardware Accelerated Scale Invariant Feature Detector For Real-Time Visual Localization And Mapping

ASP-SIFT: Using Analog Signal Processing Architecture to Accelerate Keypoint Detection of SIFT Algorithm

A Hardware Accelerator with Variable Pixel Representation & Skip Mode Prediction for Feature Point Detection Part of SIFT Algorithm.

A 83fps 1080P Resolution 354 Mw Silicon Implementation for Computing the Improved Robust Feature in Affine Space

A 135-Frames/s 1080p 87.5-Mw Binary-Descriptor-Based Image Feature Extraction Accelerator

A 127 fps in full hd accelerator based on optimized AKAZE with efficiency and effectiveness for image feature extraction

A Parallel Analysis on Scale Invariant Feature Transform (SIFT) Algorithm.

SIFT Implementation and Optimization for Multi-Core Systems.

An Efficient VLSI Architecture of Speeded-Up Robust Feature Extraction for High Resolution and High Frame Rate Video

A Precision-Improved Processing Architecture of Physical Computing for Energy-Efficient SIFT Feature Extraction

FPGA-Based Feature Extraction and Tracking Accelerator for Real-Time Visual SLAM

An Accelerated and Flexible SIFT Parallel-Computing Approach Based on the General Multi-Core Platform

Adaptive Pipeline Parallelism for Image Feature Extraction Algorithms

Parallelization And Optimization Of Sift On Gpu Using Cuda

A 65-Nm Energy-Efficient Interframe Data Reuse Neural Network Accelerator for Video Applications

Parallelization of Computing-Intensive Tasks of SIFT Algorithm on a Reconfigurable Architecture System