Abstract:Binary neural networks (BNNs) have 1-bit weights and activations. Such networks are well suited for FPGAs, as their dominant computations are bitwise arithmetic and the memory requirement is also significantly reduced. However, compared to start-of-the-art compact convolutional neural network (CNN) models, BNNs tend to produce a much lower accuracy on realistic datasets such as ImageNet. In addition, the input layer of BNNs has gradually become a major compute bottleneck, because it is conventionally excluded from binarization to avoid a large accuracy loss. This work proposes FracBNN, which exploits fractional activations to substantially improve the accuracy of BNNs. Specifically, our approach employs a dual-precision activation scheme to compute features with up to two bits, using an additional sparse binary convolution. We further binarize the input layer using a novel thermometer encoding. Overall, FracBNN preserves the key benefits of conventional BNNs, where all convolutional layers are computed in pure binary MAC operations (BMACs). We design an efficient FPGA-based accelerator for our novel BNN model that supports the fractional activations. To evaluate the performance of FracBNN under a resource-constrained scenario, we implement the entire optimized network architecture on an embedded FPGA (Xilinx Ultra96 v2). Our experiments on ImageNet show that FracBNN achieves an accuracy comparable to MobileNetV2, surpassing the best-known BNN design on FPGAs with an increase of 28.9% in top-1 accuracy and a 2.5x reduction in model size. FracBNN also outperforms a recently introduced BNN model with an increase of 2.4% in top-1 accuracy while using the same model size. On the embedded FPGA device, FracBNN demonstrates the ability of real-time image classification.

FP-BNN: Binarized neural network on FPGA

A Near Memory Computing FPGA Architecture for Neural Network Acceleration

Accelerating Binarized Convolutional Neural Networks with Software-Programmable FPGAs.

FCA-BNN: Flexible and Configurable Accelerator for Binarized Neural Networks on FPGA.

A high-throughput scalable BNN accelerator with fully pipelined architecture

FracBNN: Accurate and FPGA-Efficient Binary Neural Networks with Fractional Activations.

An Approach of Binary Neural Network Energy-Efficient Implementation

Binary Neural Networks in FPGAs: Architectures, Tool Flows and Hardware Comparisons

Fast and Light-weight Binarized Neural Network Implemented in an FPGA using LUT-based Signal Processing and its Time-domain Extension for Multi-bit Processing

Accelerating Low Bit-Width Convolutional Neural Networks with Embedded FPGA.

Spike Trains Encoding Optimization for Spiking Neural Networks Implementation in FPGA

FP-DNN: an Automated Framework for Mapping Deep Neural Networks Onto FPGAs with RTL-HLS Hybrid Templates

AddNet: Deep Neural Networks Using FPGA-Optimized Multipliers

High-Performance FPGA-based Accelerator for Bayesian Neural Networks

A FPGA-based Hardware Accelerator for Bayesian Confidence Propagation Neural Network

An FPGA implementation of Bayesian inference with spiking neural networks

Hardware implementation of spiking neural networks on FPGA

Deep neural network accelerator based on FPGA

FPGA-based implementation of deep neural network using stochastic computing

Deploying deep learning networks based advanced techniques for image processing on FPGA platform

Hardware Platform-Aware Binarized Neural Network Model Optimization