An Efficient General-Purpose Optical Accelerator for Neural Networks

Sijie Fei,Amro Eldebiky,Grace Li Zhang,Bing Li,Ulf Schlichtmann

2024-09-02

Abstract:General-purpose optical accelerators (GOAs) have emerged as a promising platform to accelerate deep neural networks (DNNs) due to their low latency and energy consumption. Such an accelerator is usually composed of a given number of interleaving Mach-Zehnder- Interferometers (MZIs). This interleaving architecture, however, has a low efficiency when accelerating neural networks of various sizes due to the mismatch between weight matrices and the GOA architecture. In this work, a hybrid GOA architecture is proposed to enhance the mapping efficiency of neural networks onto the GOA. In this architecture, independent MZI modules are connected with microring resonators (MRRs), so that they can be combined to process large neural networks efficiently. Each of these modules implements a unitary matrix with inputs adjusted by tunable coefficients. The parameters of the proposed architecture are searched using genetic algorithm. To enhance the accuracy of neural networks, selected weight matrices are expanded to multiple unitary matrices applying singular value decomposition (SVD). The kernels in neural networks are also adjusted to use up the on-chip computational resources. Experimental results show that with a given number of MZIs, the mapping efficiency of neural networks on the proposed architecture can be enhanced by 21.87%, 21.20%, 24.69%, and 25.52% for VGG16 and Resnet18 on datasets Cifar10 and Cifar100, respectively. The energy consumption and computation latency can also be reduced by over 67% and 21%, respectively.

Neural and Evolutionary Computing,Systems and Control

What problem does this paper attempt to address?

### The Problems Addressed by This Paper This paper primarily aims to address the following issues: 1. **Inefficient Mapping of General Optical Accelerators (GOA) in Neural Networks**: - Existing GOAs are typically composed of interleaved Mach-Zehnder Interferometer (MZI) arrays. This structure is inefficient when handling neural networks of different sizes because the weight matrix does not match the GOA architecture, leading to resource wastage. 2. **Large Area Cost**: - When representing arbitrary weight matrices, existing GOAs need to decompose the weight matrix into two unitary matrices and one diagonal matrix, which increases the required number of MZIs, resulting in a large area overhead. 3. **Improving the Execution Efficiency of Neural Networks**: - By improving the design of GOAs to enhance the mapping efficiency of neural networks, reducing energy consumption and computational delay, thereby improving the overall system performance. To address these issues, the authors propose a hybrid GOA architecture that connects independent MZI modules with microring resonators (MRR), which can handle large-scale neural networks more efficiently. Additionally, they optimize parameter configurations using a genetic algorithm to balance mapping efficiency, area, power consumption, and the cost of electro-optical conversion. Furthermore, a hardware-aware training method is proposed to adjust the neural network structure to fit the new GOA architecture and maintain the accuracy of the neural network by selectively recovering key weight matrices.

An Efficient General-Purpose Optical Accelerator for Neural Networks

A Near Memory Computing FPGA Architecture for Neural Network Acceleration

A Convolutional Neural Network Accelerator Architecture with Fine-Granular Mixed Precision Configurability.

Countering Variations and Thermal Effects for Accurate Optical Neural Networks.

O-HAS: Optical Hardware Accelerator Search for Boosting Both Acceleration Performance and Development Speed

Bitwise Neural Network Acceleration Using Silicon Photonics

GNA: Reconfigurable and Efficient Architecture for Generative Network Acceleration

A Winograd-based Integrated Photonics Accelerator for Convolutional Neural Networks

PIXEL: Photonic Neural Network Accelerator

A Small-Footprint Accelerator for Large-Scale Neural Networks

EnGN: A High-Throughput and Energy-Efficient Accelerator for Large Graph Neural Networks

Energy-Efficient Accelerator Design for Deformable Convolution Networks

OECS: A Novel Deep Convolutional Neural Network Accelerator Based on 3D Hybrid Optical-Electrical NoC

Reuse and Blend: Energy-Efficient Optical Neural Network Enabled by Weight Sharing

Scaling Analog Photonic Accelerators for Byte-Size, Integer General Matrix Multiply (GEMM) Kernels

Efficient Hardware Optimization Strategies For Deep Neural Networks Acceleration Chip

A Reconfigurable Spatial Architecture for Energy-Efficient Inception Neural Networks

A High Performance Multi-Bit-Width Booth Vector Systolic Accelerator for NAS Optimized Deep Learning Neural Networks

MEGA: A Memory-Efficient GNN Accelerator Exploiting Degree-Aware Mixed-Precision Quantization

Improving Efficiency in Neural Network Accelerator Using Operands Hamming Distance Optimization

Optimizing DNN Inference on Multi-Accelerator SoCs at Training-time