An Efficient General-Purpose Optical Accelerator for Neural Networks

Sijie Fei,Amro Eldebiky,Grace Li Zhang,Bing Li,Ulf Schlichtmann
2024-09-02
Abstract:General-purpose optical accelerators (GOAs) have emerged as a promising platform to accelerate deep neural networks (DNNs) due to their low latency and energy consumption. Such an accelerator is usually composed of a given number of interleaving Mach-Zehnder- Interferometers (MZIs). This interleaving architecture, however, has a low efficiency when accelerating neural networks of various sizes due to the mismatch between weight matrices and the GOA architecture. In this work, a hybrid GOA architecture is proposed to enhance the mapping efficiency of neural networks onto the GOA. In this architecture, independent MZI modules are connected with microring resonators (MRRs), so that they can be combined to process large neural networks efficiently. Each of these modules implements a unitary matrix with inputs adjusted by tunable coefficients. The parameters of the proposed architecture are searched using genetic algorithm. To enhance the accuracy of neural networks, selected weight matrices are expanded to multiple unitary matrices applying singular value decomposition (SVD). The kernels in neural networks are also adjusted to use up the on-chip computational resources. Experimental results show that with a given number of MZIs, the mapping efficiency of neural networks on the proposed architecture can be enhanced by 21.87%, 21.20%, 24.69%, and 25.52% for VGG16 and Resnet18 on datasets Cifar10 and Cifar100, respectively. The energy consumption and computation latency can also be reduced by over 67% and 21%, respectively.
Neural and Evolutionary Computing,Systems and Control
What problem does this paper attempt to address?
### The Problems Addressed by This Paper This paper primarily aims to address the following issues: 1. **Inefficient Mapping of General Optical Accelerators (GOA) in Neural Networks**: - Existing GOAs are typically composed of interleaved Mach-Zehnder Interferometer (MZI) arrays. This structure is inefficient when handling neural networks of different sizes because the weight matrix does not match the GOA architecture, leading to resource wastage. 2. **Large Area Cost**: - When representing arbitrary weight matrices, existing GOAs need to decompose the weight matrix into two unitary matrices and one diagonal matrix, which increases the required number of MZIs, resulting in a large area overhead. 3. **Improving the Execution Efficiency of Neural Networks**: - By improving the design of GOAs to enhance the mapping efficiency of neural networks, reducing energy consumption and computational delay, thereby improving the overall system performance. To address these issues, the authors propose a hybrid GOA architecture that connects independent MZI modules with microring resonators (MRR), which can handle large-scale neural networks more efficiently. Additionally, they optimize parameter configurations using a genetic algorithm to balance mapping efficiency, area, power consumption, and the cost of electro-optical conversion. Furthermore, a hardware-aware training method is proposed to adjust the neural network structure to fit the new GOA architecture and maintain the accuracy of the neural network by selectively recovering key weight matrices.