Abstract: A novel convolutional processor is proposed that uses the shifted spectral response of a pair of arrayed waveguide gratings (AWGs) to mimic the kernel shifts performed during image convolution. The inherent mixing of inputs in the AWG's spectral response eliminates the need for repetitive element‐wise computations and enables the simultaneous generation of convolved output maps. Convolutional neural networks are a powerful category of artificial neural networks that extract features from raw data, greatly reducing parametric complexity while improving pattern recognition and prediction accuracy. Optical neural networks promise to dramatically accelerate computing while maintaining low power consumption, even for high‐speed data streams running at hundreds of gigabits per second. Here, we propose an optical convolutional processor (CP) that leverages the spectral response of an AWG to increase convolution speed by eliminating repetitive element‐wise multiplication. Our design features a balanced AWG configuration that enables both the positive and negative weightings essential for convolutional kernels. A proof‐of‐concept 8‐bit resolution processor is implemented experimentally using a pair of AWGs with a broadband Mach–Zehnder interferometer (MZI) designed to achieve uniform weighting across the whole spectrum. Experimental results demonstrate the CP's effectiveness in edge detection, and the processor achieves 96% accuracy in a convolutional neural network for MNIST recognition. This approach can be extended to other common operations, such as pooling and deconvolution in Generative Adversarial Networks. It is also scalable to more complex networks, making it suitable for applications such as autonomous vehicles and real‐time video recognition.
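The idea of replacing repeated element-wise multiplication with shifted, weighted copies of the input, and of obtaining signed weights from a balanced pair of arms, can be illustrated numerically. The sketch below is only a software analogy and not the authors' optical model: the function name `balanced_shift_convolve`, the split into `pos_arm` and `neg_arm`, and the test signal are all assumptions made for illustration. It computes the sliding dot product used in CNN layers (cross-correlation, without kernel flipping).

```python
import numpy as np

def balanced_shift_convolve(signal, kernel):
    """Sliding dot product ('valid' mode) built from shifted, weighted input copies.

    Hypothetical analogy: each kernel tap weights one shifted copy of the input,
    and signed weights are formed as the difference of two non-negative arms.
    """
    signal = np.asarray(signal, dtype=float)
    kernel = np.asarray(kernel, dtype=float)

    # Split the signed kernel into two non-negative weight sets (one per arm).
    w_pos = np.clip(kernel, 0, None)
    w_neg = np.clip(-kernel, 0, None)

    n_out = signal.size - kernel.size + 1
    pos_arm = np.zeros(n_out)
    neg_arm = np.zeros(n_out)

    # Accumulate one shifted copy of the input per kernel tap.
    for k in range(kernel.size):
        shifted = signal[k:k + n_out]
        pos_arm += w_pos[k] * shifted
        neg_arm += w_neg[k] * shifted

    # Balanced subtraction restores the signed result.
    return pos_arm - neg_arm

# A step-like input and a simple signed edge-detection kernel (both illustrative).
signal = np.array([0, 0, 1, 1, 1, 0, 0], dtype=float)
kernel = np.array([-1.0, 0.0, 1.0])

result = balanced_shift_convolve(signal, kernel)
reference = np.correlate(signal, kernel, mode="valid")
print(result)
```

For this input the output is positive at the rising edge and negative at the falling edge, and it matches `np.correlate` in `"valid"` mode. All output positions are produced together from the same shifted copies, which is the property the abstract attributes to the AWG's spectral response.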

Accelerating Convolutional Processing by Harnessing Channel Shifts in Arrayed Waveguide Gratings

HDConv: Heterogeneous kernel-based dilated convolutions

A High Performance Reconfigurable Hardware Architecture for Lightweight Convolutional Neural Network

OpenMDSP: Extending OpenMP to Program Multi-Core DSPs

A High Utilization FPGA-Based Accelerator for Variable-Scale Convolutional Neural Network

High Throughput Multi-Channel Parallelized Diffraction Convolutional Neural Network Accelerator

YHFT-QDSP:High-Performance Heterogeneous Multi-Core DSP