Improving Scalability of Mach-Zehnder Interferometer Based Photonic Computers via Algorithm-Hardware Codesign

M. B. On,Luis El Srouji,Mahmoud Abdelghany,Yun-Jhu Lee,S. J. B. Yoo
DOI: https://doi.org/10.1109/JLT.2024.3439336
IF: 4.7
2024-11-15
Journal of Lightwave Technology
Abstract:Photonic computers promise to meet the high-throughput computational needs of advanced machine learning algorithms, predominantly matrix-vector multiplication, which challenge currently available digital computers. Photonic computing offers massively available optical bandwidth to multiplex the computations. Among the successfully demonstrated photonic matrix multipliers, Mach-Zehnder interferometer (MZI)-based circuits appeal due to their broadband operations. However, they suffer in terms of footprint compared to their electronic counterparts. Here, we explore two algorithm-hardware codesign approaches: continual learning and tensor-train decomposition to improve the scalability of the MZI-based photonic computers. Continual learning enables the same photonic hardware to learn new tasks without catastrophically forgetting previously learned ones. We experimentally demonstrated that a 4 × 4 photonic matrix unit can solve two independent linear classification problems simultaneously with $>$90% accuracy in both without requiring additional hardware resources. We utilize a gradient-free error descent training method to enable on-chip continual learning, which does not require a priori model of hardware imperfections and explicit expression of the loss function with respect to phase shifter parameters. We study and discuss the convergence behavior of the mentioned algorithm on-chip. Further, we train the photonic hardware to realize orthogonal matrices with positive and negative valued entries. Later, we use the hardware-implemented 4 × 4 tensor cores to emulate 8 × 8 and 16 × 16 tensor train decomposed matrices and analyze the error penalty due to individually trained tensor cores.
Engineering,Computer Science,Physics
What problem does this paper attempt to address?