ChatGPT at the Speed of Light: Optical Comb-Based Monolithic Photonic-Electronic Linear-Algebra Accelerators

Tzu-Chien Hsueh,Yeshaiahu Fainman,Bill Lin
2023-11-21
Abstract:This paper proposes to adopt advanced monolithic silicon-photonics integrated-circuits manufacturing capabilities to achieve a system-on-chip photonic-electronic linear-algebra accelerator with the features of optical comb-based broadband incoherent photo-detections and high-dimensional operations of consecutive matrix-matrix multiplications to enable substantial leaps in computation density and energy efficiency, with practical considerations of power/area overhead due to photonic-electronic on-chip conversions, integrations, and calibrations through holistic co-design approaches to support attention-head mechanism based deep-learning neural networks used in Large Language Models and other emergent applications.
Systems and Control,Emerging Technologies
What problem does this paper attempt to address?
The paper aims to address the following issues: 1. **Improving computational density and energy efficiency**: By adopting advanced monolithic silicon photonic integrated circuit manufacturing technology, an on-chip optoelectronic linear algebra accelerator is realized. This accelerator features broadband incoherent optoelectronic detection and can perform high-dimensional matrix-matrix multiplication, thereby significantly enhancing computational density and energy efficiency. 2. **Supporting attention mechanisms in large-scale language models**: The accelerator is specifically optimized for deep learning neural networks based on attention mechanisms (such as Transformer models), especially large language models like ChatGPT. It addresses the bottleneck issues of existing hardware when handling high-dimensional matrix operations. 3. **Overcoming the limitations of traditional computing architectures**: By integrating optoelectronic computing on a monolithic chip, it avoids the power, area, and integration overhead issues of traditional computing architectures. This approach not only increases computational speed but also reduces energy consumption. In summary, the goal of this paper is to develop a novel optoelectronic linear algebra accelerator to support efficient computation in large-scale language models and other emerging applications.