Toru Aonishi,Tatsuya Nagasawa,Toshiyuki Koizumi,Mastiyage Don Sudeera Hasaranga Gunathilaka,Kazushi Mimura,Masato Okada,Satoshi Kako,Yoshihisa Yamamoto
Abstract:In recent years, quantum Ising machines have drawn a lot of attention, but due to physical implementation constraints, it has been difficult to achieve dense coupling, such as full coupling with sufficient spins to handle practical large-scale applications. Consequently, classically computable equations have been derived from quantum master equations for these quantum Ising machines. Parallel implementations of these algorithms using FPGAs have been used to rapidly find solutions to these problems on a scale that is difficult to achieve in physical systems. We have developed an FPGA implemented cyber coherent Ising machine (cyber CIM) that is much more versatile than previous implementations using FPGAs. Our architecture is versatile since it can be applied to the open-loop CIM, which was proposed when CIM research began, to the closed-loop CIM, which has been used recently, as well as to Jacobi successive over-relaxation method. By modifying the sequence control code for the calculation control module, other algorithms such as Simulated Bifurcation (SB) can also be implemented. Earlier research on large-scale FPGA implementations of SB and CIM used binary or ternary discrete values for connections, whereas the cyber CIM used FP32 values. Also, the cyber CIM utilized Zeeman terms that were represented as FP32, which were not present in other large-scale FPGA systems. Our implementation with continuous interaction realizes N=4096 on a single FPGA, comparable to the single-FPGA implementation of SB with binary interactions, with N=4096. The cyber CIM enables applications such as CDMA multi-user detector and L0 compressed sensing which were not possible with earlier FPGA systems, while enabling superior calculation speeds, more than ten times faster than a GPU implementation. The calculation speed can be further improved by increasing parallelism, such as through clustering.
Hardware Architecture,Disordered Systems and Neural Networks,Distributed, Parallel, and Cluster Computing,Emerging Technologies,Quantum Physics
What problem does this paper attempt to address?
The main problem that this paper attempts to solve is the technical limitations encountered by existing Quantum Ising Machines in large - scale applications, especially the difficulty in achieving high - density coupling (such as full coupling) and handling a sufficient number of spins required for practical large - scale applications. Specifically, the paper points out that due to limitations in physical implementation, such as the difficulty of physical wiring among a large number of spins, it is difficult for Quantum Ising Machines to achieve a high - density interaction network. In addition, although equations that can be calculated classically have been derived from the quantum master equation and these algorithms have been implemented in parallel using Field - Programmable Gate Arrays (FPGAs), the existing systems still have the following limitations when dealing with large - scale problems:
1. **The connection values are low - bit discrete values, not continuous real values**: This limits the system's ability to handle real - world problems.
2. **Unable to configure the Zeeman term**: The Zeeman term plays a key role in optimization problems in signal processing applications, such as compressed sensing, Code Division Multiple Access (CDMA) multi - user detectors, etc.
3. **Can only handle one of the QUBO or Ising Hamiltonian optimization problems**: Unable to handle both types of problems simultaneously.
4. **The implemented algorithm cannot be changed**: Each time the model changes, the architecture needs to be redesigned.
To solve these problems, the author has developed a highly flexible FPGA - implemented Cyber Coherent Ising Machine (network CIM), which can support three different algorithms: open - loop CIM, closed - loop CIM, and Jacobi Successive Over - Relaxation (Jacobi SOR). This architecture allows these algorithms to be executed on the same module, and by modifying the sequence control code in the calculation control module, other algorithms can also be executed. In principle, Simulated Bifurcation (SB) can also be executed. In addition, the coupling matrix, Zeeman term, and OPO amplitude are all represented in single - precision floating - point format (FP32), thereby supporting high - precision operations for local field and time evolution (TE) calculations.
Through this highly flexible architecture, the paper has achieved effective handling of large - scale problems, especially achieving a system scale of \(N = 4096\) on a single FPGA, which is equivalent to the SB binary interaction system implemented on a single FPGA. In addition, this system can also exhibit superior computational speed in applications such as CDMA multi - user detectors and L0 - norm - regularized Compressed Sensing (L0RBCS), more than ten times faster than GPU implementations.