Multistability manipulation by reinforcement learning algorithm inside mode-locked fiber laser

Alexey Kokhanovskiy, Evgeny Kuprikov, Kirill Serebrennikov, Aram Mkrtchyan, Ayvaz Davletkhanov, Alexey Bunkov, Dmitry Krasnikov, Mikhail Shashkov, Albert Nasibulin, Yuriy Gladush
DOI: https://doi.org/10.1515/nanoph-2023-0792
IF: 7.5
2024-04-16
Nanophotonics
Abstract: Fiber mode-locked lasers are nonlinear optical systems that provide ultrashort pulses at high repetition rates. However, adjusting the cavity parameters is often a challenging task due to the intrinsic multistability of a laser system. Depending on the adjustment of the cavity parameters, the optical output may vary significantly, including Q-switching, single and multipulse, and harmonic mode-locked regimes. In this study, we demonstrate an experimental implementation of the Soft Actor–Critic algorithm for generating a harmonic mode-locked regime inside a state-of-the-art fiber laser with an ion-gated nanotube saturable absorber. The algorithm employs nontrivial strategies to achieve a guaranteed harmonic mode-locked regime with the highest order by effectively managing the pumping power of a laser system and the nonlinear transmission of a nanotube absorber. Our results demonstrate a robust and feasible machine-learning–based approach toward an automatic system for adjusting nonlinear optical systems with the presence of multistability phenomena.
Optics; Physics, Applied; Materials Science, Multidisciplinary; Nanoscience & Nanotechnology
What problem does this paper attempt to address?
The paper addresses the problem of controlling multistability in mode-locked fiber lasers with reinforcement learning, in particular the automatic adjustment of nonlinear optical systems that exhibit multistability. Specifically, the researchers demonstrate how to use the Soft Actor-Critic (SAC) algorithm to generate harmonic mode-locked regimes in a state-of-the-art fiber laser. The key is to reach stable high-order harmonic mode-locking by managing the pump power of the laser and the nonlinear transmission of the nanotube saturable absorber.

### Detailed Interpretation:

1. **Background and Problem**:
   - **Multistability**: Mode-locked fiber lasers are nonlinear optical systems that provide ultrashort pulses at high repetition rates. However, adjusting the cavity parameters to obtain a specific output is challenging because the laser system is inherently multistable. Different parameter settings can lead to significantly different optical outputs, including Q-switching, single-pulse, multipulse, and harmonic mode-locked regimes.
   - **Limitations of Existing Methods**: Manual adjustment is time-consuming and, because of multistability, rarely gives stable and repeatable results. In addition, adjustment methods based on polarization controllers are sensitive to environmental factors and are not suitable for commercial applications.

2. **Solution**:
   - **Reinforcement Learning Algorithm**: The researchers propose a method based on the Soft Actor-Critic (SAC) algorithm that generates high-order harmonic mode-locked regimes by dynamically adjusting the pump power of the laser and the modulation depth of the saturable absorber.
   - **Experimental Platform**: The experiment uses an all-polarization-maintaining fiber laser with a ring cavity. A single-walled carbon nanotube film serves as the saturable absorber, and its nonlinear absorption is tuned by electrochemical gating.
   - **Algorithm Implementation**: The SAC algorithm treats the fiber laser as a black-box system and adjusts the pump current and the gate voltage to reach the desired harmonic mode-locked regime. A reward function scores the current state, and the algorithm continuously updates its policy to maximize the reward.

3. **Experimental Results**:
   - **Training Process**: After 45 hours of training, the SAC algorithm learned how to generate high-order harmonic mode-locked regimes under different conditions. There were some failures in the early stages of training, but the success rate improved as training progressed.
   - **Performance Evaluation**: The experimental results show that the SAC algorithm can find a parameter adjustment path through the multistable landscape of the laser system, ultimately achieving stable generation of the 11th-order harmonic mode-locked regime.

### Conclusion:

By applying reinforcement learning, specifically the Soft Actor-Critic (SAC) algorithm, the paper addresses the adjustment challenges that multistability poses in mode-locked fiber lasers. The method improves the efficiency and stability of adjustment and offers a route toward automatic tuning of nonlinear optical systems.
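The black-box control loop described under Algorithm Implementation can be illustrated with a small Python sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the `ToyLaserEnv` class, its thresholds, the normalized units, the hysteresis rule, and the `reward` function are hypothetical stand-ins for the real laser and the authors' reward. The random policy is only a placeholder for the SAC actor. What the sketch does show is why the output depends on the path taken through the parameters, not only on their final values.

```python
import random
from dataclasses import dataclass
from typing import Callable, Tuple

Action = Tuple[float, float]            # (change in pump current, change in gate voltage)
Observation = Tuple[float, float, int]  # (pump current, gate voltage, harmonic order)


def reward(order: int, max_order: int) -> float:
    """Hypothetical reward: fraction of the target harmonic order reached."""
    return order / max_order


@dataclass
class ToyLaserEnv:
    """Toy multistable system; NOT the authors' laser or model."""
    max_order: int = 11    # highest order mentioned in the summary above
    current: float = 0.0   # normalized pump current in [0, 1] (assumed units)
    voltage: float = 0.0   # normalized gate voltage in [0, 1] (assumed units)
    order: int = 0         # harmonic order; 0 means no mode-locking

    def reset(self) -> Observation:
        self.current, self.voltage, self.order = 0.0, 0.0, 0
        return (self.current, self.voltage, self.order)

    def step(self, action: Action) -> Tuple[Observation, float]:
        d_current, d_voltage = action
        self.current = min(1.0, max(0.0, self.current + d_current))
        self.voltage = min(1.0, max(0.0, self.voltage + d_voltage))
        # Order the pump could support; the epsilon absorbs float rounding.
        supported = int(self.current * self.max_order + 1e-9)
        if self.voltage < 0.3:
            self.order = 0           # invented threshold: mode-locking is lost
        elif supported > self.order:
            self.order = supported   # more pump power adds pulses
        elif supported < self.order - 2:
            self.order = supported   # pulses drop only after a large pump decrease
        # Otherwise the order is kept: the state depends on history (hysteresis).
        obs = (self.current, self.voltage, self.order)
        return obs, reward(self.order, self.max_order)


def random_policy(obs: Observation) -> Action:
    """Placeholder for the SAC actor: small random parameter changes."""
    return (random.uniform(-0.1, 0.1), random.uniform(-0.1, 0.1))


def run_episode(env: ToyLaserEnv,
                policy: Callable[[Observation], Action],
                steps: int) -> float:
    """Run one episode and return the summed reward."""
    obs = env.reset()
    total = 0.0
    for _ in range(steps):
        obs, r = env.step(policy(obs))
        total += r
    return total
```

Calling `run_episode(ToyLaserEnv(), random_policy, steps=50)` gives a random-search baseline. In a real setup, a trained SAC policy would replace `random_policy`, and `step` would drive the hardware and read the measured output instead of evaluating a toy rule.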