Active Learning Exploration of Transition Metal Complexes to Discover Method-Insensitive and Synthetically Accessible Chromophores

Chenru Duan, Aditya Nandy, Gianmarco Terrones, David W. Kastner, Heather J. Kulik
DOI: https://doi.org/10.48550/arXiv.2208.05444
2022-09-16
Abstract: Transition metal chromophores with earth-abundant transition metals are an important design target for their applications in lighting and non-toxic bioimaging, but their design is challenged by the scarcity of complexes that have both optimal target absorption energies in the visible region and well-defined ground states. Machine learning (ML) accelerated discovery could overcome such challenges by enabling screening of a larger space, but it is limited by the fidelity of the data used in ML model training, which typically comes from a single approximate density functional. To address this limitation, we search for consensus in predictions among 23 density functional approximations across multiple rungs of Jacob's ladder. To accelerate the discovery of complexes with absorption energies in the visible region while minimizing multireference (MR) character, we use 2D efficient global optimization to sample candidate low-spin chromophores from multi-million complex spaces. Despite the scarcity (i.e., approx. 0.01%) of potential chromophores in this large chemical space, we identify candidates with high likelihood (i.e., > 10%) of computational validation as the ML models improve during active learning, representing a 1,000-fold acceleration in discovery. Absorption spectra of promising chromophores from time-dependent density functional theory verify that 2/3 of candidates have the desired excited state properties. The observation that constituent ligands from our leads have demonstrated interesting optical properties in the literature exemplifies the effectiveness of our construction of a realistic design space and active learning approach.
Chemical Physics, Materials Science, Machine Learning, Biomolecules
What problem does this paper attempt to address?
This paper aims to discover spectroscopically viable chromophores among transition-metal complexes. Specifically, the researchers look for complexes with the following characteristics:

1. **Absorption energy in the visible light region**: The target absorption energy lies between 1.5 eV (approximately 825 nm) and 3.5 eV (approximately 350 nm), which gives these compounds potential applications in lighting and non-toxic bioimaging.
2. **Low-spin ground state**: To favor metal-to-ligand charge-transfer (MLCT) states and reduce the influence of metal-centered (MC) states, the researchers seek complexes with a low-spin (LS) ground state.
3. **Weak multireference (MR) character**: Avoiding strong multireference character reduces the density functional theory (DFT) prediction errors caused by static correlation.
4. **Synthetic accessibility**: The designed complexes should be preparable by known synthetic methods, which is crucial for practical applications.

To achieve these goals, the researchers use machine learning (ML) accelerated efficient global optimization (EGO) and seek consensus among 23 density functional approximations (DFAs), which reduces the bias introduced by choosing a single DFA. With this approach, they efficiently identify promising candidates in a design space containing millions of possible complexes, a 1,000-fold acceleration compared to random sampling. Finally, the absorption spectra of promising candidates are computed with time-dependent density functional theory (TDDFT), which verifies that about two-thirds of them have the desired excited-state properties.
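Two of the numbers quoted above can be checked with simple arithmetic: the wavelengths that correspond to the 1.5 to 3.5 eV target window, and the 1,000-fold acceleration implied by moving from a hit rate of roughly 0.01% to more than 10%. The following is a minimal sketch, not code from the paper; the function names (`ev_to_nm`, `in_target_window`, `enrichment`) are hypothetical and chosen only for illustration.

```python
# Illustrative helpers only; none of these names come from the paper.

# Product of Planck's constant and the speed of light, in eV*nm.
HC_EV_NM = 1239.841984


def ev_to_nm(energy_ev: float) -> float:
    """Convert a photon energy in eV to a wavelength in nm."""
    if energy_ev <= 0:
        raise ValueError("energy must be positive")
    return HC_EV_NM / energy_ev


def in_target_window(energy_ev: float,
                     low_ev: float = 1.5,
                     high_ev: float = 3.5) -> bool:
    """Return True if an absorption energy falls in the target window."""
    return low_ev <= energy_ev <= high_ev


def enrichment(hit_rate_guided: float, hit_rate_random: float) -> float:
    """Ratio of the guided hit rate to the random-sampling hit rate."""
    return hit_rate_guided / hit_rate_random


# Window edges expressed as wavelengths (rounded to 0.1 nm).
print(round(ev_to_nm(1.5), 1))  # 826.6
print(round(ev_to_nm(3.5), 1))  # 354.2

# Hit rates from the abstract: about 0.01% at random vs. 10% when guided.
print(round(enrichment(0.10, 0.0001)))  # 1000
```

The converted values show that the wavelengths quoted for the window edges are rounded figures, and the ratio reproduces the stated 1,000-fold acceleration when the guided hit rate is taken at its 10% lower bound.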