Predicting Solvatochromism of Chromophores in Proteins through QM/MM and Machine Learning

Amanda Arcidiacono,Edoardo Cignoni,Patrizia Mazzeo,Lorenzo Cupellini,Benedetta Mennucci
DOI: https://doi.org/10.26434/chemrxiv-2024-3zrx2
2024-01-29
Abstract:Solvatochromism occurs in both homogeneous solvents and more complex biological environments such as proteins. While in both cases the solvatochromic effects report on the surroundings of the chromophore, their interpretation in proteins becomes more complicated, not only because of structural effects induced by the protein pocket, but also because the protein environment is highly anisotropic. This is particularly evident for highly conjugated and flexible molecules such as carotenoids, whose excitation energy is strongly dependent on both the geometry and the electrostatics of the environment. Here we introduce a machine learning (ML) strategy trained on QM/MM calculations of geometrical and electrochromic contributions to carotenoids' excitation energies. We employ this strategy to compare the solvatochromism in protein and solvent environments. Despite the important specifities of the protein, ML models trained on solvents can faithfully predict excitation energies in the protein environment, demonstrating the robustness of the chosen descriptors.
Chemistry
What problem does this paper attempt to address?