Robust methodology for PEC performance analysis of photoanodes using machine learning and analytical data

Moeko Tajima,Yuya Nagai,Siyan Chen,Zhenhua Pan,Kenji Katayama
DOI: https://doi.org/10.1039/d4an00439f
2024-06-27
The Analyst
Abstract:Machine learning (ML) is increasingly applied across various fields, including chemistry, for molecular design and optimizing reaction parameters. Yet, applying ML to experimental data is challenging due to the limited number of synthesized samples, which restricts its broader application in device development. In energy-harvesting, photoanodes are crucial for solar-driven water splitting, generating hydrogen and oxygen. We explored electrodes like hematite and bismuth vanadate for photocatalytic uses, noting varied photoelectrochemical performances despite similar preparations. To understand this variability, we applied a data-driven ML approach, predicting photocurrent values and identifying key performance influencers even with limited experimental data in the research development of inorganic device. Traditional ML methods used multiple algorithms, obscuring the influence of specific factors. We introduced a novel methodology, incorporating clustering to manage multicollinearity from correlated analytical data and Shapley analysis for clear interpretation of contributions to performance prediction. This method was validated on hematite and bismuth vanadate, showing superior predictability and factor identification, then extended to tungsten oxide and bismuth vanadate heterojunction photoanodes. Despite their complexity, our approach achieved determination coefficients (R2) with a prediction accuracy over 0.85, successfully pinpointing performance-determining factors, demonstrating the robustness of the new scheme in advancing photodevice research.
chemistry, analytical
What problem does this paper attempt to address?