Effect of Increasing the Descriptor Set on Machine Learning Prediction of Small Molecule-Based Organic Solar Cells

Zhi-Wen Zhao,Marcos del Cueto,Yun Geng,Alessandro Troisi
DOI: https://doi.org/10.1021/acs.chemmater.0c02325
IF: 10.508
2020-08-25
Chemistry of Materials
Abstract:In this work, we analyzed a data set formed by 566 donor/acceptor pairs, which are part of organic solar cells recently reported. We explored the effect of different descriptors in machine learning (ML) models to predict the power conversion efficiency (PCE) of these cells. The investigated descriptors are classified into two main categories: structural (topology properties) and physical descriptors (energy levels, molecular size, light absorption, and mixing properties). In line with previous observations, ML predictions are more accurate when using both structural and physical descriptors, as opposed to only using one of them. We observed that ML predictions are also improved by using larger and more varied data sets. Importantly, the structural descriptors are the ones contributing the most to the ML models. Some physical properties are highly correlated with PCE, although they do not improve notably the ML prediction accuracy as they carry information already encoded in the structural descriptors. Given that various descriptors have significantly different computational costs, the analysis presented here can be used as a guide to construct ML models that maximize predictive power and minimize computational costs for screening large sets of candidates.The Supporting Information is available free of charge at <a class="ext-link" href="/doi/10.1021/acs.chemmater.0c02325?goto=supporting-info">https://pubs.acs.org/doi/10.1021/acs.chemmater.0c02325</a>.Computational methods, ML results in detail, combinations of descriptors, and the results about predictions of excited-state properties and miscibility (<a class="ext-link" href="/doi/suppl/10.1021/acs.chemmater.0c02325/suppl_file/cm0c02325_si_001.pdf">PDF</a>)(<a class="ext-link" href="/doi/suppl/10.1021/acs.chemmater.0c02325/suppl_file/cm0c02325_si_001.pdf">PDF</a>)This article has not yet been cited by other publications.
materials science, multidisciplinary,chemistry, physical
What problem does this paper attempt to address?