Accurate Band Gap Prediction Based on an Interpretable Δ-machine Learning

Lingyao Zhang,Tianhao Su,Musen Li,Fanhao Jia,Shuobo Hu,Peihong Zhang,Wei Ren
DOI: https://doi.org/10.1016/j.mtcomm.2022.104630
IF: 3.8
2022-01-01
Materials Today Communications
Abstract:Most materials science datasets are not so large that the accuracy of machine learning (ML) models is relatively limited if only simple features are used. Here, we constructed an interpretable delta-machine learning (delta-ML) model to connect the hybrid functional HSE bandgap (E-g(HSE)) with the PBE functional bandgap (E-g(PBE)). The former can reproduce the band gap comparable with experiments, but the computational cost is much more challenging. The training is based on our high-throughput calculations on a set of two-dimensional semiconductors. Four complex descriptors, all based on the EPBE g are constructed using the sure independence screening and sparsifying operator (SISSO) algorithm. Using these descriptors, the delta-ML can accurately predict the E-g(HSE) of test set with a determination coefficient (R2) of 0.96. The error satisfies a normal distribution with a mean of zero. We provide a direct functional relationship between input descriptors and target properties. We find that E-g(HSE ) and the 5/6th power of E(g )(PBE )show a significant linear correlation, which may guide rapid prediction of E-g(HSE ) from (EPBE) (g) for materials with a E-HSE (g) greater than 0.22 eV. We also discussed the correlation between the atomic radius and the E- g(HSE) . Our work will provide an effective and interpretable model to construct the optimal physical descriptors for ML prediction on bandgaps in screening massive new 2D materials research.
What problem does this paper attempt to address?