Consistency and Uncertainty of Remote Sensing-Based Approaches for Regional Yield Gap Estimation: A Comprehensive Assessment of Process-Based and Data-Driven Models

Jingwen Wang,Jinsong Chen,Jiahua Zhang,Shanshan Yang,Sha Zhang,Yun Bai,Ruize Xu
DOI: https://doi.org/10.1016/j.fcr.2023.109088
IF: 5.8
2023-01-01
Field Crops Research
Abstract:Quantifying the gap between actual and exploitable yields (Ya and Ye, respectively) at the regional scale is critical for understanding the scope and hotspots for future yield improvements. Remote sensing can provide geospatially continuous observations of crop growth, which is perceived as an effective means to overcome the scaling issue commonly suffered by regional yield gap (YG) studies. However, the consistency and uncertainties of different modeling paradigms (i.e., model-driven or data-driven paradigm) remain to be investigated when integrating remote sensing data into YG analysis. This study provides a comprehensive assessment of processbased and machine learning approaches for estimating rice YG over Northeast China. Three machine learning models are examined for predicting Ya, the results of which reveal a prevalent deviation in the estimation of extreme yields. A rotation scheme is thereby proposed to correct this error, and the outperforming random forest model is selected to be compared with a biophysical process-based model (BEPS). County-scale validation reveals good accuracy of both the process-based and machine learning models in estimating Ya, with RMSEs of 1355.3 kg ha  1 (17.3%) and 1205.0 kg  1 (15.4%), respectively. The process-based model reproduces well the probability distribution of Ya in peaks and tails, which guarantees the reasonable estimation of Ye, given that Ye is derived from the high percentile of Ya. In contrast, the machine learning model trained over county-scale yield statistics underestimates the heterogeneity of Ya, leading to significant shrinkage of Ye and thus YG. Degraded model consistency is found from the estimation of Ya, Ye to YG, yet the discrepancy presents limited impacts on identifying the regional priorities for closing YG. The remote sensing-based estimations under the "top-down" framework are further evaluated against the results of the Global Yield Gap Atlas (GYGA), which follows a "bottom-up" protocol. Multiscale comparisons (i.e., local, subregional and regional) with the GYGA and other reference results suggest that the pixelwise modeling of crop-growing processes constitutes a more reliable approach for estimating rice YG over Northeast China, although uncertainty exists in both remote sensing approaches in the local YG estimation. The need to train machine learning models using representative datasets at the field scale is strongly emphasized. This study contributes to the first comprehensive understanding of different modeling paradigms for YG analysis and has important implications for promoting model development and ensuring food security.
What problem does this paper attempt to address?