Abstract: Air temperature (Tair) is critical to modeling environmental processes (e.g., snow and glacier melt) in high-elevation areas of the Tibetan Plateau (TP). Because Tair observations are scarce in the western TP and at high elevations, many studies have estimated daily air temperature from MODIS land surface temperature (LST) and various reanalysis datasets. These estimates, however, are inadequate for high-resolution, long-term hydrological simulation or climate analysis because of heavy cloud cover, short time spans or low spatial resolution. To improve Tair estimation, this study develops a novel machine-learning method that uses a Gradient Boosting (GB) model to integrate observations from high-elevation stations with eight widely used air temperature reanalysis and assimilation datasets (NNRP-2, 20CRV2c, JRA-55, ERA-Interim, MERRA-2, CFSR, ERA5 and GLDAS2) downscaled with remote sensing-based temperature lapse rates (TLRs). The method is used to generate a new daily air temperature dataset at 1-km resolution for 1980–2014. Because TLRs derived from a limited number of stations may be unreliable, a new TLR estimation method first estimates spatially continuous monthly TLRs from MODIS LST; these MODIS-estimated TLRs are then used to downscale daily mean Tair from the eight reanalysis and assimilation datasets to 1-km resolution. The GB model then integrates the eight downscaled Tair datasets with five other auxiliary variables. The models are trained and validated using observations from 100 common stations (China Meteorological Administration stations) and 13 independent high-elevation stations (4 of them on glaciers). The results show that the proposed TLR estimation method reduces exceptional TLRs while maintaining acceptable downscaling accuracy. The downscaled Tair from JRA-55 is the most accurate of the eight downscaled datasets, followed by ERA-Interim, MERRA-2, CFSR and the others. The GB-integrated Tair outperforms the downscaled JRA-55 Tair, with a mean root-mean-squared deviation (RMSD) of 1.7 °C versus 2.0 °C over all stations and 1.9 °C versus 2.7 °C at high-elevation stations. Both the MODIS-estimated TLRs and the high-elevation training observations significantly improve the accuracy of the GB model at high-elevation stations. This study also provides a framework for integrating multiple reanalysis and assimilation temperature datasets with elevation correction in mountainous regions, and the framework is not restricted to the TP.
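The three computational ideas in the abstract (lapse-rate downscaling, gradient-boosted integration of several downscaled products, and RMSD scoring) can be illustrated with a minimal Python sketch. It is not the authors' implementation. The function names, the linear elevation-correction form, the use of elevation as an auxiliary feature, the lapse rate of -0.006 °C per metre and all numeric station values are assumptions made up for illustration, and the booster is a toy squared-error model built from one-split regression stumps rather than the GB configuration used in the study.

```python
from math import sqrt


def downscale(t_coarse, z_coarse, z_fine, tlr):
    """Shift a coarse-grid temperature (degC) to a finer-grid elevation.

    tlr is a lapse rate in degC per metre (negative when temperature
    falls with height); elevations are in metres. Assumed linear form.
    """
    return t_coarse + tlr * (z_fine - z_coarse)


def rmsd(estimates, observations):
    """Root-mean-squared deviation between paired values."""
    n = len(observations)
    return sqrt(sum((e - o) ** 2 for e, o in zip(estimates, observations)) / n)


def fit_stump(rows, residuals):
    """Return (feature, threshold, left_mean, right_mean) with least squared error."""
    best = None
    for j in range(len(rows[0])):
        values = sorted({row[j] for row in rows})
        for thr in values[:-1]:  # split as x <= thr versus x > thr
            left = [r for row, r in zip(rows, residuals) if row[j] <= thr]
            right = [r for row, r in zip(rows, residuals) if row[j] > thr]
            lm, rm = sum(left) / len(left), sum(right) / len(right)
            sse = (sum((r - lm) ** 2 for r in left)
                   + sum((r - rm) ** 2 for r in right))
            if best is None or sse < best[0]:
                best = (sse, j, thr, lm, rm)
    if best is None:  # every feature is constant: fall back to the mean
        mean = sum(residuals) / len(residuals)
        return (0, rows[0][0], mean, mean)
    return best[1:]


def fit_gb(rows, targets, n_rounds=50, learning_rate=0.1):
    """Toy gradient boosting for squared loss: each stump fits the residuals."""
    base = sum(targets) / len(targets)
    preds = [base] * len(targets)
    stumps = []
    for _ in range(n_rounds):
        residuals = [t - p for t, p in zip(targets, preds)]
        j, thr, lm, rm = fit_stump(rows, residuals)
        stumps.append((j, thr, lm, rm))
        preds = [p + learning_rate * (lm if row[j] <= thr else rm)
                 for p, row in zip(preds, rows)]
    return base, learning_rate, stumps


def predict_gb(model, row):
    """Sum the base value and the shrunken contribution of every stump."""
    base, learning_rate, stumps = model
    return base + sum(learning_rate * (lm if row[j] <= thr else rm)
                      for j, thr, lm, rm in stumps)


if __name__ == "__main__":
    # Made-up example: a coarse cell at 4500 m, a 1-km pixel at 5000 m.
    print(downscale(-2.0, 4500.0, 5000.0, -0.006))
    # Made-up stations: (downscaled product A, downscaled product B, elevation).
    rows = [(-4.1, -3.0, 4200.0), (-7.9, -6.5, 4800.0), (1.2, 2.5, 3600.0),
            (-11.0, -9.2, 5300.0), (3.9, 5.1, 3200.0), (-6.0, -4.4, 4500.0)]
    obs = [-3.6, -7.5, 1.8, -10.4, 4.6, -5.3]
    model = fit_gb(rows, obs)
    print(rmsd([predict_gb(model, r) for r in rows], obs))
```

In practice a library implementation such as scikit-learn's `GradientBoostingRegressor` would replace the hand-written booster, and accuracy would be assessed on independent stations rather than on the training data as in this demo.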

Creating 1-km long-term (1980–2014) daily average air temperatures over the Tibetan Plateau by integrating eight types of reanalysis and land data assimilation products downscaled with MODIS-estimated temperature lapse rates based on machine learning
