Learning Landscape Features from Streamflow with Autoencoders

Alberto Bassi,Marvin Höge,Antonietta Mira,Fabrizio Fenicia,Carlo Albert
DOI: https://doi.org/10.5194/hess-2024-47
IF: 6.3
2024-02-21
Hydrology and Earth System Sciences
Abstract:Understanding the number and types of signatures that best describe streamflow time series is a crucial objective in hydrological science, serving applications such as catchment classification, hydrological model development and calibration. With the main objective of learning a minimal number of streamflow features, we employ an explicit noise conditional autoencoder (ENCA), which, together with meteorological forcings, allows for an accurate reconstruction of the whole streamflow time series. The ENCA architecture feeds the meteorological forcing to the decoder in order to incentivize the encoder to only learn features that are related to landscape properties. By isolating the effect of meteorology, these features can thus be interpreted as landscape fingerprints. The optimal number of features is found by means of an intrinsic dimension estimator. We train our model on the hydro-meteorologic time series data of 568 catchments of the continental United States from the CAMELS dataset. We compare the reconstruction accuracy with state-of-the-art models that take as input a subset of static catchment attributes (both climate and landscape attributes) along with the meteorological forcing variables. Our results suggest that available static catchment attributes compiled by experts account for almost all the relevant information about the rainfall-runoff relationship. Yet, these catchment attributes can be summarized by only two relevant learnt features (or signatures), while a third one is needed for about a dozen difficult catchments in the central US, mainly characterized by high aridity index and intermittent flow. The principal components of the learnt features strongly correlate with the baseflow index and aridity indicators, which is consistent with the idea that these indicators capture the variability of catchment hydrological response and relate to needed model complexity. The correlation analysis further indicates that soil-related and vegetation attributes are of high importance. Finally, in the attempt to interpret the learnt catchment features, we relate them to typical hydrological model components, with specific reference to the parameters of the GR4J model and their function on the hydrograph.
geosciences, multidisciplinary,water resources
What problem does this paper attempt to address?