Parsimonious statistical learning models for low-flow estimation
-
Published:2022-01-12
Issue:1
Volume:26
Page:129-148
-
ISSN:1607-7938
-
Container-title:Hydrology and Earth System Sciences
-
language:en
-
Short-container-title:Hydrol. Earth Syst. Sci.
Author:
Laimighofer JohannesORCID, Melcher Michael, Laaha GregorORCID
Abstract
Abstract. Statistical learning methods offer a promising approach for low-flow regionalization. We examine seven statistical learning models (Lasso, linear, and nonlinear-model-based boosting, sparse partial least squares, principal component regression, random forest, and support vector regression) for the prediction of winter and summer low flow based on a hydrologically diverse dataset of 260 catchments in Austria. In order to produce sparse models, we adapt the recursive feature elimination for variable preselection and propose using three different variable ranking methods (conditional forest, Lasso, and linear model-based boosting) for each of the prediction models. Results are evaluated for the low-flow characteristic Q95 (Pr(Q>Q95)=0.95) standardized by catchment area using a repeated nested cross-validation scheme. We found a generally high prediction accuracy for winter (RCV2 of 0.66 to 0.7) and summer (RCV2 of 0.83 to 0.86). The models perform similarly to or slightly better than a top-kriging model that constitutes the current benchmark for the study area. The best-performing models are support vector regression (winter) and nonlinear model-based boosting (summer), but linear models exhibit similar prediction accuracy. The use of variable preselection can significantly reduce the complexity of all the models with only a small loss of performance. The so-obtained learning models are more parsimonious and thus easier to interpret and more robust when predicting at ungauged sites. A direct comparison of linear and nonlinear models reveals that nonlinear processes can be sufficiently captured by linear learning models, so there is no need to use more complex models or to add nonlinear effects. When performing low-flow regionalization in a seasonal climate, the temporal stratification into summer and winter low flows was shown to increase the predictive performance of all learning models, offering an alternative to catchment grouping that is recommended otherwise.
Funder
Österreichischen Akademie der Wissenschaften
Publisher
Copernicus GmbH
Subject
General Earth and Planetary Sciences,General Engineering,General Environmental Science
Reference77 articles.
1. Abrahart, R. J., Anctil, F., Coulibaly, P., Dawson, C. W., Mount, N. J., See,
L. M., Shamseldin, A. Y., Solomatine, D. P., Toth, E., and Wilby, R. L.: Two
decades of anarchy? Emerging themes and outstanding challenges for neural
network river forecasting, Prog. Phys. Geog., 36, 480–513,
https://doi.org/10.1177/0309133312444943, 2012. a 2. Ambroise, C. and McLachlan, G. J.: Selection bias in gene extraction on the
basis of microarray gene-expression data, P. Natl. Acad.
Sci. USA, 99, 6562–6566, https://doi.org/10.1073/pnas.102102699, 2002. a, b 3. Beguería, S. and Vicente-Serrano, S. M.: SPEI: Calculation of the Standardised
Precipitation-Evapotranspiration Index, r package version
1.7, available at:
https://CRAN.R-project.org/package=SPEI (last access: 15 Septepmber 2021), 2017. a 4. Blöschl, G., Sivapalan, M., Wagener, T., Savenije, H., and
Viglione, A.: Runoff prediction in ungauged basins: synthesis across
processes, places and scales, edited by: Blöschl, G., Wagener, T., and Savenije, H. Cambridge University Press, https://doi.org/10.1017/CBO9781139235761, 2013. a 5. Breiman, L.: Random forests, Mach. Learn., 45, 5–32,
https://doi.org/10.1023/A:1010933404324, 2001. a
Cited by
9 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|