Predictive machine learning for gully susceptibility modeling with geo-environmental covariates: main drivers, model performance, and computational efficiency

Author:

Phinzi KwaneleORCID,Szabó SzilárdORCID

Abstract

AbstractCurrently, machine learning (ML) based gully susceptibility prediction is a rapidly expanding research area. However, when assessing the predictive performance of ML models, previous research frequently overlooked the critical component of computational efficiency in favor of accuracy. This study aimed to evaluate and compare the predictive performance of six commonly used algorithms in gully susceptibility modeling. Artificial neural networks (ANN), partial least squares, regularized discriminant analysis, random forest (RF), stochastic gradient boosting, and support vector machine (SVM) were applied. The comparison was conducted under three scenarios of input feature set sizes: small (six features), medium (twelve features), and large (sixteen features). Results indicated that SVM was the most efficient algorithm with a medium-sized feature set, outperforming other algorithms across all overall accuracy (OA) metrics (OA = 0.898, F1-score = 0.897) and required a relatively short computation time (< 1 min). Conversely, ensemble-based algorithms, mainly RF, required a larger feature set to reach optimal accuracy and were computationally demanding, taking about 15 min to compute. ANN also showed sensitivity to the number of input features, but unlike RF, its accuracy consistently decreased with larger feature sets. Among geo-environmental covariates, NDVI, followed by elevation, TWI, population density, SPI, and LULC, were critical for gully susceptibility modeling. Therefore, using SVM and involving these covariates in gully susceptibility modeling in similar environmental settings is strongly suggested to ensure higher accuracy and minimal computation time.

Publisher

Springer Science and Business Media LLC

Reference129 articles.

1. Abdi H (2003) Partial least square regression (PLS regression). Encycl Res Methods Soc Sci 6:792–795

2. Abdi AM (2020) Land cover and land use classification performance of machine learning algorithms in a boreal landscape using Sentinel-2 data. Gisci Remote Sens 57:1–20. https://doi.org/10.1080/15481603.2019.1650447

3. Achten WMJ, Dondeyne S, Mugogo S et al (2008) Gully erosion in south eastern Tanzania: spatial distribution and topographic thresholds. Zeitschrift Fur Geomorphologie 52:225–236

4. Alkarkhi AFM, Alqaraghuli WAA (2018) Discriminant analysis and classification. In: Alkarkhi AFM, Alqaraghuli WAA (eds) Easy statistics for food science with R. Academic Press, London, p 213

5. Amare S, Langendoen E, Keesstra S et al (2021) Susceptibility to gully erosion: applying random forest (RF) and frequency ratio (FR) approaches to a small catchment in Ethiopia. Water (basel) 13:216

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3