Abstract
The number of input factors affects the prediction accuracy of a model. Factor screening plays an important role as the starting point for data input. The aim of this study is to explore the influence of different factor screening methods on the prediction results. Taking the 2014 landslide inventory of Jingdong County as an example, a landslide database was constructed based on 136 landslide events and 11 selected factors, which were randomly divided into a training dataset and a test dataset according to a ratio of 7:3. Four factor screening methods, namely, the information gain ratio (IGR), GeoDetector, Pearson correlation coefficient and multicollinearity test (MT), were selected to screen the factors. A random forest (RF) model was then used in combination with each factor set for landslide susceptibility mapping (LSM). Finally, accuracy validation was performed using confusion matrices and ROC curves. The results show that factor screening is beneficial in improving the accuracy of the resulting model compared to the original model. Second, the IGR_RF model had the highest AUC value (0.9334), which was higher than that of the MT_RF model without factor screening (0.9194), and the IGR_RF model predicted the most landslides in the very high susceptibility zone (51.22%), indicating the good prediction performance of the IGR_RF model. Finally, the factor weighting analysis revealed that NDVI, elevation and aspect had the greatest influence on landslides in Jingdong County and that curvature had the least influence on landslides. This study can provide a reference for factor screening in LSM.
Funder
National Natural Science Foundation of China
Yunnan Fundamental Research Projects
'Revitalizing Yunnan Talents Support Program' project funding support
Reserve Talent Program for Young and Middle-aged Academic and Technical Leaders in Yunnan Province
Publisher
Public Library of Science (PLoS)
Reference77 articles.
1. Torrential rainfall-triggered shallow landslide characteristics and susceptibility assessment using ensemble data-driven models in the Dongjiang Reservoir Watershed, China;J Dou;Natural Hazards,2019
2. Department of Natural Resources of Yunnan Province. Department of Natural Resources of Yunnan Province on the issuance of the 2020 Yunnan Province geological hazard prevention and control program. Department of Natural Resources of Yunnan Province. 2020 Nov 3 [Cited 2023 Aprial 20]. http://dnr.yn.gov.cn/html/2020/dizhizaihaifangzhi_1103/31197.html.
3. Xinhua News Agency. Floods and landslides have affected 7,604 people in Taizhong Township, Jingdong County, Yunnan Province. Xinhua News Agency. 2008 Nov 4 [Cited 2023 June 23]. https://www.gov.cn/govweb/jrzg/2008-11/04/content_1139590.htm.
4. Fu R. 1 person died and more than 3,600 people were affected by landslides in Jingdong County, Yunnan Province (many pictures). CNR News. 2016 Sep 21 [Cited 2023 Aprial 20]. http://news.cnr.cn/native/gd/20160921/t20160921_523150634.shtml.
5. Machine learning methods for landslide susceptibility studies: A comparative overview of algorithm performance;A Merghadi;Earth-Science Reviews,2020