Identifying the spatial pattern and driving factors of nitrate in groundwater using a novel framework of interpretable stacking ensemble learning.

Driving factors Ensemble learning Groundwater Interpretable machine learning Spatial distribution Water quality

Journal

Environmental geochemistry and health
ISSN: 1573-2983
Titre abrégé: Environ Geochem Health
Pays: Netherlands
ID NLM: 8903118

Informations de publication

Date de publication:
29 Oct 2024
Historique:
received: 19 02 2024
accepted: 27 08 2024
medline: 29 10 2024
pubmed: 29 10 2024
entrez: 29 10 2024
Statut: epublish

Résumé

Groundwater nitrate contamination poses a potential threat to human health and environmental safety globally. This study proposes an interpretable stacking ensemble learning (SEL) framework for enhancing and interpreting groundwater nitrate spatial predictions by integrating the two-level heterogeneous SEL model and SHapley Additive exPlanations (SHAP). In the SEL model, five commonly used machine learning models were utilized as base models (gradient boosting decision tree, extreme gradient boosting, random forest, extremely randomized trees, and k-nearest neighbor), whose outputs were taken as input data for the meta-model. When applied to the agricultural intensive area, the Eden Valley in the UK, the SEL model outperformed the individual models in predictive performance and generalization ability. It reveals a mean groundwater nitrate level of 2.22 mg/L-N, with 2.46% of sandstone aquifers exceeding the drinking standard of 11.3 mg/L-N. Alarmingly, 8.74% of areas with high groundwater nitrate remain outside the designated nitrate vulnerable zones. Moreover, SHAP identified that transmissivity, baseflow index, hydraulic conductivity, the percentage of arable land, and the C:N ratio in the soil were the top five key driving factors of groundwater nitrate. With nitrate threatening groundwater globally, this study presents a high-accuracy, interpretable, and flexible modeling framework that enhances our understanding of the mechanisms behind groundwater nitrate contamination. It implies that the interpretable SEL framework has great promise for providing valuable evidence for environmental management, water resource protection, and sustainable development, particularly in the data-scarce area.

Identifiants

pubmed: 39470928
doi: 10.1007/s10653-024-02201-1
pii: 10.1007/s10653-024-02201-1
doi:

Substances chimiques

Nitrates 0
Water Pollutants, Chemical 0

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

482

Subventions

Organisme : National Natural Science Foundation of China
ID : 51779030
Organisme : National Natural Science Foundation of China
ID : 42277189
Organisme : National Natural Science Foundation of China
ID : 42277189

Informations de copyright

© 2024. Dalian University of Technology, the British Geological Survey (UKRI), Yuesuo Yang, Yuanyin Li, Zhongguo Li.

Références

Sci Total Environ. 2022 Feb 10;807(Pt 3):151065
pubmed: 34673076
Environ Res. 2022 Mar;204(Pt C):112322
pubmed: 34740625
Sci Total Environ. 2023 Dec 20;905:166863
pubmed: 37690767
Water Res. 2022 Mar 1;211:118078
pubmed: 35066260
Environ Geochem Health. 2024 Feb 17;46(3):80
pubmed: 38367130
Ground Water. 2019 Mar;57(2):245-258
pubmed: 29896911
J Contam Hydrol. 2021 Dec;243:103914
pubmed: 34798506
Sci Total Environ. 2015 Dec 1;536:481-488
pubmed: 26232757
Environ Res. 2022 Jul;210:112988
pubmed: 35217009
Water Res. 2008 Oct;42(16):4215-32
pubmed: 18721996
Chemosphere. 2022 Aug;300:134514
pubmed: 35398076
Environ Geochem Health. 2013 Oct;35(5):667-81
pubmed: 23801341
Nat Mach Intell. 2020 Jan;2(1):56-67
pubmed: 32607472
Sci Total Environ. 2019 Jun 10;668:1317-1327
pubmed: 31018471
Mar Pollut Bull. 2023 Dec;197:115669
pubmed: 37922752
Environ Pollut. 2021 Oct 15;287:117598
pubmed: 34147777
Geoderma. 2019 Dec 1;355:113912
pubmed: 31798185
Ambio. 2020 Feb;49(2):504-516
pubmed: 31115872
J Environ Manage. 2023 Feb 15;328:116959
pubmed: 36473348
Environ Sci Technol. 2021 Feb 16;55(4):2265-2275
pubmed: 33507730
Environ Sci Technol. 2014 May 20;48(10):5643-51
pubmed: 24779475
Environ Geochem Health. 2023 Dec 26;46(1):14
pubmed: 38147177
Environ Pollut. 2020 Apr;259:113711
pubmed: 31891909
Sci Total Environ. 2017 Feb 1;579:133-148
pubmed: 27847181
Sci Total Environ. 2021 Apr 15;765:142760
pubmed: 33131841

Auteurs

Xuan Li (X)

School of Hydraulic Engineering, Dalian University of Technology, Dalian, 116024, China.
British Geological Survey, Keyworth, Nottingham, NG12 5GG, UK.

Guohua Liang (G)

School of Hydraulic Engineering, Dalian University of Technology, Dalian, 116024, China.

Lei Wang (L)

British Geological Survey, Keyworth, Nottingham, NG12 5GG, UK. lelei@bgs.ac.uk.

Yuesuo Yang (Y)

Key Laboratory of Groundwater Resources and Environment, Ministry of Education, Jilin University, Changchun, 130021, China.

Yuanyin Li (Y)

British Geological Survey, Keyworth, Nottingham, NG12 5GG, UK.
Department of Geography, Durham University, Durham, DH1 3LE, UK.

Zhongguo Li (Z)

Liaoning Water Affairs Service Center, Shenyang, 110003, China.

Bin He (B)

School of Hydraulic Engineering, Dalian University of Technology, Dalian, 116024, China.

Guoli Wang (G)

School of Hydraulic Engineering, Dalian University of Technology, Dalian, 116024, China.

Articles similaires

Exploring blood-brain barrier passage using atomic weighted vector and machine learning.

Yoan Martínez-López, Paulina Phoobane, Yanaima Jauriga et al.
1.00
Blood-Brain Barrier Machine Learning Humans Support Vector Machine Software

Understanding the role of machine learning in predicting progression of osteoarthritis.

Simone Castagno, Benjamin Gompels, Estelle Strangmark et al.
1.00
Humans Disease Progression Machine Learning Osteoarthritis
India Carbon Sequestration Environmental Monitoring Carbon Biomass
Rivers Turkey Biodiversity Environmental Monitoring Animals

Classifications MeSH