Supervised machine learning improves general applicability of eDNA metabarcoding for reservoir health monitoring.
Index of biotic integrity
Physicochemical assessment
Supervised machine learning
Three Gorges Reservoir
Water quality index
eDNA metabarcoding
Journal
Water research
ISSN: 1879-2448
Titre abrégé: Water Res
Pays: England
ID NLM: 0105072
Informations de publication
Date de publication:
01 Nov 2023
01 Nov 2023
Historique:
received:
19
06
2023
revised:
25
09
2023
accepted:
29
09
2023
medline:
6
11
2023
pubmed:
10
10
2023
entrez:
9
10
2023
Statut:
ppublish
Résumé
Effective and standardized monitoring methodologies are vital for successful reservoir restoration and management. Environmental DNA (eDNA) metabarcoding sequencing offers a promising alternative for biomonitoring and can overcome many limitations of traditional morphological bioassessment. Recent attempts have even shown that supervised machine learning (SML) can directly infer biotic indices (BI) from eDNA metabarcoding data, bypassing the cumbersome calculation process of BI regardless of the taxonomic assignment of eDNA sequences. However, questions surrounding the general applicability of this taxonomy-free approach to monitoring reservoir health remain unclear, including model stability, feature selection, algorithm choice, and multi-season biomonitoring. Here, we firstly developed a novel biological integrity index (Me-IBI) that integrates multitrophic interactions and environmental information, based on taxonomy-assigned eDNA metabarcoding data. The Me-IBI can better distinguish the actual health status of the Three Gorges Reservoir (TGR) than physicochemical assessments and have a clear response to human activity. Then, taking this reliable Me-IBI as a supervised label, we compared the impact of selecting different numbers of features and SML algorithms on the stability and predictive performance of the model for predicting ecological conditions in multiple seasons using taxonomy-free eDNA metabarcoding data. We discovered that even with a small number of features, different SML algorithms can establish a stable model and obtain excellent predictive performance. Finally, we proposed a four-step strategy for standardized routine biomonitoring using SML tools. Our study firstly explores the general applicability problem of the taxonomy-free eDNA-SML approach and establishes a solid foundation for the large-scale and standardized biomonitoring application.
Identifiants
pubmed: 37812979
pii: S0043-1354(23)01126-0
doi: 10.1016/j.watres.2023.120686
pii:
doi:
Substances chimiques
DNA, Environmental
0
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Pagination
120686Informations de copyright
Copyright © 2023 Elsevier Ltd. All rights reserved.
Déclaration de conflit d'intérêts
Declaration of Competing Interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.