A taxonomy-free approach based on machine learning to assess the quality of rivers with diatoms.

Bioassessment HYDRA Machine learning Metabarcoding OTUs Rivers

Journal

The Science of the total environment
ISSN: 1879-1026
Titre abrégé: Sci Total Environ
Pays: Netherlands
ID NLM: 0330500

Informations de publication

Date de publication:
20 Jun 2020
Historique:
received: 04 11 2019
revised: 10 03 2020
accepted: 11 03 2020
pubmed: 22 3 2020
medline: 26 6 2020
entrez: 22 3 2020
Statut: ppublish

Résumé

Diatoms are a compulsory biological quality element in the ecological assessment of rivers according to the Water Framework Directive. The application of current official indices requires the identification of individuals to species or lower rank under a microscope based on the valve morphology. This is a highly time-consuming task, often susceptible of disagreements among analysts. In alternative, the use of DNA metabarcoding combined with High-Throughput Sequencing (HTS) has been proposed. The sequences obtained from environmental DNA are clustered into Operational Taxonomic Units (OTUs), which can be assigned to a taxon using reference databases, and from there calculate biotic indices. However, there is still a high percentage of unassigned OTUs to species due to the incompleteness of reference libraries. Alternatively, we tested a new taxonomy-free approach based on diatom community samples to assess rivers. A combination of three machine learning techniques is used to build models that predict diatom OTUs expected in test sites, under reference conditions, from environmental data. The Observed/Expected OTUs ratio indicates the deviation from reference condition and is converted into a quality class. This approach was never used with diatoms neither with OTUs data. To evaluate its efficiency, we built a model based on OTUs lists (HYDGEN) and another based on taxa lists from morphological identification (HYDMORPH), and also calculated a biotic index (IPS). The models were trained and tested with data from 81 sites (44 reference sites) from central Portugal. Both models were considered accurate (linear regression for Observed and Expected richness: R

Identifiants

pubmed: 32199386
pii: S0048-9697(20)31413-3
doi: 10.1016/j.scitotenv.2020.137900
pii:
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

137900

Informations de copyright

Copyright © 2020 Elsevier B.V. All rights reserved.

Déclaration de conflit d'intérêts

Declaration of competing interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Auteurs

Maria João Feio (MJ)

MARE - Marine and Environmental Sciences Centre, Department of Life Sciences, University of Coimbra, Portugal. Electronic address: mjf@ci.uc.pt.

Sónia R Q Serra (SRQ)

MARE - Marine and Environmental Sciences Centre, Department of Life Sciences, University of Coimbra, Portugal.

Andreia Mortágua (A)

Department of Biology and Geobiotec - Geobiosciences, Geotechnologies and Geoengineering Research Centre, University of Aveiro, Campus de Santiago, 3810-193 Aveiro, Portugal.

Agnès Bouchez (A)

UMR CARRTEL, INRAE, Université Savoie Mont-Blanc, F-74200 Thonon, France.

Frédéric Rimet (F)

UMR CARRTEL, INRAE, Université Savoie Mont-Blanc, F-74200 Thonon, France.

Valentin Vasselon (V)

Pôle R&D "ECLA", France; AFB, Site INRA UMR CARRTEL, Thonon-les-Bains, France.

Salomé F P Almeida (SFP)

Department of Biology and Geobiotec - Geobiosciences, Geotechnologies and Geoengineering Research Centre, University of Aveiro, Campus de Santiago, 3810-193 Aveiro, Portugal.

Articles similaires

Exploring blood-brain barrier passage using atomic weighted vector and machine learning.

Yoan Martínez-López, Paulina Phoobane, Yanaima Jauriga et al.
1.00
Blood-Brain Barrier Machine Learning Humans Support Vector Machine Software

Understanding the role of machine learning in predicting progression of osteoarthritis.

Simone Castagno, Benjamin Gompels, Estelle Strangmark et al.
1.00
Humans Disease Progression Machine Learning Osteoarthritis
India Carbon Sequestration Environmental Monitoring Carbon Biomass
Rivers Turkey Biodiversity Environmental Monitoring Animals

Classifications MeSH