Prediction of antimicrobial resistance based on whole-genome sequencing and machine learning.


Journal

Bioinformatics (Oxford, England)
ISSN: 1367-4811
Titre abrégé: Bioinformatics
Pays: England
ID NLM: 9808944

Informations de publication

Date de publication:
03 01 2022
Historique:
received: 29 06 2021
revised: 27 08 2021
accepted: 24 09 2021
pubmed: 7 10 2021
medline: 3 2 2023
entrez: 6 10 2021
Statut: ppublish

Résumé

Antimicrobial resistance (AMR) is one of the biggest global problems threatening human and animal health. Rapid and accurate AMR diagnostic methods are thus very urgently needed. However, traditional antimicrobial susceptibility testing (AST) is time-consuming, low throughput and viable only for cultivable bacteria. Machine learning methods may pave the way for automated AMR prediction based on genomic data of the bacteria. However, comparing different machine learning methods for the prediction of AMR based on different encodings and whole-genome sequencing data without previously known knowledge remains to be done. In this study, we evaluated logistic regression (LR), support vector machine (SVM), random forest (RF) and convolutional neural network (CNN) for the prediction of AMR for the antibiotics ciprofloxacin, cefotaxime, ceftazidime and gentamicin. We could demonstrate that these models can effectively predict AMR with label encoding, one-hot encoding and frequency matrix chaos game representation (FCGR encoding) on whole-genome sequencing data. We trained these models on a large AMR dataset and evaluated them on an independent public dataset. Generally, RFs and CNNs perform better than LR and SVM with AUCs up to 0.96. Furthermore, we were able to identify mutations that are associated with AMR for each antibiotic. Source code in data preparation and model training are provided at GitHub website (https://github.com/YunxiaoRen/ML-iAMR). Supplementary data are available at Bioinformatics online.

Identifiants

pubmed: 34613360
pii: 6382301
doi: 10.1093/bioinformatics/btab681
pmc: PMC8722762
doi:

Substances chimiques

Anti-Bacterial Agents 0
Ciprofloxacin 5E8K9I0O4U

Types de publication

Journal Article Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

325-334

Subventions

Organisme : German Federal Ministry of Education and Research (BMBF)
ID : 031L0209B

Informations de copyright

© The Author(s) 2021. Published by Oxford University Press.

Auteurs

Yunxiao Ren (Y)

Department of Data Science in Biomedicine, Faculty of Mathematics and Computer Science, Philipps-University of Marburg, Marburg 35032, Germany.

Trinad Chakraborty (T)

Institute of Medical Microbiology, Justus Liebig University Giessen, Giessen 35392, Germany.
German Center for Infection Research, Partner site Giessen-Marburg-Langen, Giessen 35392, Germany.

Swapnil Doijad (S)

Institute of Medical Microbiology, Justus Liebig University Giessen, Giessen 35392, Germany.
German Center for Infection Research, Partner site Giessen-Marburg-Langen, Giessen 35392, Germany.

Linda Falgenhauer (L)

German Center for Infection Research, Partner site Giessen-Marburg-Langen, Giessen 35392, Germany.
Institute of Hygiene and Environmental Medicine, Justus Liebig University Giessen, Giessen 35392, Germany.
Hessisches universitäres Kompetenzzentrum Krankenhaushygiene, Giessen 35392, Germany.

Jane Falgenhauer (J)

Institute of Medical Microbiology, Justus Liebig University Giessen, Giessen 35392, Germany.
German Center for Infection Research, Partner site Giessen-Marburg-Langen, Giessen 35392, Germany.

Alexander Goesmann (A)

German Center for Infection Research, Partner site Giessen-Marburg-Langen, Giessen 35392, Germany.
Department of Bioinformatics and Systems Biology, Justus Liebig University Giessen, Giessen 35392, Germany.

Anne-Christin Hauschild (AC)

Department of Data Science in Biomedicine, Faculty of Mathematics and Computer Science, Philipps-University of Marburg, Marburg 35032, Germany.

Oliver Schwengers (O)

German Center for Infection Research, Partner site Giessen-Marburg-Langen, Giessen 35392, Germany.
Department of Bioinformatics and Systems Biology, Justus Liebig University Giessen, Giessen 35392, Germany.

Dominik Heider (D)

Department of Data Science in Biomedicine, Faculty of Mathematics and Computer Science, Philipps-University of Marburg, Marburg 35032, Germany.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH