Pathogen detection in RNA-seq data with Pathonoia.


Journal

BMC bioinformatics
ISSN: 1471-2105
Titre abrégé: BMC Bioinformatics
Pays: England
ID NLM: 100965194

Informations de publication

Date de publication:
17 Feb 2023
Historique:
received: 29 08 2022
accepted: 10 01 2023
entrez: 21 2 2023
pubmed: 22 2 2023
medline: 25 2 2023
Statut: epublish

Résumé

Bacterial and viral infections may cause or exacerbate various human diseases and to detect microbes in tissue, one method of choice is RNA sequencing. The detection of specific microbes using RNA sequencing offers good sensitivity and specificity, but untargeted approaches suffer from high false positive rates and a lack of sensitivity for lowly abundant organisms. We introduce Pathonoia, an algorithm that detects viruses and bacteria in RNA sequencing data with high precision and recall. Pathonoia first applies an established k-mer based method for species identification and then aggregates this evidence over all reads in a sample. In addition, we provide an easy-to-use analysis framework that highlights potential microbe-host interactions by correlating the microbial to the host gene expression. Pathonoia outperforms state-of-the-art methods in microbial detection specificity, both on in silico and real datasets. Two case studies in human liver and brain show how Pathonoia can support novel hypotheses on microbial infection exacerbating disease. The Python package for Pathonoia sample analysis and a guided analysis Jupyter notebook for bulk RNAseq datasets are available on GitHub.

Sections du résumé

BACKGROUND BACKGROUND
Bacterial and viral infections may cause or exacerbate various human diseases and to detect microbes in tissue, one method of choice is RNA sequencing. The detection of specific microbes using RNA sequencing offers good sensitivity and specificity, but untargeted approaches suffer from high false positive rates and a lack of sensitivity for lowly abundant organisms.
RESULTS RESULTS
We introduce Pathonoia, an algorithm that detects viruses and bacteria in RNA sequencing data with high precision and recall. Pathonoia first applies an established k-mer based method for species identification and then aggregates this evidence over all reads in a sample. In addition, we provide an easy-to-use analysis framework that highlights potential microbe-host interactions by correlating the microbial to the host gene expression. Pathonoia outperforms state-of-the-art methods in microbial detection specificity, both on in silico and real datasets.
CONCLUSION CONCLUSIONS
Two case studies in human liver and brain show how Pathonoia can support novel hypotheses on microbial infection exacerbating disease. The Python package for Pathonoia sample analysis and a guided analysis Jupyter notebook for bulk RNAseq datasets are available on GitHub.

Identifiants

pubmed: 36803415
doi: 10.1186/s12859-023-05144-z
pii: 10.1186/s12859-023-05144-z
pmc: PMC9938591
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

53

Subventions

Organisme : Deutsche Forschungsgemeinschaft
ID : CRU306 P-C
Organisme : Forschungs- und Wissenschaftsstiftung Hamburg
ID : LFF 78

Informations de copyright

© 2023. The Author(s).

Références

PLoS One. 2015 Feb 23;10(2):e0118286
pubmed: 25706956
Expert Rev Gastroenterol Hepatol. 2018 Jul;12(7):641-656
pubmed: 29806487
Genome Biol. 2018 Nov 16;19(1):198
pubmed: 30445993
Nat Rev Microbiol. 2012 Sep;10(9):618-30
pubmed: 22890146
Genome Biol. 2017 Sep 21;18(1):182
pubmed: 28934964
Euro Surveill. 2017 Dec;22(49):
pubmed: 29233255
Medicine (Baltimore). 1984 May;63(3):133-54
pubmed: 6371440
BMC Genomics. 2016 Jan 05;17:28
pubmed: 26732976
Nat Commun. 2015 May 20;6:7126
pubmed: 25989971
Bioinformatics. 2011 Aug 15;27(16):2194-200
pubmed: 21700674
Eur J Clin Microbiol Infect Dis. 1995 Oct;14(10):908-11
pubmed: 8605906
Sci Rep. 2019 Nov 14;9(1):16841
pubmed: 31727957
Gigascience. 2018 Jun 1;7(6):
pubmed: 29901703
Nucleic Acids Res. 2019 Jul 2;47(W1):W199-W205
pubmed: 31114916
IDCases. 2018 Jan 09;11:61-63
pubmed: 29619322
Genome Biol. 2014;15(12):550
pubmed: 25516281
J Hosp Infect. 2011 Mar;77(3):267-70
pubmed: 21216034
PLoS Comput Biol. 2019 Apr 8;15(4):e1006967
pubmed: 30958827
Appl Microbiol. 1973 Jun;25(6):900-4
pubmed: 4577489
Cell. 2019 Aug 8;178(4):779-794
pubmed: 31398336
BMC Bioinformatics. 2018 Feb 14;19(1):54
pubmed: 29444641
Genome Res. 2016 Dec;26(12):1721-1729
pubmed: 27852649
Front Cell Infect Microbiol. 2019 May 08;9:148
pubmed: 31139578
J Hosp Infect. 2008 Nov;70(3):241-5
pubmed: 18799235
Gigascience. 2021 Feb 16;10(2):
pubmed: 33590861
Nucleic Acids Res. 2020 Jan 8;48(D1):D204-D219
pubmed: 31598718
Proc Natl Acad Sci U S A. 2020 Aug 4;117(31):18670-18679
pubmed: 32675236
Neurosci Insights. 2021 May 27;16:26331055211018709
pubmed: 34104888
BMC Bioinformatics. 2019 Apr 18;20(Suppl 4):168
pubmed: 30999839
Trends Microbiol. 2019 Nov;27(11):964-965
pubmed: 31160162
Genome Biol. 2019 Nov 28;20(1):257
pubmed: 31779668
Brief Bioinform. 2019 Jul 19;20(4):1125-1136
pubmed: 29028872

Auteurs

Anna-Maria Liebhoff (AM)

Institute for Medical Systems Biology, University Medical Center Hamburg-Eppendorf, Hamburg, Germany. aliebho1@jhu.edu.
Department of Computer Science, Whiting School of Engineering, Johns Hopkins University, Baltimore, USA. aliebho1@jhu.edu.

Kevin Menden (K)

Department of Genome Biology of Neurodegenerative Diseases, DZNE, Tübingen, Germany.

Alena Laschtowitz (A)

I. Department of Medicine, University Medical Centre Hamburg-Eppendorf, Hamburg, Germany.

Andre Franke (A)

Institute of Clinical Molecular Biology, Christian-Albrechts-University of Kiel, Kiel, Germany.

Christoph Schramm (C)

I. Department of Medicine, University Medical Centre Hamburg-Eppendorf, Hamburg, Germany.
Martin Zeitz Center for Rare Diseases, University Medical Center Hamburg-Eppendorf, Hamburg, Germany.
Hamburg Center for Translational Immunology (HCTI), University Medical Center Hamburg-Eppendorf, Hamburg, Germany.

Stefan Bonn (S)

Institute for Medical Systems Biology, University Medical Center Hamburg-Eppendorf, Hamburg, Germany.

Articles similaires

Genome, Chloroplast Phylogeny Genetic Markers Base Composition High-Throughput Nucleotide Sequencing

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C

Classifications MeSH