Streamlining CRISPR spacer-based bacterial host predictions to decipher the viral dark matter.


Journal

Nucleic acids research
ISSN: 1362-4962
Titre abrégé: Nucleic Acids Res
Pays: England
ID NLM: 0411011

Informations de publication

Date de publication:
06 04 2021
Historique:
accepted: 17 02 2021
revised: 15 02 2021
received: 21 10 2020
pubmed: 8 3 2021
medline: 13 5 2021
entrez: 7 3 2021
Statut: ppublish

Résumé

Thousands of new phages have recently been discovered thanks to viral metagenomics. These phages are extremely diverse and their genome sequences often do not resemble any known phages. To appreciate their ecological impact, it is important to determine their bacterial hosts. CRISPR spacers can be used to predict hosts of unknown phages, as spacers represent biological records of past phage-bacteria interactions. However, no guidelines have been established to standardize host prediction based on CRISPR spacers. Additionally, there are no tools that use spacers to perform host predictions on large viral datasets. Here, we developed a set of tools that includes all the necessary steps for predicting the hosts of uncharacterized phages. We created a database of >11 million spacers and a program to execute host predictions on large viral datasets. Our host prediction approach uses biological criteria inspired by how CRISPR-Cas naturally work as adaptive immune systems, which make the results easy to interpret. We evaluated the performance using 9484 phages with known hosts and obtained a recall of 49% and a precision of 69%. We also found that this host prediction method yielded higher performance for phages that infect gut-associated bacteria, suggesting it is well suited for gut-virome characterization.

Identifiants

pubmed: 33677572
pii: 6157093
doi: 10.1093/nar/gkab133
pmc: PMC8034630
doi:

Types de publication

Journal Article Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

3127-3138

Informations de copyright

© The Author(s) 2021. Published by Oxford University Press on behalf of Nucleic Acids Research.

Références

Microbiology (Reading). 2005 Aug;151(Pt 8):2551-2561
pubmed: 16079334
BMC Bioinformatics. 2007 Jun 18;8:209
pubmed: 17577412
FEMS Microbiol Rev. 2016 Mar;40(2):258-72
pubmed: 26657537
Mol Biol Evol. 2016 Jun;33(6):1635-8
pubmed: 26921390
ISME J. 2019 Oct;13(10):2589-2602
pubmed: 31239539
Sci Rep. 2017 Mar 06;7:43438
pubmed: 28262818
RNA Biol. 2013 May;10(5):687-93
pubmed: 23628889
Mob DNA. 2017 Oct 3;8:12
pubmed: 29026445
Cell Host Microbe. 2019 Oct 9;26(4):527-541.e5
pubmed: 31600503
Theor Popul Biol. 2018 Feb;119:72-82
pubmed: 29174635
BMC Bioinformatics. 2009 Dec 15;10:421
pubmed: 20003500
Viruses. 2018 Sep 07;10(9):
pubmed: 30205462
Commun Biol. 2020 Jun 22;3(1):321
pubmed: 32572116
Nat Commun. 2017 Jun 23;8:15892
pubmed: 28643787
Microbiome. 2020 Jun 10;8(1):90
pubmed: 32522236
Proc Natl Acad Sci U S A. 2013 Jul 23;110(30):12450-5
pubmed: 23836644
Nat Commun. 2018 Nov 14;9(1):4781
pubmed: 30429469
Philos Trans R Soc Lond B Biol Sci. 2019 May 13;374(1772):20190101
pubmed: 30905294
J Bacteriol. 2008 Feb;190(4):1401-12
pubmed: 18065539
BMC Genomics. 2016 May 17;17:356
pubmed: 27184979
Nucleic Acids Res. 2018 Jul 2;46(W1):W246-W251
pubmed: 29790974
Nucleic Acids Res. 2007 Jul;35(Web Server issue):W52-7
pubmed: 17537822
Cell Host Microbe. 2019 Sep 11;26(3):325-335.e5
pubmed: 31492655
Front Microbiol. 2018 Aug 31;9:2033
pubmed: 30233520
Science. 2007 Mar 23;315(5819):1709-12
pubmed: 17379808
Nucleic Acids Res. 2018 Jul 2;46(W1):W200-W204
pubmed: 29905871
Nat Commun. 2014 Jul 24;5:4498
pubmed: 25058116
PeerJ. 2017 Sep 7;5:e3788
pubmed: 28894651
Nature. 2016 Aug 25;536(7617):425-30
pubmed: 27533034
BMC Bioinformatics. 2007 Jan 20;8:18
pubmed: 17239253
Nat Biotechnol. 2019 Feb;37(2):179-185
pubmed: 30718868
Nat Rev Microbiol. 2020 Feb;18(2):67-83
pubmed: 31857715
Cell. 2019 May 16;177(5):1109-1123.e14
pubmed: 31031001
Nat Rev Microbiol. 2020 Mar;18(3):125-138
pubmed: 32015529
Nat Rev Microbiol. 2015 Mar;13(3):147-59
pubmed: 25639680
Nucleic Acids Res. 2019 Jan 8;47(D1):D666-D677
pubmed: 30289528
Microbiol Mol Biol Rev. 2020 Mar 4;84(2):
pubmed: 32132243
Nature. 2010 Nov 4;468(7320):67-71
pubmed: 21048762
Nat Microbiol. 2018 Jul;3(7):754-766
pubmed: 29867096
Nat Rev Microbiol. 2015 Nov;13(11):722-36
pubmed: 26411297
Nucleic Acids Res. 2017 Jan 4;45(D1):D482-D490
pubmed: 27899678
Bioinformatics. 2017 Oct 1;33(19):3113-3114
pubmed: 28957499
Nat Rev Microbiol. 2010 May;8(5):317-27
pubmed: 20348932

Auteurs

Moïra B Dion (MB)

Département de biochimie, de microbiologie et de bio-informatique, Faculté des sciences et de génie, Université Laval, Québec City, Québec G1V 0A6, Canada.
Groupe de recherche en écologie buccale, Faculté de médecine dentaire, Université Laval, Québec City, Québec G1V 0A6, Canada.

Pier-Luc Plante (PL)

Centre de recherche en infectiologie de l'Université Laval, Axe maladies infectieuses et immunitaires, Centre de Recherche du CHU de Québec-Université Laval, Québec City, Québec G1V 4G2, Canada.
Centre de recherche en données massives, Université Laval, Québec City, Québec G1V 0A6, Canada.
Département de médecine moléculaire, Faculté de Médecine, Université Laval, Québec City, Québec G1V 0A6, Canada.

Edwige Zufferey (E)

Département de biochimie, de microbiologie et de bio-informatique, Faculté des sciences et de génie, Université Laval, Québec City, Québec G1V 0A6, Canada.
Groupe de recherche en écologie buccale, Faculté de médecine dentaire, Université Laval, Québec City, Québec G1V 0A6, Canada.

Shiraz A Shah (SA)

COPSAC, Copenhagen Prospective Studies on Asthma in Childhood, Herlev and Gentofte Hospital, University of Copenhagen, Gentofte 2820, Denmark.

Jacques Corbeil (J)

Centre de recherche en infectiologie de l'Université Laval, Axe maladies infectieuses et immunitaires, Centre de Recherche du CHU de Québec-Université Laval, Québec City, Québec G1V 4G2, Canada.
Centre de recherche en données massives, Université Laval, Québec City, Québec G1V 0A6, Canada.
Département de médecine moléculaire, Faculté de Médecine, Université Laval, Québec City, Québec G1V 0A6, Canada.

Sylvain Moineau (S)

Département de biochimie, de microbiologie et de bio-informatique, Faculté des sciences et de génie, Université Laval, Québec City, Québec G1V 0A6, Canada.
Groupe de recherche en écologie buccale, Faculté de médecine dentaire, Université Laval, Québec City, Québec G1V 0A6, Canada.
Félix d'Hérelle Reference Center for Bacterial Viruses, Université Laval, Québec City, Québec G1V 0A6, Canada.

Articles similaires

Selecting optimal software code descriptors-The case of Java.

Yegor Bugayenko, Zamira Kholmatova, Artem Kruglov et al.
1.00
Software Algorithms Programming Languages

Exploring blood-brain barrier passage using atomic weighted vector and machine learning.

Yoan Martínez-López, Paulina Phoobane, Yanaima Jauriga et al.
1.00
Blood-Brain Barrier Machine Learning Humans Support Vector Machine Software
Coal Metagenome Phylogeny Bacteria Genome, Bacterial
Genome, Viral Ralstonia Composting Solanum lycopersicum Bacteriophages

Classifications MeSH