Prediction of Klebsiella phage-host specificity at the strain level.


Journal

Nature communications
ISSN: 2041-1723
Titre abrégé: Nat Commun
Pays: England
ID NLM: 101528555

Informations de publication

Date de publication:
22 May 2024
Historique:
received: 14 12 2023
accepted: 08 05 2024
medline: 23 5 2024
pubmed: 23 5 2024
entrez: 22 5 2024
Statut: epublish

Résumé

Phages are increasingly considered promising alternatives to target drug-resistant bacterial pathogens. However, their often-narrow host range can make it challenging to find matching phages against bacteria of interest. Current computational tools do not accurately predict interactions at the strain level in a way that is relevant and properly evaluated for practical use. We present PhageHostLearn, a machine learning system that predicts strain-level interactions between receptor-binding proteins and bacterial receptors for Klebsiella phage-bacteria pairs. We evaluate this system both in silico and in the laboratory, in the clinically relevant setting of finding matching phages against bacterial strains. PhageHostLearn reaches a cross-validated ROC AUC of up to 81.8% in silico and maintains this performance in laboratory validation. Our approach provides a framework for developing and evaluating phage-host prediction methods that are useful in practice, which we believe to be a meaningful contribution to the machine-learning-guided development of phage therapeutics and diagnostics.

Identifiants

pubmed: 38778023
doi: 10.1038/s41467-024-48675-6
pii: 10.1038/s41467-024-48675-6
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

4355

Informations de copyright

© 2024. The Author(s).

Références

Clokie, M. R. J., Miljard, A. D., Letarov, A. V. & Heaphy, S. Phages in nature. Bacteriophage 1, 31–45 (2011).
doi: 10.4161/bact.1.1.14942 pubmed: 21687533 pmcid: 3109452
Sørensen, A. N., Woudstra, C., Sørensen, M. C. H. & Brøndsted, L. Subtypes of tail spike proteins predicts the host range of Ackermannviridaephages. Comput Struct. Biotechnol. J. 19, 4854–4867 (2021).
doi: 10.1016/j.csbj.2021.08.030 pubmed: 34527194 pmcid: 8432352
Beamud, B. et al. Genetic determinants of host tropism in Klebsiella phages. Cell Rep. 42, 112048 (2023).
doi: 10.1016/j.celrep.2023.112048 pubmed: 36753420 pmcid: 9989827
Schwarzer, D. et al. A multivalent adsorption apparatus explains the broad host range of phage phi92: a comprehensive genomic and structural analysis. J. Virol. 86, 10384–10398 (2012).
doi: 10.1128/JVI.00801-12 pubmed: 22787233 pmcid: 3457257
Hanson, C. A., Marston, M. F. & Martiny, J. B. Biogeographic variation in host range phenotypes and taxonomic composition of marine cyanophage isolates. Front. Microbiol. 7, 983 (2016).
doi: 10.3389/fmicb.2016.00983 pubmed: 27446023 pmcid: 4919323
Klumpp, J., Dunne, M. & Loessner, M. J. A perfect fit: Bacteriophage receptor-binding proteins for diagnostic and therapeutic applications. Curr. Opin. Microbiol. 71, 102240 (2023).
doi: 10.1016/j.mib.2022.102240 pubmed: 36446275
Keen, E. C. Tradeoffs in bacteriophage life histories. Bacteriophage 4, e28365 (2014).
doi: 10.4161/bact.28365 pubmed: 24616839 pmcid: 3942329
Coclet, C. & Roux, S. Global overview and major challenges of host prediction methods for uncultivated phages. Curr. Opin. Virol. 49, 117–126 (2021).
doi: 10.1016/j.coviro.2021.05.003 pubmed: 34126465
Versoza, C. J. & Pfeifer, S. P. Computational prediction of bacteriophage host ranges. Microorganisms 10, 149 (2022).
doi: 10.3390/microorganisms10010149 pubmed: 35056598 pmcid: 8778386
Roux, S. et al. iPHoP: An integrated machine learning framework to maximize host prediction for metagenome-derived viruses of archaea and bacteria. PLoS Biol. 21, e3002083 (2023).
doi: 10.1371/journal.pbio.3002083 pubmed: 37083735 pmcid: 10155999
Shang, J. & Sun, Y. CHERRY: a computational method for accurate prediction of virus-prokaryotic interactions using a graph encoder-decoder model. Brief. Bioinforma. 23, bbac182 (2022).
doi: 10.1093/bib/bbac182
Schooley, R. T. et al. Development and use of personalized bacteriophage-based therapeutic cocktails to treat a patient with a disseminated resistant acinetobacter baumannii infection. Antimicrobial Agents Chemother. 61, e00954-17 (2017).
doi: 10.1128/AAC.00954-17
Dedrick, R. M. et al. Engineered bacteriophages for treatment of a patient with a disseminated drug-resistant Mycobacterium abscessus. Nat. Med. 25, 730–733 (2019).
doi: 10.1038/s41591-019-0437-z pubmed: 31068712 pmcid: 6557439
Eskenazi, A. et al. Combination of pre-adapted bacteriophage therapy and antibiotics for treatment of fracture-related infection due to pandrug-resistant Klebsiella pneumoniae. Nat. Commun. 13, 302 (2022).
doi: 10.1038/s41467-021-27656-z pubmed: 35042848 pmcid: 8766457
Ofer, D., Brandes, N. & Linial, M. The language of proteins: NLP, machine learning & protein sequences. Comput. Struct. Biotechnol. J. 19, 1750–1758 (2021).
doi: 10.1016/j.csbj.2021.03.022 pubmed: 33897979 pmcid: 8050421
Rives, A. et al. Biological structure and function emerge from scaling unsupervised learning to 250 million protein sequences. Proc. Natl. Acad. Sci. USA 118, e2016239118 (2021).
doi: 10.1073/pnas.2016239118 pubmed: 33876751 pmcid: 8053943
Brandes, N., Ofer, D., Peleg, Y., Rappoport, N. & Linial, M. ProteinBERT: a universal deep-learning model of protein sequence and function. Bioinformatics 38, 2102–2110 (2022).
doi: 10.1093/bioinformatics/btac020 pubmed: 35020807 pmcid: 9386727
Lood, C. et al. Digital phagograms: predicting phage infectivity through a multilayer machine learning approach. Curr. Opin. Virol. 52, 174–181 (2022).
doi: 10.1016/j.coviro.2021.12.004 pubmed: 34952265
Nobrega, F. L. et al. Targeting mechanisms of tailed bacteriophages. Nat. Rev. Microbiol 16, 760–773 (2018).
doi: 10.1038/s41579-018-0070-8 pubmed: 30104690
Antimicrobial Resistance Collaborators. Global burden of bacterial antimicrobial resistance in 2019: a systematic analysis. Lancet 399, 629–655 (2022).
doi: 10.1016/S0140-6736(21)02724-0
Leite, D. M. C. et al. Computational prediction of inter-species relationships through omics data analysis and machine learning. BMC Bioinform. 19, 151–159 (2018).
doi: 10.1186/s12859-018-2388-7
Opoku-Temeng, C., Kobayashi, S. D. & Deleo, F. R. Klebsiella pneumoniae capsule polysaccharide as a target for therapeutics and vaccines. Comp. Struct. Biotechnol. Journ. 17, 1360–1366 (2019).
Squeglia, F. et al. Structural and functional studies of a klebsiella phage capsule depolymerase tailspike: mechanistic insights into capsular degradation. Structure 28, 613–624.e4 (2020).
doi: 10.1016/j.str.2020.04.015 pubmed: 32386574
Chen, T., Guestrin, C. XGBoost: A Scalable Tree Boosting System. Proceedings of the KDD ’16: 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 785–794 (2016).
Stock, M. et al. Pairwise learning for predicting pollination interactions based on traits and phylogeny. Ecol. Model 451, 109508 (2021).
doi: 10.1016/j.ecolmodel.2021.109508
Ferriol-González C., et al Targeted phage hunting to specific Klebsiella pneumoniae clinical isolates is an efficient antibiotic resistance and infection control strategy. Preprint at: https://doi.org/10.1101/2024.01.07.574526 (2024).
McNair, K., Zhou, C., Dinsdale, E. A., Souza, B. & Edwards, R. A. PHANOTATE: a novel approach to gene identification in phage genomes. Bioinformatics 35, 4537–4542 (2019).
doi: 10.1093/bioinformatics/btz265 pubmed: 31329826 pmcid: 6853651
Boeckaerts, D., Stock, M., De Baets, B. & Briers, Y. Identification of phage receptor-binding protein sequences with hidden markov modelsodels and an Extreme Gradient Boosting Classifier. Viruses 14, 1329 (2022).
doi: 10.3390/v14061329 pubmed: 35746800 pmcid: 9230537
Latka, A., Leiman, P. G., Drulis-Kawa, Z. & Briers, Y. Modeling the architecture of depolymerase-containing receptor binding proteins in Klebsiella Phages. Front. Microbiol. 10, 2649 (2019).
doi: 10.3389/fmicb.2019.02649 pubmed: 31803168 pmcid: 6872550
Wyres, K. L. et al. Identification of Klebsiella capsule synthesis loci from whole genome data. Micro. Genomics 2, e000102 (2016).
Lam, M. M. C., Wick, R. R., Judd, L. M., Holt, K. E. & Wyres, K. L. Kaptive 2.0: updated capsule and lipopolysaccharide locus typing for the Klebsiella pneumoniae species complex. Micro. Genomics 8, 000800 (2022).
Babenko, B. Multiple Instance Learning: Algorithms and Applications. Dept. Comp. Sci. & Eng, University of California, San Diego, 1-19. Available at: https://cs.gmu.edu/~kosecka/bbabenko_re.pdf (2008).
Lin, Z. et al. Evolutionary-scale prediction of atomic-level protein structure with a language model. Science 379, 1123–1130 (2023).
doi: 10.1126/science.ade2574 pubmed: 36927031
Tesson, F. et al. Systematic and quantitative view of the antiviral arsenal of prokaryotes. Nat. Commun. 13, 2561 (2022).
doi: 10.1038/s41467-022-30269-9 pubmed: 35538097 pmcid: 9090908
Latka, A. et al. Engineering the modular receptor-binding proteins of Klebsiella phages switches their capsule Serotype Specificity. mBio 12, e00455-21 (2021).
doi: 10.1128/mBio.00455-21 pubmed: 33947754 pmcid: 8262889
Pires, D. P., Oliveira, H., Melo, L. D., Sillankorva, S. & Azeredo, J. Bacteriophage-encoded depolymerases: their diversity and biotechnological applications. Appl. Microbiol. Biotechnol. 100, 2141–2151 (2016).
doi: 10.1007/s00253-015-7247-0 pubmed: 26767986
Oliveira, H. et al. Ability of phages to infect Acinetobacter calcoaceticus-Acinetobacter baumannii complex species through acquisition of different pectate lyase depolymerase domains. Environ. Microbiol .19, 5060–5077 (2017).
doi: 10.1111/1462-2920.13970 pubmed: 29076652
Liu, B. et al. Structure and genetics of Escherichia coli O antigens. FEMS Microbiol. Rev. 44, 655–683 (2020).
doi: 10.1093/femsre/fuz028 pubmed: 31778182
N’Toia, C. H., Kizziah, J. L., Hatoum-Aslan, A. & Dokland, T. Structure and host specificity of Staphylococcus epidermidis bacteriophage Andhra. Sci. Adv. 8, eade0459 (2022).
doi: 10.1126/sciadv.ade0459
Lowe, T. M. & Chan, P. P. tRNAscan-SE On-line: integrating search and context for analysis of transfer RNA genes. Nucleic Acids Res. 44, W54–W57 (2016).
doi: 10.1093/nar/gkw413 pubmed: 27174935 pmcid: 4987944
Sant, D. G., Woods, L. C., Barr, J. J. & McDonald, M. J. Host diversity slows bacteriophage adaptation by selecting generalists over specialists. Nat. Ecol. Evol. 5, 350–359 (2021).
doi: 10.1038/s41559-020-01364-1 pubmed: 33432132

Auteurs

Dimitri Boeckaerts (D)

Laboratory of Applied Biotechnology, Department of Biotechnology, Ghent University, Ghent, Belgium.
KERMIT, Department of Data Analysis and Mathematical Modelling, Ghent University, Ghent, Belgium.

Michiel Stock (M)

KERMIT, Department of Data Analysis and Mathematical Modelling, Ghent University, Ghent, Belgium.

Celia Ferriol-González (C)

Institute for Integrative Systems Biology (I2SysBio), Universitat de Valencia-CSIC, Paterna, Spain.

Jesús Oteo-Iglesias (J)

Laboratorio de Referencia e Investigación en Resistencia a Antibióticos e Infecciones Relacionadas con la Asistencia Sanitaria, Centro Nacional de Microbiología, Instituto de Salud Carlos III, Madrid, Spain.
CIBER de Enfermedades Infecciosas (CIBERINFEC), Instituto de Salud Carlos III, Madrid, Spain.

Rafael Sanjuán (R)

Institute for Integrative Systems Biology (I2SysBio), Universitat de Valencia-CSIC, Paterna, Spain.

Pilar Domingo-Calap (P)

Institute for Integrative Systems Biology (I2SysBio), Universitat de Valencia-CSIC, Paterna, Spain.

Bernard De Baets (B)

KERMIT, Department of Data Analysis and Mathematical Modelling, Ghent University, Ghent, Belgium.

Yves Briers (Y)

Laboratory of Applied Biotechnology, Department of Biotechnology, Ghent University, Ghent, Belgium. Yves.Briers@UGent.be.

Articles similaires

Exploring blood-brain barrier passage using atomic weighted vector and machine learning.

Yoan Martínez-López, Paulina Phoobane, Yanaima Jauriga et al.
1.00
Blood-Brain Barrier Machine Learning Humans Support Vector Machine Software

Understanding the role of machine learning in predicting progression of osteoarthritis.

Simone Castagno, Benjamin Gompels, Estelle Strangmark et al.
1.00
Humans Disease Progression Machine Learning Osteoarthritis
Genome, Viral Ralstonia Composting Solanum lycopersicum Bacteriophages
Humans Meta-Analysis as Topic Sample Size Models, Statistical Computer Simulation

Classifications MeSH