DNA polymerase swapping in Caudoviricetes bacteriophages.


Journal

Virology journal
ISSN: 1743-422X
Titre abrégé: Virol J
Pays: England
ID NLM: 101231645

Informations de publication

Date de publication:
26 Aug 2024
Historique:
received: 21 05 2024
accepted: 21 08 2024
medline: 27 8 2024
pubmed: 27 8 2024
entrez: 26 8 2024
Statut: epublish

Résumé

Viruses with double-stranded (ds) DNA genomes in the realm Duplodnaviria share a conserved structural gene module but show a broad range of variation in their repertoires of DNA replication proteins. Some of the duplodnaviruses encode (nearly) complete replication systems whereas others lack (almost) all genes required for replication, relying on the host replication machinery. DNA polymerases (DNAPs) comprise the centerpiece of the DNA replication apparatus. The replicative DNAPs are classified into 4 unrelated or distantly related families (A-D), with the protein structures and sequences within each family being, generally, highly conserved. More than half of the duplodnaviruses encode a DNAP of family A, B or C. We showed previously that multiple pairs of closely related viruses in the order Crassvirales encode DNAPs of different families. Groups of phages in which DNAP swapping likely occurred were identified as subtrees of a defined depth in a comprehensive evolutionary tree of tailed bacteriophages that included phages with DNAPs of different families. The DNAP swaps were validated by constrained tree analysis that was performed on phylogenetic tree of large terminase subunits, and the phage genomes encoding swapped DNAPs were aligned using Mauve. The structures of the discovered unusual DNAPs were predicted using AlphaFold2. We identified four additional groups of tailed phages in the class Caudoviricetes in which the DNAPs apparently were swapped on multiple occasions, with replacements occurring both between families A and B, or A and C, or between distinct subfamilies within the same family. The DNAP swapping always occurs "in situ", without changes in the organization of the surrounding genes. In several cases, the DNAP gene is the only region of substantial divergence between closely related phage genomes, whereas in others, the swap apparently involved neighboring genes encoding other proteins involved in phage genome replication. In addition, we identified two previously undetected, highly divergent groups of family A DNAPs that are encoded in some phage genomes along with the main DNAP implicated in genome replication. Replacement of the DNAP gene by one encoding a DNAP of a different family occurred on many independent occasions during the evolution of different families of tailed phages, in some cases, resulting in very closely related phages encoding unrelated DNAPs. DNAP swapping was likely driven by selection for avoidance of host antiphage mechanisms targeting the phage DNAP that remain to be identified, and/or by selection against replicon incompatibility.

Sections du résumé

BACKGROUND BACKGROUND
Viruses with double-stranded (ds) DNA genomes in the realm Duplodnaviria share a conserved structural gene module but show a broad range of variation in their repertoires of DNA replication proteins. Some of the duplodnaviruses encode (nearly) complete replication systems whereas others lack (almost) all genes required for replication, relying on the host replication machinery. DNA polymerases (DNAPs) comprise the centerpiece of the DNA replication apparatus. The replicative DNAPs are classified into 4 unrelated or distantly related families (A-D), with the protein structures and sequences within each family being, generally, highly conserved. More than half of the duplodnaviruses encode a DNAP of family A, B or C. We showed previously that multiple pairs of closely related viruses in the order Crassvirales encode DNAPs of different families.
METHODS METHODS
Groups of phages in which DNAP swapping likely occurred were identified as subtrees of a defined depth in a comprehensive evolutionary tree of tailed bacteriophages that included phages with DNAPs of different families. The DNAP swaps were validated by constrained tree analysis that was performed on phylogenetic tree of large terminase subunits, and the phage genomes encoding swapped DNAPs were aligned using Mauve. The structures of the discovered unusual DNAPs were predicted using AlphaFold2.
RESULTS RESULTS
We identified four additional groups of tailed phages in the class Caudoviricetes in which the DNAPs apparently were swapped on multiple occasions, with replacements occurring both between families A and B, or A and C, or between distinct subfamilies within the same family. The DNAP swapping always occurs "in situ", without changes in the organization of the surrounding genes. In several cases, the DNAP gene is the only region of substantial divergence between closely related phage genomes, whereas in others, the swap apparently involved neighboring genes encoding other proteins involved in phage genome replication. In addition, we identified two previously undetected, highly divergent groups of family A DNAPs that are encoded in some phage genomes along with the main DNAP implicated in genome replication.
CONCLUSIONS CONCLUSIONS
Replacement of the DNAP gene by one encoding a DNAP of a different family occurred on many independent occasions during the evolution of different families of tailed phages, in some cases, resulting in very closely related phages encoding unrelated DNAPs. DNAP swapping was likely driven by selection for avoidance of host antiphage mechanisms targeting the phage DNAP that remain to be identified, and/or by selection against replicon incompatibility.

Identifiants

pubmed: 39187833
doi: 10.1186/s12985-024-02482-z
pii: 10.1186/s12985-024-02482-z
doi:

Substances chimiques

DNA-Directed DNA Polymerase EC 2.7.7.7
Viral Proteins 0
DNA, Viral 0

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

200

Informations de copyright

© 2024. This is a U.S. Government work and not under copyright protection in the US; foreign copyright protection may apply.

Références

Krupovic M, Bamford DH. Order to the viral universe. J Virol. 2010;84(24):12476–9.
Koonin EV, Dolja VV, Krupovic M, Varsani A, Wolf YI, Yutin N, Zerbini FM, Kuhn JH. Global organization and proposed megataxonomy of the Virus World. Microbiol Mol Biol Rev. 2020;84(2):e00061–19.
Weigel C, Seitz H. Bacteriophage replication modules. FEMS Microbiol Rev. 2006;30(3):321–81.
pubmed: 16594962 doi: 10.1111/j.1574-6976.2006.00015.x
Kazlauskas D, Krupovic M, Venclovas C. The logic of DNA replication in double-stranded DNA viruses: insights from global analysis of viral genomes. Nucleic Acids Res. 2016;44(10):4551–64.
pubmed: 27112572 pmcid: 4889955 doi: 10.1093/nar/gkw322
Koonin EV, Krupovic M, Ishino S, Ishino Y. The replication machinery of LUCA: Common origin of DNA replication and transcription. BMC Biology. 2020;18(1):61.
Czernecki D, Nourisson A, Legrand P, Delarue M. Reclassification of family A DNA polymerases reveals novel functional subfamilies and distinctive structural features. Nucleic Acids Res. 2023;51(9):4488–507.
pubmed: 37070157 pmcid: 10201439 doi: 10.1093/nar/gkad242
Raia P, Delarue M, Sauguet L. An updated structural classification of replicative DNA polymerases. Biochem Soc Trans. 2019;47(1):239–49.
pubmed: 30647142 doi: 10.1042/BST20180579
Kazlauskas D, Krupovic M, Guglielmini J, Forterre P, Venclovas C. Diversity and evolution of B-family DNA polymerases. Nucleic Acids Res. 2020;48(18):10142–56.
pubmed: 32976577 pmcid: 7544198 doi: 10.1093/nar/gkaa760
Sauguet L. The Extended two-Barrel polymerases Superfamily: structure, function and evolution. J Mol Biol. 2019;431(20):4167–83.
pubmed: 31103775 doi: 10.1016/j.jmb.2019.05.017
Kornberg A, Baker TS. DNA replication. 2nd ed. San Francisco: Freeman; 1992.
Greci MD, Bell SD. Archaeal DNA replication. Annu Rev Microbiol. 2020;74:65–80.
pubmed: 32503372 pmcid: 7712474 doi: 10.1146/annurev-micro-020518-115443
Burgers PM. Polymerase dynamics at the eukaryotic DNA replication fork. J Biol Chem. 2009;284(7):4041–5.
pubmed: 18835809 pmcid: 2640984 doi: 10.1074/jbc.R800062200
Burgers PMJ, Kunkel TA. Eukaryotic DNA replication fork. Annu Rev Biochem. 2017;86:417–38.
pubmed: 28301743 pmcid: 5597965 doi: 10.1146/annurev-biochem-061516-044709
Krupovic M, Kuhn JH, Fischer MG, Koonin EV. Natural history of eukaryotic DNA viruses with double jelly-roll major capsid proteins. Proc Natl Acad Sci U S A. 2024;121(23):e2405771121.
Krupovic M, Koonin EV. Polintons: a hotbed of eukaryotic virus, transposon and plasmid evolution. Nat Rev Microbiol. 2015;13(2):105–15.
pubmed: 25534808 doi: 10.1038/nrmicro3389
Schoenfeld TW, Murugapiran SK, Dodsworth JA, Floyd S, Lodes M, Mead DA, Hedlund BP. Lateral gene transfer of family A DNA polymerases between thermophilic viruses, aquificae, and apicomplexa. Mol Biol Evol. 2013;30(7):1653–64.
pubmed: 23608703 pmcid: 3684859 doi: 10.1093/molbev/mst078
Nasko DJ, Chopyk J, Sakowski EG, Ferrell BD, Polson SW, Wommack KE. Family A DNA polymerase phylogeny uncovers diversity and Replication Gene Organization in the Virioplankton. Front Microbiol. 2018;9:3053.
pubmed: 30619142 pmcid: 6302109 doi: 10.3389/fmicb.2018.03053
Iyer LM, Abhiman S, Aravind L. A new family of polymerases related to superfamily A DNA polymerases and T7-like DNA-dependent RNA polymerases. Biol Direct. 2008;3:39.
pubmed: 18834537 pmcid: 2579912 doi: 10.1186/1745-6150-3-39
Makarova KS, Krupovic M, Koonin EV. Evolution of replicative DNA polymerases in archaea and their contributions to the eukaryotic replication machinery. Front Microbiol. 2014;5:354.
pubmed: 25101062 pmcid: 4104785 doi: 10.3389/fmicb.2014.00354
Prangishvili D, Bamford DH, Forterre P, Iranzo J, Koonin EV, Krupovic M. The enigmatic archaeal virosphere. Nat Rev Microbiol. 2017;15(12):724–39.
pubmed: 29123227 doi: 10.1038/nrmicro.2017.125
Yutin N, Benler S, Shmakov SA, Wolf YI, Tolstoy I, Rayko M, Antipov D, Pevzner PA. Analysis of metagenome-assembled viral genomes from the human gut reveals diverse putative CrAss-like phages with unique genomic features. Nat Commun. 2021;12:1044.
pubmed: 33594055 pmcid: 7886860 doi: 10.1038/s41467-021-21350-w
Krupovic M, Bamford DH. Putative prophages related to lytic tailless marine dsDNA phage PM2 are widespread in the genomes of aquatic bacteria. BMC Genomics. 2007;8:236.
pubmed: 17634101 pmcid: 1950889 doi: 10.1186/1471-2164-8-236
Yutin N, Rayko M, Antipov D, Mutz P, Wolf YI, Krupovic M, Koonin EV. Varidnaviruses in the human gut: a major expansion of the order Vinavirales. Viruses 2022;14(9):1842.
Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997;25(17):3389–402.
pubmed: 9254694 pmcid: 146917 doi: 10.1093/nar/25.17.3389
Lefort V, Desper R, Gascuel O. FastME 2.0: a Comprehensive, Accurate, and fast Distance-based phylogeny inference program. Mol Biol Evol. 2015;32(10):2798–800.
pubmed: 26130081 pmcid: 4576710 doi: 10.1093/molbev/msv150
Keown RA, Dums JT, Brumm PJ, MacDonald J, Mead DA, Ferrell BD, Moore RM, Harrison AO, Polson SW, Wommack KE. Novel viral DNA polymerases from metagenomes suggest genomic sources of Strand-Displacing Biochemical Phenotypes. Front Microbiol. 2022;13:858366.
pubmed: 35531281 pmcid: 9069017 doi: 10.3389/fmicb.2022.858366
Dorawa S, Werbowy O, Plotka M, Kaczorowska AK, Makowska J, Kozlowski LP, Fridjonsson OH, Hreggvidsson GO, Aevarsson A, Kaczorowski T. Molecular characterization of a DNA polymerase from Thermus thermophilus MAT72 Phage vB_Tt72: a novel Type-A family enzyme with strong proofreading activity. Int J Mol Sci 2022, 23(14).
Timinskas K, Venclovas C. New insights into the structures and interactions of bacterial Y-family DNA polymerases. Nucleic Acids Res. 2019;47(9):4393–405.
pubmed: 30916324 pmcid: 6511836 doi: 10.1093/nar/gkz198
Steinegger M, Soding J. MMseqs2 enables sensitive protein sequence searching for the analysis of massive data sets. Nat Biotechnol. 2017;35(11):1026–8.
pubmed: 29035372 doi: 10.1038/nbt.3988
Edgar RC. Muscle5: high-accuracy alignment ensembles enable unbiased assessments of sequence homology and phylogeny. Nat Commun. 2022;13(1):6968.
pubmed: 36379955 pmcid: 9664440 doi: 10.1038/s41467-022-34630-w
Zimmermann L, Stephens A, Nam SZ, Rau D, Kubler J, Lozajic M, Gabler F, Soding J, Lupas AN, Alva V. A completely reimplemented MPI Bioinformatics Toolkit with a new HHpred server at its core. J Mol Biol. 2018;430(15):2237–43.
pubmed: 29258817 doi: 10.1016/j.jmb.2017.12.007
Darling AC, Mau B, Blattner FR, Perna NT. Mauve: multiple alignment of conserved genomic sequence with rearrangements. Genome Res. 2004;14(7):1394–403.
pubmed: 15231754 pmcid: 442156 doi: 10.1101/gr.2289704
Marchler-Bauer A, Bo Y, Han L, He J, Lanczycki CJ, Lu S, Chitsaz F, Derbyshire MK, Geer RC, Gonzales NR, et al. CDD/SPARCLE: functional classification of proteins via subfamily domain architectures. Nucleic Acids Res. 2017;45(D1):D200–3.
pubmed: 27899674 doi: 10.1093/nar/gkw1129
Soding J. Protein homology detection by HMM-HMM comparison. Bioinformatics. 2005;21(7):951–60.
pubmed: 15531603 doi: 10.1093/bioinformatics/bti125
Minh BQ, Schmidt HA, Chernomor O, Schrempf D, Woodhams MD, von Haeseler A, Lanfear R. IQ-TREE 2: New models and efficient methods for phylogenetic inference in the genomic era. Mol Biol Evol. 2020;37(5):1530–4.
pubmed: 32011700 pmcid: 7182206 doi: 10.1093/molbev/msaa015
Tamura K, Stecher G, Kumar S. MEGA11: Molecular Evolutionary Genetics Analysis Version 11. Mol Biol Evol. 2021;38(7):3022–7.
pubmed: 33892491 pmcid: 8233496 doi: 10.1093/molbev/msab120
Mirdita M, Schutze K, Moriwaki Y, Heo L, Ovchinnikov S, Steinegger M. ColabFold: making protein folding accessible to all. Nat Methods. 2022;19(6):679–82.
pubmed: 35637307 pmcid: 9184281 doi: 10.1038/s41592-022-01488-1
Jumper J, Evans R, Pritzel A, Green T, Figurnov M, Ronneberger O, Tunyasuvunakool K, Bates R, Zidek A, Potapenko A, et al. Highly accurate protein structure prediction with AlphaFold. Nature. 2021;596(7873):583–9.
pubmed: 34265844 pmcid: 8371605 doi: 10.1038/s41586-021-03819-2
Holm L. DALI and the persistence of protein shape. Protein Sci. 2020;29(1):128–40.
pubmed: 31606894 doi: 10.1002/pro.3749
Holm L. Dali server: structural unification of protein families. Nucleic Acids Res. 2022;50(W1):W210–5.
pubmed: 35610055 pmcid: 9252788 doi: 10.1093/nar/gkac387
Pettersen EF, Goddard TD, Huang CC, Meng EC, Couch GS, Croll TI, Morris JH, Ferrin TE. UCSF ChimeraX: structure visualization for researchers, educators, and developers. Protein Sci. 2021;30(1):70–82.
pubmed: 32881101 doi: 10.1002/pro.3943
Suttle CA. Viruses in the sea. Nature. 2005;437(7057):356–61.
pubmed: 16163346 doi: 10.1038/nature04160
Mushegian AR. Are there 10(31) virus particles on Earth, or more, or fewer? J Bacteriol 2020, 202(9).
Stokar-Avihail A, Fedorenko T, Hor J, Garb J, Leavitt A, Millman A, Shulman G, Wojtania N, Melamed S, Amitai G, et al. Discovery of phage determinants that confer sensitivity to bacterial immune systems. Cell. 2023;186(9):1863–e18761816.
pubmed: 37030292 doi: 10.1016/j.cell.2023.02.029
Millman A, Melamed S, Leavitt A, Doron S, Bernheim A, Hor J, Garb J, Bechon N, Brandis A, Lopatina A, et al. An expanded arsenal of immune systems that protect bacteria from phages. Cell Host Microbe. 2022;30(11):1556–e15691555.
pubmed: 36302390 doi: 10.1016/j.chom.2022.09.017
Huiting E, Bondy-Denomy J. Defining the expanding mechanisms of phage-mediated activation of bacterial immunity. Curr Opin Microbiol. 2023;74:102325.
pubmed: 37178480 pmcid: 11080646 doi: 10.1016/j.mib.2023.102325
Samson JE, Belanger M, Moineau S. Effect of the abortive infection mechanism and type III toxin/antitoxin system AbiQ on the lytic cycle of Lactococcus lactis phages. J Bacteriol. 2013;195(17):3947–56.
pubmed: 23813728 pmcid: 3754610 doi: 10.1128/JB.00296-13
LeRoux M, Srikant S, Teodoro GIC, Zhang T, Littlehale ML, Doron S, Badiee M, Leung AKL, Sorek R, Laub MT. The DarTG toxin-antitoxin system provides phage defence by ADP-ribosylating viral DNA. Nat Microbiol. 2022;7(7):1028–40.
pubmed: 35725776 pmcid: 9250638 doi: 10.1038/s41564-022-01153-5
Gao LA, Wilkinson ME, Strecker J, Makarova KS, Macrae RK, Koonin EV, Zhang F. Prokaryotic innate immunity through pattern recognition of conserved viral proteins. Science. 2022;377(6607):eabm4096.
pubmed: 35951700 pmcid: 10028730 doi: 10.1126/science.abm4096
Kibby EM, Conte AN, Burroughs AM, Nagy TA, Vargas JA, Whalen LA, Aravind L, Whiteley AT. Bacterial NLR-related proteins protect against phage. Cell. 2023;186(11):2410–e24242418.
pubmed: 37160116 pmcid: 10294775 doi: 10.1016/j.cell.2023.04.015
Jackson SA, Birkholz N, Malone LM, Fineran PC. Imprecise Spacer Acquisition generates CRISPR-Cas Immune Diversity through Primed Adaptation. Cell Host Microbe. 2019;25(2):250–60. e254.
pubmed: 30661951 doi: 10.1016/j.chom.2018.12.014
Shiriaeva AA, Kuznedelov K, Fedorov I, Musharova O, Khvostikov T, Tsoy Y, Kurilovich E, Smith GR, Semenova E, Severinov K. Host nucleases generate prespacers for primed adaptation in the E. Coli type I-E CRISPR-Cas system. Sci Adv. 2022;8(47):eabn8650.
pubmed: 36427302 pmcid: 9699676 doi: 10.1126/sciadv.abn8650
Igler C, Huisman JS, Siedentop B, Bonhoeffer S, Lehtinen S. Plasmid co-infection: linking biological mechanisms to ecological and evolutionary dynamics. Philos Trans R Soc Lond B Biol Sci. 2022;377(1842):20200478.
pubmed: 34839701 doi: 10.1098/rstb.2020.0478
Pilosof S. Conceptualizing microbe-plasmid communities as complex adaptive systems. Trends Microbiol. 2023;31(7):672–80.
pubmed: 36822952 doi: 10.1016/j.tim.2023.01.007

Auteurs

Natalya Yutin (N)

National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA.

Igor Tolstoy (I)

National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA.

Pascal Mutz (P)

National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA.

Yuri I Wolf (YI)

National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA.

Mart Krupovic (M)

Archaeal Virology Unit, Institut Pasteur, Université Paris Cité, Paris, France.

Eugene V Koonin (EV)

National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA. koonin@ncbi.nlm.nih.gov.

Articles similaires

Genome, Chloroplast Phylogeny Genetic Markers Base Composition High-Throughput Nucleotide Sequencing
Animals Hemiptera Insect Proteins Phylogeny Insecticides
Amaryllidaceae Alkaloids Lycoris NADPH-Ferrihemoprotein Reductase Gene Expression Regulation, Plant Plant Proteins
Drought Resistance Gene Expression Profiling Gene Expression Regulation, Plant Gossypium Multigene Family

Classifications MeSH