Large-scale network analysis captures biological features of bacterial plasmids.


Journal

Nature communications
ISSN: 2041-1723
Titre abrégé: Nat Commun
Pays: England
ID NLM: 101528555

Informations de publication

Date de publication:
15 05 2020
Historique:
received: 03 10 2019
accepted: 23 04 2020
entrez: 17 5 2020
pubmed: 18 5 2020
medline: 13 8 2020
Statut: epublish

Résumé

Many bacteria can exchange genetic material through horizontal gene transfer (HGT) mediated by plasmids and plasmid-borne transposable elements. Here, we study the population structure and dynamics of over 10,000 bacterial plasmids, by quantifying their genetic similarities and reconstructing a network based on their shared k-mer content. We use a community detection algorithm to assign plasmids into cliques, which correlate with plasmid gene content, bacterial host range, GC content, and existing classifications based on replicon and mobility (MOB) types. Further analysis of plasmid population structure allows us to uncover candidates for yet undescribed replicon genes, and to identify transposable elements as the main drivers of HGT at broad phylogenetic scales. Our work illustrates the potential of network-based analyses of the bacterial 'mobilome' and opens up the prospect of a natural, exhaustive classification framework for bacterial plasmids.

Identifiants

pubmed: 32415210
doi: 10.1038/s41467-020-16282-w
pii: 10.1038/s41467-020-16282-w
pmc: PMC7229196
doi:

Types de publication

Journal Article Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

2452

Subventions

Organisme : Medical Research Council
ID : MR/P007597/1
Pays : United Kingdom
Organisme : Wellcome Trust
ID : 19RX03
Pays : United Kingdom
Organisme : Biotechnology and Biological Sciences Research Council
ID : BB/R01356X/1
Pays : United Kingdom
Organisme : Department of Health
Pays : United Kingdom

Références

Shintani, M. & Suzuki, H. in DNA Traffic in the Environment 109–133 (Springer, Singapore, 2019).
Von Wintersdorff, C. J. H. et al. Dissemination of antimicrobial resistance in microbial ecosystems through horizontal gene transfer. Front. Microbiol. 7, https://doi.org/10.3389/fmicb.2016.00173 (2016).
Halary, S., Leigh, J. W., Cheaib, B., Lopez, P. & Bapteste, E. Network analyses structure genetic diversity in independent genetic worlds. Proc. Natl Acad. Sci. USA 107, 127–132 (2010).
pubmed: 20007769 doi: 10.1073/pnas.0908978107
Stokes, H. W. & Gillings, M. R. Gene flow, mobile genetic elements and the recruitment of antibiotic resistance genes into Gram-negative pathogens. FEMS Microbiol. Rev. 35, 790–819 (2011).
pubmed: 21517914 doi: 10.1111/j.1574-6976.2011.00273.x
Carattoli, A. et al. Identification of plasmids by PCR-based replicon typing. J. Microbiol. Methods 63, 219–228 (2005).
pubmed: 15935499 doi: 10.1016/j.mimet.2005.03.018
Garcillán-Barcia, M. P., Francia, M. V. & de La Cruz, F. The diversity of conjugative relaxases and its application in plasmid classification. FEMS Microbiol. Rev. 33, 657–687 (2009).
pubmed: 19396961 doi: 10.1111/j.1574-6976.2009.00168.x
Shintani, M., Sanchez, Z. K. & Kimbara, K. Genomics of microbial plasmids: classification and identification based on replication and transfer systems and host taxonomy. Front. Microbiol. 6, 1–16 (2015).
doi: 10.3389/fmicb.2015.00242
Orlek, A. et al. Plasmid classification in an era of whole-genome sequencing: application in studies of antibiotic resistance epidemiology. Front. Microbiol. 8, 1–10 (2017).
doi: 10.3389/fmicb.2017.00182
Orlek, A. et al. Ordering the mob: insights into replicon and MOB typing schemes from analysis of a curated dataset of publicly available plasmids. Plasmid 91, 42–52 (2017).
pubmed: 28286183 pmcid: 5466382 doi: 10.1016/j.plasmid.2017.03.002
Lozano, C. et al. Expansion of a plasmid classification system for Gram-positive bacteria and determination of the diversity of plasmids in Staphylococcus aureus strains of human, animal, and food origins. Appl. Environ. Microbiol. 78, 5948–5955 (2012).
pubmed: 22685157 pmcid: 3406130 doi: 10.1128/AEM.00870-12
Jensen, L. B. et al. A classification system for plasmids from enterococci and other Gram-positive bacteria. J. Microbiol. Methods 80, 25–43 (2010).
pubmed: 19879906 doi: 10.1016/j.mimet.2009.10.012
Carattoli, A. et al. In silico detection typing plasmids using PlasmidFinder plasmid multilocus sequence typing. Antimicrob. Agents Chemother. 58, 3895–3903 (2014).
pubmed: 24777092 pmcid: 4068535 doi: 10.1128/AAC.02412-14
Rozwandowicz, M. et al. Plasmids of distinct IncK lineages show compatible phenotypes. Antimicrob. Agents Chemother. 61, e01954–16 (2017).
pubmed: 28052854 pmcid: 5328535 doi: 10.1128/AAC.01954-16
Ambrose, S. J., Harmer, C. J. & Hall, R. M. Compatibility and entry exclusion of IncA and IncC plasmids revisited: IncA and IncC plasmids are compatible. Plasmid 96–97, 7–12 (2018).
pubmed: 29486211 doi: 10.1016/j.plasmid.2018.02.002
Robertson, J. & Nash, J. H. E. MOB-suite: software tools for clustering, reconstruction and typing of plasmids from draft assemblies. Microb. Genom. 4, https://doi.org/10.1099/mgen.0.00020 (2018).
Bapteste, E. et al. Prokaryotic evolution and the tree of life are two different things. Biol. Direct 4, 34 (2009).
pubmed: 19788731 pmcid: 2761302 doi: 10.1186/1745-6150-4-34
Brilli, M. et al. Analysis of plasmid genes by phylogenetic profiling and visualization of homology relationships using Blast2Network. BMC Bioinforma. 9, 551 (2008).
doi: 10.1186/1471-2105-9-551
Corel, E., Lopez, P., Méheust, R. & Bapteste, E. Network-Thinking: graphs to analyze microbial complexity and evolution. Trends Microbiol. 24, 224–237 (2016).
pubmed: 26774999 pmcid: 4766943 doi: 10.1016/j.tim.2015.12.003
Bernard, G., Greenfield, P., Ragan, M. A. & Chan, C. X. k-mer Similarity, networks of microbial genomes, and taxonomic rank. mSystems 3, https://doi.org/10.1128/mSystems.00257-18 (2018).
Dagan, T., Artzy-Randrup, Y. & Martin, W. Modular networks and cumulative impact of lateral transfer in prokaryote genome evolution. Proc. Natl Acad. Sci. USA 105, 10039–10044 (2008).
pubmed: 18632554 doi: 10.1073/pnas.0800679105 pmcid: 18632554
Tamminen, M., Virta, M., Fani, R. & Fondi, M. Large-scale analysis of plasmid relationships through gene-sharing networks. Mol. Biol. Evol. 29, 1225–1240 (2012).
pubmed: 22130968 doi: 10.1093/molbev/msr292 pmcid: 22130968
Yamashita, A. et al. Characterization of antimicrobial resistance dissemination across plasmid communities classified by network analysis. Pathogens 3, 356–376 (2014).
pubmed: 25437804 pmcid: 4243450 doi: 10.3390/pathogens3020356
Zielezinski, A., Vinga, S., Almeida, J. & Karlowski, W. M. Alignment-free sequence comparison: benefits, applications, and tools. Genome Biol. 18, 1–17 (2017).
doi: 10.1186/s13059-017-1319-7
Bernard, G. et al. Alignment-free inference of hierarchical and reticulate phylogenomic relationships. Brief Bioinform. 20, 426–435 (2019).
pubmed: 28673025 doi: 10.1093/bib/bbx067
Ren, J. et al. Alignment-free sequence analysis and applications. Annu. Rev. Biomed. Data Sci. 1, 93–114 (2018).
pubmed: 31828235 pmcid: 6905628 doi: 10.1146/annurev-biodatasci-080917-013431
Zielezinski, A. et al. Benchmarking of alignment-free sequence comparison methods. Genome Biol. 20, 144 (2019).
Jesus, T. F. et al. Plasmid ATLAS: plasmid visual analytics and identification in high-throughput sequencing data. Nucleic Acids Res. 47, D188–D194 (2019).
pubmed: 30395323 doi: 10.1093/nar/gky1073
Ondov, B. D. et al. Mash: fast genome and metagenome distance estimation using MinHash. Genome Biol. https://doi.org/10.1186/s13059-016-0997-x , 1–14 (2016).
Karp, R. M. in Complexity of Computer Computations 85–103 (Springer, US, 1972).
Lancichinetti, A., Radicchi, F., Ramasco, J. J. & Fortunato, S. Finding statistically significant communities in networks. PLoS ONE 6, e18961 (2011).
pubmed: 21559480 pmcid: 3084717 doi: 10.1371/journal.pone.0018961
Hric, D., Darst, R. K. & Fortunato, S. Community detection in networks: structural communities versus ground truth. Phys. Rev. E 90, 062805 (2014).
doi: 10.1103/PhysRevE.90.062805
Bron, C. & Kerbosch, J. Algorithm 457: finding all cliques of an undirected graph [H]. Commun. ACM 16, 575–577 (1973).
doi: 10.1145/362342.362367
Peixoto, T. P. The graph-tool python library. figshare https://doi.org/10.6084/m9.figshare.1164194 (2014).
Chikami, G. K., Guiney, D. G., Schmidhauser, T. J. & Helinski, D. R. Comparison of 10 IncP plasmids: homology in the regions involved in plasmid replication. J. Bacteriol. 162, 656–660 (1985).
pubmed: 2985542 pmcid: 218900 doi: 10.1128/JB.162.2.656-660.1985
Norberg, P., Bergström, M., Jethava, V., Dubhashi, D. & Hermansson, M. The IncP-1 plasmid backbone adapts to different host bacterial species and evolves through homologous recombination. Nat. Commun. 2, 1–11 (2011).
Altschul, S. F., Gish, W., Miller, W., Myers, E. W. & Lipman, D. J. Basic local alignment search tool. J. Mol. Biol. 215, 403–410 (1990).
doi: 10.1016/S0022-2836(05)80360-2 pubmed: 2231712 pmcid: 2231712
El-Gebali, S. et al. The Pfam protein families database in 2019. Nucleic Acids Res. 47, D427–D432 (2019).
pubmed: 30357350 doi: 10.1093/nar/gky995
del Solar, G., Giraldo, R., Ruiz-Echevarría, M. J., Espinosa, M. & Díaz-Orejas, R. Replication and control of circular bacterial plasmids. Microbiol. Mol. Biol. Rev. 62, 434–464 (1998).
pubmed: 9618448 pmcid: 98921 doi: 10.1128/MMBR.62.2.434-464.1998
Nishida, H. Comparative analyses of base compositions, DNA sizes, and dinucleotide frequency profiles in archaeal and bacterial chromosomes and plasmids. Int. J. Evol. Biol. 2012, 1–5 (2012).
Rizzatti, G., Lopetuso, L. R., Gibiino, G., Binda, C. & Gasbarrini, A. Proteobacteria: a common factor in human diseases. Biomed. Res. Int. 2017, 9351507 (2017).
pubmed: 29230419 pmcid: 5688358 doi: 10.1155/2017/9351507
Hu, H. et al. Novel plasmid and its variant harboring both a bla(NDM-1) gene and type IV secretion system in clinical isolates of Acinetobacter lwoffii. Antimicrob. Agents Chemother. 56, 1698–1702 (2012).
pubmed: 22290961 pmcid: 3318331 doi: 10.1128/AAC.06199-11
Campos, J. C. et al. Characterization of Tn3000, a transposon responsible for blaNDM-1 dissemination among Enterobacteriaceae in Brazil, Nepal, Morocco, and India. Antimicrob. Agents Chemother. 59, 7387–7395 (2015).
pubmed: 26392506 pmcid: 4649174 doi: 10.1128/AAC.01458-15
Smillie, C., Garcillán-Barcia, M. P., Francia, M. V., Rocha, E. P. C. & de la Cruz, F. Mobility of plasmids. Microbiol. Mol. Biol. Rev. 74, 434–452 (2010).
pubmed: 20805406 pmcid: 2937521 doi: 10.1128/MMBR.00020-10
Klappenbach, J. A. et al. DNA–DNA hybridization values and their relationship to whole-genome sequence similarities. Int. J. Syst. Evol. Microbiol. 57, 81–91 (2007).
pubmed: 17220447 doi: 10.1099/ijs.0.64483-0 pmcid: 17220447
O’Leary, N. A. et al. Reference sequence (RefSeq) database at NCBI: current status, taxonomic expansion, and functional annotation. Nucleic Acids Res. 44, D733–D745 (2016).
pubmed: 26553804 doi: 10.1093/nar/gkv1189 pmcid: 26553804
Huerta-Cepas, J., Serra, F. & Bork, P. ETE 3: reconstruction, analysis, and visualization of phylogenomic data. Mol. Biol. Evol. 33, 1635–1638 (2016).
pubmed: 26921390 pmcid: 4868116 doi: 10.1093/molbev/msw046
Orlek, A. et al. A curated dataset of complete Enterobacteriaceae plasmids compiled from the NCBI nucleotide database. Data Br. 12, 423–426 (2017).
doi: 10.1016/j.dib.2017.04.024
Seemann, T. Prokka: rapid prokaryotic genome annotation. Bioinformatics 30, 2068–2069 (2014).
pubmed: 24642063 pmcid: 24642063 doi: 10.1093/bioinformatics/btu153
Page, A. J. et al. Roary: rapid large-scale prokaryote pan genome analysis. Bioinformatics 31, 3691–3693 (2015).
pubmed: 26198102 pmcid: 4817141 doi: 10.1093/bioinformatics/btv421
Ashburner, M. et al. Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat. Genet. 25, 25–29 (2000).
pubmed: 3037419 pmcid: 3037419 doi: 10.1038/75556
Carbon, S. et al. The Gene Ontology Resource: 20 years and still GOing strong. Nucleic Acids Res. 47, D330–D338 (2019).
doi: 10.1093/nar/gky1055
Huang, H. et al. A comprehensive protein-centric ID mapping service for molecular data integration. Bioinformatics 27, 1190–1191 (2011).
pubmed: 21478197 pmcid: 3072559 doi: 10.1093/bioinformatics/btr101
Lima, T. et al. HAMAP: a database of completely sequenced microbial proteome sets and manually curated microbial protein families in UniProtKB/Swiss-Prot. Nucleic Acids Res. 37, D471–D478 (2009).
pubmed: 18849571 doi: 10.1093/nar/gkn661
Reinert, G., Chew, D., Sun, F. & Waterman, M. S. Alignment-free sequence comparison (I): statistics and power. J. Comput. Biol. 16, 1615–1634 (2009).
pubmed: 20001252 pmcid: 2818754 doi: 10.1089/cmb.2009.0198
Zhao, X. BinDash, software for fast genome distance estimation on a typical personal laptop. Bioinformatics 35, 671–673 (2019).
pubmed: 30052763 doi: 10.1093/bioinformatics/bty651
Shannon, P. et al. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 13, 2498–2504 (2003).
pubmed: 14597658 pmcid: 14597658 doi: 10.1101/gr.1239303
Fortunato, S. & Hric, D. Community detection in networks: a user guide. Phys. Rep. 659, 1–44 (2016).
doi: 10.1016/j.physrep.2016.09.002
Fred, A. L. N. & Jain, A. K. Robust data clustering. In 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings, Vol. 2, II-128–II–133 (IEEE Comput. Soc., Madison, 2003).

Auteurs

Mislav Acman (M)

UCL Genetics Institute, University College London, Gower Street, London, WC1E 6BT, UK. mislav.acman.17@ucl.ac.uk.

Lucy van Dorp (L)

UCL Genetics Institute, University College London, Gower Street, London, WC1E 6BT, UK.

Joanne M Santini (JM)

Institute of Structural and Molecular Biology, University College London, Gower Street, London, WC1E 6BT, UK.

Francois Balloux (F)

UCL Genetics Institute, University College London, Gower Street, London, WC1E 6BT, UK. f.balloux@ucl.ac.uk.

Articles similaires

Genome, Chloroplast Phylogeny Genetic Markers Base Composition High-Throughput Nucleotide Sequencing

Selecting optimal software code descriptors-The case of Java.

Yegor Bugayenko, Zamira Kholmatova, Artem Kruglov et al.
1.00
Software Algorithms Programming Languages
Animals Hemiptera Insect Proteins Phylogeny Insecticides
Populus Soil Microbiology Soil Microbiota Fungi

Classifications MeSH