Deeplasmid: deep learning accurately separates plasmids from bacterial chromosomes.


Journal

Nucleic acids research
ISSN: 1362-4962
Titre abrégé: Nucleic Acids Res
Pays: England
ID NLM: 0411011

Informations de publication

Date de publication:
22 02 2022
Historique:
accepted: 23 10 2021
revised: 11 10 2021
received: 20 04 2021
pubmed: 7 12 2021
medline: 16 4 2022
entrez: 6 12 2021
Statut: ppublish

Résumé

Plasmids are mobile genetic elements that play a key role in microbial ecology and evolution by mediating horizontal transfer of important genes, such as antimicrobial resistance genes. Many microbial genomes have been sequenced by short read sequencers and have resulted in a mix of contigs that derive from plasmids or chromosomes. New tools that accurately identify plasmids are needed to elucidate new plasmid-borne genes of high biological importance. We have developed Deeplasmid, a deep learning tool for distinguishing plasmids from bacterial chromosomes based on the DNA sequence and its encoded biological data. It requires as input only assembled sequences generated by any sequencing platform and assembly algorithm and its runtime scales linearly with the number of assembled sequences. Deeplasmid achieves an AUC-ROC of over 89%, and it was more accurate than five other plasmid classification methods. Finally, as a proof of concept, we used Deeplasmid to predict new plasmids in the fish pathogen Yersinia ruckeri ATCC 29473 that has no annotated plasmids. Deeplasmid predicted with high reliability that a long assembled contig is part of a plasmid. Using long read sequencing we indeed validated the existence of a 102 kb long plasmid, demonstrating Deeplasmid's ability to detect novel plasmids.

Identifiants

pubmed: 34871418
pii: 6454267
doi: 10.1093/nar/gkab1115
pmc: PMC8860608
doi:

Types de publication

Journal Article Research Support, U.S. Gov't, Non-P.H.S.

Langues

eng

Sous-ensembles de citation

IM

Pagination

e17

Informations de copyright

© The Author(s) 2021. Published by Oxford University Press on behalf of Nucleic Acids Research.

Références

Nucleic Acids Res. 2021 Jan 8;49(D1):D751-D763
pubmed: 33119741
PeerJ. 2018 Apr 2;6:e4588
pubmed: 29629246
J Bacteriol. 1991 Sep;173(18):5861-8
pubmed: 1885553
EMBO J. 1983;2(1):93-8
pubmed: 11894917
FEMS Microbiol Rev. 2009 May;33(3):657-87
pubmed: 19396961
Nucleic Acids Res. 2019 Jan 8;47(D1):D666-D677
pubmed: 30289528
Mol Cell. 2009 Oct 9;36(1):121-30
pubmed: 19818715
Trends Microbiol. 2010 Apr;18(4):141-8
pubmed: 20080407
Bioinformatics. 2006 Jul 1;22(13):1658-9
pubmed: 16731699
Bioinformatics. 2016 Nov 15;32(22):3380-3387
pubmed: 27466620
Genome Res. 2019 Jun;29(6):961-968
pubmed: 31048319
FEMS Microbiol Rev. 1998;21:291-319
pubmed: 25508777
J Pediatr Gastroenterol Nutr. 1983;2(3):534-8
pubmed: 6352891
Annu Rev Microbiol. 1988;42:717-43
pubmed: 3060006
Nucleic Acids Res. 2007 Jan;35(Database issue):D61-5
pubmed: 17130148
Plasmid. 2017 May;91:42-52
pubmed: 28286183
Bioinformatics. 2017 Feb 15;33(4):475-482
pubmed: 28003256
Genome Res. 2012 Apr;22(4):802-9
pubmed: 22300632
J Bacteriol. 1984 Dec;160(3):1061-6
pubmed: 6542099
Science. 2007 Nov 30;318(5855):1449-52
pubmed: 17947550
Nucleic Acids Res. 2018 Apr 6;46(6):e35
pubmed: 29346586
Microbiol Mol Biol Rev. 2013 Jun;77(2):208-33
pubmed: 23699255
Cell Microbiol. 2010 Sep 1;12(9):1203-12
pubmed: 20642798
Lancet. 1998 Mar 14;351(9105):797-9
pubmed: 9519952
Bioinformatics. 2019 Nov 1;35(21):4436-4439
pubmed: 31116364
Front Cell Infect Microbiol. 2018 Oct 30;8:373
pubmed: 30460204
Trends Genet. 2002 Jun;18(6):291-4
pubmed: 12044357
J Bacteriol. 2010 Jul;192(13):3249-58
pubmed: 20363934
Genome Announc. 2014 Oct 23;2(5):
pubmed: 25342679
Genome Res. 2017 May;27(5):722-736
pubmed: 28298431
PLoS Genet. 2011 May;7(5):e1002073
pubmed: 21625564
Nat Biotechnol. 2020 Sep;38(9):1044-1053
pubmed: 32686750
Microb Genom. 2021 Apr;7(4):
pubmed: 33826492
Genome Announc. 2013 Nov 07;1(6):
pubmed: 24201202
Bioinformatics. 2020 Aug 1;36(15):4350-4352
pubmed: 32637988
Bioinformatics. 2010 Aug 15;26(16):2051-2
pubmed: 20538725
Bioinformatics. 2017 Dec 1;33(23):3796-3798
pubmed: 29036591
Mob Genet Elements. 2013 Sep 1;3(5):e26219
pubmed: 24251069
Neural Comput. 2000 Oct;12(10):2451-71
pubmed: 11032042
Antimicrob Agents Chemother. 2014 Jul;58(7):3895-903
pubmed: 24777092
Nucleic Acids Res. 2010 Jan;38(Database issue):D57-61
pubmed: 19933762
Microb Genom. 2017 Aug 18;3(10):e000128
pubmed: 29177087
Microbiol Mol Biol Rev. 2010 Sep;74(3):434-52
pubmed: 20805406
Nucleic Acids Res. 2012 Jan;40(Database issue):D115-22
pubmed: 22194640
Microb Genom. 2018 Nov;4(11):
pubmed: 30383524
Genome Announc. 2013 May 23;1(3):
pubmed: 23704173
J Bacteriol. 1986 Oct;168(1):123-31
pubmed: 3019995
J Bacteriol. 1975 Jul;123(1):255-64
pubmed: 1141196
Microb Genom. 2020 Oct;6(10):
pubmed: 32579097
Mol Microbiol. 2000 Aug;37(3):455-66
pubmed: 10931339
Bioinformatics. 2018 Sep 15;34(18):3094-3100
pubmed: 29750242
Mol Cell. 2018 Jun 7;70(5):768-784
pubmed: 29398446
Nucleic Acids Res. 2019 Jan 8;47(D1):D195-D202
pubmed: 30380090
Proc Natl Acad Sci U S A. 2002 Apr 16;99(8):5638-42
pubmed: 11943863
Microb Cell Fact. 2016 Jun 10;15(1):108
pubmed: 27286821
J Bacteriol. 1995 Dec;177(23):6766-72
pubmed: 7592466

Auteurs

William B Andreopoulos (WB)

Joint Genome Institute, US Department of Energy, LBNL Berkeley, CA, USA.
Department of Computer Science, San Jose State University, CA, USA.

Alexander M Geller (AM)

Department of Plant Pathology and Microbiology, The Institute of Environmental Science, The Robert H. Smith Faculty of Agriculture, Food, and Environment, The Hebrew University of Jerusalem, Rehovot, Israel.

Miriam Lucke (M)

Department of Plant Pathology and Microbiology, The Institute of Environmental Science, The Robert H. Smith Faculty of Agriculture, Food, and Environment, The Hebrew University of Jerusalem, Rehovot, Israel.

Jan Balewski (J)

National Energy Research Scientific Computing Center (NERSC), Berkeley, CA, USA.

Alicia Clum (A)

Joint Genome Institute, US Department of Energy, LBNL Berkeley, CA, USA.

Natalia N Ivanova (NN)

Joint Genome Institute, US Department of Energy, LBNL Berkeley, CA, USA.

Asaf Levy (A)

Department of Plant Pathology and Microbiology, The Institute of Environmental Science, The Robert H. Smith Faculty of Agriculture, Food, and Environment, The Hebrew University of Jerusalem, Rehovot, Israel.

Articles similaires

Genome, Chloroplast Phylogeny Genetic Markers Base Composition High-Throughput Nucleotide Sequencing
Robotic Surgical Procedures Animals Humans Telemedicine Models, Animal

Odour generalisation and detection dog training.

Lyn Caldicott, Thomas W Pike, Helen E Zulch et al.
1.00
Animals Odorants Dogs Generalization, Psychological Smell
Animals TOR Serine-Threonine Kinases Colorectal Neoplasms Colitis Mice

Classifications MeSH