Nanopore sequencing and de novo assembly of a misidentified Camelpox vaccine reveals putative epigenetic modifications and alternate protein signal peptides.


Journal

Scientific reports
ISSN: 2045-2322
Titre abrégé: Sci Rep
Pays: England
ID NLM: 101563288

Informations de publication

Date de publication:
07 09 2021
Historique:
received: 20 09 2020
accepted: 19 08 2021
entrez: 8 9 2021
pubmed: 9 9 2021
medline: 17 11 2021
Statut: epublish

Résumé

DNA viruses can exploit host cellular epigenetic processes to their advantage; however, the epigenome status of most DNA viruses remains undetermined. Third generation sequencing technologies allow for the identification of modified nucleotides from sequencing experiments without specialized sample preparation, permitting the detection of non-canonical epigenetic modifications that may distinguish viral nucleic acid from that of their host, thus identifying attractive targets for advanced therapeutics and diagnostics. We present a novel nanopore de novo assembly pipeline used to assemble a misidentified Camelpox vaccine. Two confirmed deletions of this vaccine strain in comparison to the closely related Vaccinia virus strain modified vaccinia Ankara make it one of the smallest non-vector derived orthopoxvirus genomes to be reported. Annotation of the assembly revealed a previously unreported signal peptide at the start of protein A38 and several predicted signal peptides that were found to differ from those previously described. Putative epigenetic modifications around various motifs have been identified and the assembly confirmed previous work showing the vaccine genome to most closely resemble that of Vaccinia virus strain Modified Vaccinia Ankara. The pipeline may be used for other DNA viruses, increasing the understanding of DNA virus evolution, virulence, host preference, and epigenomics.

Identifiants

pubmed: 34493784
doi: 10.1038/s41598-021-97158-x
pii: 10.1038/s41598-021-97158-x
pmc: PMC8423768
doi:

Substances chimiques

Protein Sorting Signals 0
Vaccines, Attenuated 0
Viral Proteins 0
Viral Vaccines 0

Types de publication

Comparative Study Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

17758

Informations de copyright

© 2021. The Author(s).

Références

Bioinformatics. 2015 Oct 15;31(20):3350-2
pubmed: 26099265
Virology. 2010 Jan 5;396(1):160-8
pubmed: 19900687
J Gen Virol. 2004 Jan;85(Pt 1):105-117
pubmed: 14718625
Virology. 2000 Dec 5;278(1):244-52
pubmed: 11112499
Nat Biotechnol. 2019 Apr;37(4):420-423
pubmed: 30778233
Pathogens. 2020 May 21;9(5):
pubmed: 32455727
Nat Commun. 2018 Jan 15;9(1):189
pubmed: 29335486
PLoS One. 2017 Mar 3;12(3):e0173056
pubmed: 28257484
Pharmacogenomics. 2004 Jun;5(4):433-8
pubmed: 15165179
Virology. 1977 May 1;78(1):57-75
pubmed: 860412
PLoS Pathog. 2011 Sep;7(9):e1002247
pubmed: 21931555
Nat Rev Microbiol. 2020 Oct;18(10):559-570
pubmed: 32533130
Cold Spring Harb Perspect Biol. 2013 Sep 01;5(9):
pubmed: 23838441
Dev Growth Differ. 2019 Jun;61(5):316-326
pubmed: 31037722
Virology. 2011 Sep 1;417(2):334-42
pubmed: 21752417
Genome Biol. 2004;5(2):R12
pubmed: 14759262
Bioinformatics. 2014 May 1;30(9):1236-40
pubmed: 24451626
Genome Biol. 2020 Jul 20;21(1):175
pubmed: 32684155
Bioinformatics. 2020 Jul 1;36(Suppl_1):i75-i83
pubmed: 32657355
Genome Biol. 2020 Feb 7;21(1):30
pubmed: 32033565
Proc Natl Acad Sci U S A. 1992 Mar 1;89(5):1827-31
pubmed: 1542678
Genome Biol. 2016 Nov 25;17(1):239
pubmed: 27887629
Nat Methods. 2019 May;16(5):429-436
pubmed: 31011185
Proc Natl Acad Sci U S A. 1996 Nov 26;93(24):13770-3
pubmed: 8943010
J Virol. 2000 Dec;74(23):11230-9
pubmed: 11070021
J Gen Virol. 2013 Sep;94(Pt 9):2070-2081
pubmed: 23761407
Gene. 1985;37(1-3):221-8
pubmed: 3902572
Gigascience. 2018 Dec 1;7(12):
pubmed: 30476066
Genome Res. 2017 May;27(5):737-746
pubmed: 28100585
Curr Opin Virol. 2015 Apr;11:31-7
pubmed: 25668758
Proc Int Conf Intell Syst Mol Biol. 1994;2:28-36
pubmed: 7584402
BMC Bioinformatics. 2018 Feb 17;19(1):55
pubmed: 29454313
Clin Epigenetics. 2019 Mar 27;11(1):55
pubmed: 30917875
Nat Biotechnol. 2019 May;37(5):540-546
pubmed: 30936562
RNA Biol. 2017 Sep 2;14(9):1089-1098
pubmed: 28277931
J Virol. 2010 Oct;84(19):9907-19
pubmed: 20668072
Biologicals. 2020 May;65:39-41
pubmed: 32334926
Virology. 2015 May;479-480:153-9
pubmed: 25742715
J Biol Chem. 2011 Jun 10;286(23):20727-35
pubmed: 21474453
J Gen Virol. 2008 Oct;89(Pt 10):2377-2387
pubmed: 18796705
Virology. 1999 Mar 1;255(1):190-204
pubmed: 10049834
Bioinformatics. 2018 Aug 1;34(15):2666-2669
pubmed: 29547981
Front Public Health. 2018 Sep 04;6:241
pubmed: 30234087
J Gen Virol. 2012 Oct;93(Pt 10):2098-2108
pubmed: 22791606
J Virol. 1996 Feb;70(2):905-14
pubmed: 8551630
Virology. 1995 Dec 1;214(1):177-88
pubmed: 8525613
Proc Natl Acad Sci U S A. 1977 Dec;74(12):5463-7
pubmed: 271968
Virology. 2004 Feb 5;319(1):1-11
pubmed: 14967483
Virology. 1993 Aug;195(2):732-44
pubmed: 8337842
J Gen Virol. 2002 Apr;83(Pt 4):855-872
pubmed: 11907336
Nat Methods. 2010 Jun;7(6):461-5
pubmed: 20453866
Nat Methods. 2017 Apr;14(4):407-410
pubmed: 28218898
BMC Bioinformatics. 2010 Mar 08;11:119
pubmed: 20211023
Nucleic Acids Res. 2005 Jan 1;33(Database issue):D311-6
pubmed: 15608205
Bioinformatics. 2009 Aug 15;25(16):2078-9
pubmed: 19505943
Virology. 2000 Feb 1;267(1):65-79
pubmed: 10648184
Proc Natl Acad Sci U S A. 2015 Sep 1;112(35):10908-13
pubmed: 26286988
Bioinformatics. 2018 Sep 15;34(18):3094-3100
pubmed: 29750242
Methods Mol Biol. 2019;1962:29-51
pubmed: 31020553
Sci Rep. 2020 Aug 14;10(1):13822
pubmed: 32796917
Viruses. 2010 Sep;2(9):1933-67
pubmed: 21994715
Genome Biol. 2007;8(2):R24
pubmed: 17324271
Eur J Cell Biol. 2018 Aug;97(6):422-441
pubmed: 29958716
Viruses. 2020 Jul 23;12(8):
pubmed: 32717784
Virology. 2008 Oct 25;380(2):226-33
pubmed: 18760436
BMC Bioinformatics. 2009 Dec 15;10:421
pubmed: 20003500
J Gen Virol. 2009 Jul;90(Pt 7):1582-1591
pubmed: 19264647
Virology. 1978 Jan;84(1):1-8
pubmed: 619484
Science. 2009 Jan 2;323(5910):133-8
pubmed: 19023044

Auteurs

Zack Saud (Z)

Department of Biosciences, College of Science, Swansea University, Singleton Park, Swansea, SA2 8PP, Wales, UK. zack.saud@swansea.ac.uk.

Matthew D Hitchings (MD)

Swansea University Medical School, Swansea University, Singleton Park, Swansea, Sa2 8PP, Wales, UK.

Tariq M Butt (TM)

Department of Biosciences, College of Science, Swansea University, Singleton Park, Swansea, SA2 8PP, Wales, UK.

Articles similaires

Genome, Chloroplast Phylogeny Genetic Markers Base Composition High-Throughput Nucleotide Sequencing

Selecting optimal software code descriptors-The case of Java.

Yegor Bugayenko, Zamira Kholmatova, Artem Kruglov et al.
1.00
Software Algorithms Programming Languages
Animals Hemiptera Insect Proteins Phylogeny Insecticides

Exploring blood-brain barrier passage using atomic weighted vector and machine learning.

Yoan Martínez-López, Paulina Phoobane, Yanaima Jauriga et al.
1.00
Blood-Brain Barrier Machine Learning Humans Support Vector Machine Software

Classifications MeSH