An improved genome assembly of the fluke Schistosoma japonicum.


Journal

PLoS neglected tropical diseases
ISSN: 1935-2735
Titre abrégé: PLoS Negl Trop Dis
Pays: United States
ID NLM: 101291488

Informations de publication

Date de publication:
08 2019
Historique:
received: 24 03 2019
accepted: 08 07 2019
entrez: 8 8 2019
pubmed: 8 8 2019
medline: 14 1 2020
Statut: epublish

Résumé

Schistosoma japonicum is a parasitic flatworm that causes human schistosomiasis, which is a significant cause of morbidity in China and the Philippines. A single draft genome was available for S. japonicum, yet this assembly is very fragmented and only covers 90% of the genome, which make it difficult to be applied as a reference in functional genome analysis and genes discovery. In this study, we present a high-quality assembly of the fluke S. japonicum genome by combining 20 G (~53X) long single molecule real time sequencing reads with 80 G (~ 213X) Illumina paired-end reads. This improved genome assembly is approximately 370.5 Mb, with contig and scaffold N50 length of 871.9 kb and 1.09 Mb, representing 142.4-fold and 6.2-fold improvement over the released WGS-based assembly, respectively. Additionally, our assembly captured 85.2% complete and 4.6% partial eukaryotic Benchmarking Universal Single-Copy Orthologs. Repetitive elements account for 46.80% of the genome, and 10,089 of the protein-coding genes were predicted from the improved genome, of which 96.5% have been functionally annotated. Lastly, using the improved assembly, we identified 20 significantly expanded gene families in S. japonicum, and those genes were primarily enriched in functions of proteolysis and protein glycosylation. Using the combination of PacBio and Illumina Sequencing technologies, we provided an improved high-quality genome of S. japonicum. This improved genome assembly, as well as the annotation, will be useful for the comparative genomics of the flukes and more importantly facilitate the molecular studies of this important parasite in the future.

Sections du résumé

BACKGROUND
Schistosoma japonicum is a parasitic flatworm that causes human schistosomiasis, which is a significant cause of morbidity in China and the Philippines. A single draft genome was available for S. japonicum, yet this assembly is very fragmented and only covers 90% of the genome, which make it difficult to be applied as a reference in functional genome analysis and genes discovery.
FINDINGS
In this study, we present a high-quality assembly of the fluke S. japonicum genome by combining 20 G (~53X) long single molecule real time sequencing reads with 80 G (~ 213X) Illumina paired-end reads. This improved genome assembly is approximately 370.5 Mb, with contig and scaffold N50 length of 871.9 kb and 1.09 Mb, representing 142.4-fold and 6.2-fold improvement over the released WGS-based assembly, respectively. Additionally, our assembly captured 85.2% complete and 4.6% partial eukaryotic Benchmarking Universal Single-Copy Orthologs. Repetitive elements account for 46.80% of the genome, and 10,089 of the protein-coding genes were predicted from the improved genome, of which 96.5% have been functionally annotated. Lastly, using the improved assembly, we identified 20 significantly expanded gene families in S. japonicum, and those genes were primarily enriched in functions of proteolysis and protein glycosylation.
CONCLUSIONS
Using the combination of PacBio and Illumina Sequencing technologies, we provided an improved high-quality genome of S. japonicum. This improved genome assembly, as well as the annotation, will be useful for the comparative genomics of the flukes and more importantly facilitate the molecular studies of this important parasite in the future.

Identifiants

pubmed: 31390359
doi: 10.1371/journal.pntd.0007612
pii: PNTD-D-19-00496
pmc: PMC6685614
doi:

Substances chimiques

Proteins 0

Types de publication

Journal Article Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

e0007612

Déclaration de conflit d'intérêts

The authors have declared that no competing interests exist.

Références

Parasit Vectors. 2011 Jul 07;4:131
pubmed: 21736723
Lancet. 2017 Sep 16;390(10100):1211-1259
pubmed: 28919117
Bioinformatics. 2016 Jul 15;32(14):2103-10
pubmed: 27153593
Clin Microbiol Rev. 2015 Oct;28(4):939-67
pubmed: 26224883
Mol Biochem Parasitol. 2017 Jul;215:2-10
pubmed: 27899279
Int J Plant Genomics. 2008;2008:619832
pubmed: 18483572
Bioinformatics. 2009 Jul 15;25(14):1754-60
pubmed: 19451168
Mob DNA. 2015 Jun 02;6:11
pubmed: 26045719
Nucleic Acids Res. 2006 Jul 1;34(Web Server issue):W435-9
pubmed: 16845043
Parasit Vectors. 2019 Aug 23;12(1):414
pubmed: 31443730
PLoS Comput Biol. 2013;9(8):e1003118
pubmed: 23950696
Nature. 2013 Apr 04;496(7443):57-63
pubmed: 23485966
Parasite Immunol. 2012 Feb-Mar;34(2-3):100-7
pubmed: 21707658
Nature. 1994 Sep 15;371(6494):215-20
pubmed: 8078581
Bioinformatics. 2017 Jul 15;33(14):2202-2204
pubmed: 28369201
Gigascience. 2018 Jun 1;7(6):
pubmed: 29893829
J Infect Dis. 2009 Mar 15;199(6):904-12
pubmed: 19434933
Mol Biol Evol. 2013 Apr;30(4):772-80
pubmed: 23329690
Nucleic Acids Res. 2007 Jan;35(Database issue):D61-5
pubmed: 17130148
Nature. 2017 Jun 22;546(7659):524-527
pubmed: 28605751
Nat Biotechnol. 2011 May 15;29(7):644-52
pubmed: 21572440
BMC Genomics. 2018 Mar 2;19(1):175
pubmed: 29499650
Nature. 2009 Jul 16;460(7253):345-51
pubmed: 19606140
PLoS Negl Trop Dis. 2012 Jan;6(1):e1455
pubmed: 22253936
OMICS. 2012 May;16(5):284-7
pubmed: 22455463
Bioinformatics. 2014 May 1;30(9):1236-40
pubmed: 24451626
Bioinformatics. 2014 Aug 1;30(15):2114-20
pubmed: 24695404
Nucleic Acids Res. 2012 Jan;40(1):37-52
pubmed: 21911355
Front Immunol. 2013 Aug 28;4:240
pubmed: 24009607
Nucleic Acids Res. 1997 Mar 1;25(5):955-64
pubmed: 9023104
Parasitol Today. 1999 Jun;15(6):214-5
pubmed: 10366824
Nucleic Acids Res. 2011 Jul;39(Web Server issue):W316-22
pubmed: 21715386
Wiley Interdiscip Rev RNA. 2013 Jan-Feb;4(1):93-105
pubmed: 23139082
Genome Biol. 2011 Oct 24;12(10):R107
pubmed: 22023798
Int J Parasitol. 2011 Apr;41(5):523-32
pubmed: 21236260
Nat Genet. 2000 May;25(1):25-9
pubmed: 10802651
Nucleic Acids Res. 2007 Jul;35(Web Server issue):W182-5
pubmed: 17526522
Acta Trop. 2005 Nov-Dec;96(2-3):97-105
pubmed: 16125655
Proc Natl Acad Sci U S A. 2017 Sep 19;114(38):10214-10219
pubmed: 28874579
Bioinformatics. 2015 Oct 1;31(19):3210-2
pubmed: 26059717
Nucleic Acids Res. 2005 Jul 1;33(Web Server issue):W451-4
pubmed: 15980510
Nat Genet. 2012 Jan 15;44(2):221-5
pubmed: 22246508
Bioinformatics. 2008 Mar 1;24(5):715-6
pubmed: 18227120
Lancet. 2014 Jun 28;383(9936):2253-64
pubmed: 24698483
Nucleic Acids Res. 2003 Oct 1;31(19):5654-66
pubmed: 14500829
PLoS Negl Trop Dis. 2015 Aug 18;9(8):e0003993
pubmed: 26285138
Nat Biotechnol. 2015 Mar;33(3):290-5
pubmed: 25690850
Curr Top Microbiol Immunol. 2008;326:17-37
pubmed: 18630745
Nucleic Acids Res. 2017 Jan 4;45(D1):D353-D361
pubmed: 27899662
Nucleic Acids Res. 2014 Jan;42(Database issue):D222-30
pubmed: 24288371
Nucleic Acids Res. 2006 Jul 1;34(Web Server issue):W720-4
pubmed: 16845106
Nucleic Acids Res. 2011 May;39(10):e68
pubmed: 21398631
Biochimie. 2018 Mar;146:73-78
pubmed: 29196110
Genome Res. 2003 Sep;13(9):2178-89
pubmed: 12952885
Nucleic Acids Res. 2003 Jan 1;31(1):365-70
pubmed: 12520024
BMC Genomics. 2006 Dec 28;7:327
pubmed: 17194304
Zhongguo Xue Xi Chong Bing Fang Zhi Za Zhi. 2017 Dec 28;29(6):669-677
pubmed: 29469441
Front Genet. 2014 Aug 05;5:262
pubmed: 25147556
Parasitology. 2007 Dec;134(Pt.14):2009-20
pubmed: 17822572
Curr Protoc Bioinformatics. 2009 Mar;Chapter 4:4.10.1-4.10.14
pubmed: 19274634
Nat Rev Dis Primers. 2018 Aug 9;4(1):13
pubmed: 30093684
Bioinformatics. 2013 Apr 15;29(8):1072-5
pubmed: 23422339
Genome Res. 2017 May;27(5):778-786
pubmed: 28159771
Bioinformatics. 2009 Aug 1;25(15):1972-3
pubmed: 19505945
Infect Dis Poverty. 2017 Mar 15;6(1):55
pubmed: 28292327
Gigascience. 2019 Jan 1;8(1):
pubmed: 30520948
Lancet Infect Dis. 2006 Jul;6(7):411-25
pubmed: 16790382
Nat Genet. 2013 Oct;45(10):1168-75
pubmed: 24013640
Nat Methods. 2015 Apr;12(4):357-60
pubmed: 25751142
PLoS One. 2014 Nov 19;9(11):e112963
pubmed: 25409509
Genome Biol. 2013 Aug 30;14(8):R93
pubmed: 24000942
Genome Biol. 2008 Jan 11;9(1):R7
pubmed: 18190707
Nucleic Acids Res. 2017 Jan 4;45(D1):D331-D338
pubmed: 27899567
Mol Biol Evol. 2013 Aug;30(8):1987-97
pubmed: 23709260
Nature. 2009 Jul 16;460(7253):352-8
pubmed: 19606141
J Exp Med. 2009 Aug 3;206(8):1681-90
pubmed: 19635859
Bioinformatics. 2009 Aug 15;25(16):2078-9
pubmed: 19505943
Curr Protoc Bioinformatics. 2018 Jun;62(1):e51
pubmed: 29927072
Bioinformatics. 2013 Nov 15;29(22):2933-5
pubmed: 24008419
BMC Bioinformatics. 2009 Dec 15;10:421
pubmed: 20003500
Nat Commun. 2014 Jul 09;5:4378
pubmed: 25007141
Nucleic Acids Res. 2007;35(9):3100-8
pubmed: 17452365
Trends Parasitol. 2001 Jul;17(7):320-4
pubmed: 11423374
BMC Bioinformatics. 2004 May 14;5:59
pubmed: 15144565

Auteurs

Fang Luo (F)

Department of infectious diseases, Huashan Hospital, State Key Laboratory of Genetic Engineering, Ministry of Education Key Laboratory for Biodiversity Science and Ecological Engineering, Ministry of Education Key Laboratory of Contemporary Anthropology, School of Life Sciences, Fudan University, Shanghai, China.

Mingbo Yin (M)

Department of infectious diseases, Huashan Hospital, State Key Laboratory of Genetic Engineering, Ministry of Education Key Laboratory for Biodiversity Science and Ecological Engineering, Ministry of Education Key Laboratory of Contemporary Anthropology, School of Life Sciences, Fudan University, Shanghai, China.
National Institute of Parasitic Diseases, Chinese Center for Disease Control and Prevention, Key Laboratory of Parasite and Vector Biology of China Ministry of Health, WHO Collaborating Centre for Tropical Diseases, Joint Research Laboratory of Genetics and Ecology on Parasite-host Interaction, Chinese Center for Disease Control and Prevention & Fudan University, Shanghai, China.

Xiaojin Mo (X)

National Institute of Parasitic Diseases, Chinese Center for Disease Control and Prevention, Key Laboratory of Parasite and Vector Biology of China Ministry of Health, WHO Collaborating Centre for Tropical Diseases, Joint Research Laboratory of Genetics and Ecology on Parasite-host Interaction, Chinese Center for Disease Control and Prevention & Fudan University, Shanghai, China.

Chengsong Sun (C)

Department of infectious diseases, Huashan Hospital, State Key Laboratory of Genetic Engineering, Ministry of Education Key Laboratory for Biodiversity Science and Ecological Engineering, Ministry of Education Key Laboratory of Contemporary Anthropology, School of Life Sciences, Fudan University, Shanghai, China.

Qunfeng Wu (Q)

Department of infectious diseases, Huashan Hospital, State Key Laboratory of Genetic Engineering, Ministry of Education Key Laboratory for Biodiversity Science and Ecological Engineering, Ministry of Education Key Laboratory of Contemporary Anthropology, School of Life Sciences, Fudan University, Shanghai, China.

Bingkuan Zhu (B)

Department of infectious diseases, Huashan Hospital, State Key Laboratory of Genetic Engineering, Ministry of Education Key Laboratory for Biodiversity Science and Ecological Engineering, Ministry of Education Key Laboratory of Contemporary Anthropology, School of Life Sciences, Fudan University, Shanghai, China.

Manyu Xiang (M)

Department of infectious diseases, Huashan Hospital, State Key Laboratory of Genetic Engineering, Ministry of Education Key Laboratory for Biodiversity Science and Ecological Engineering, Ministry of Education Key Laboratory of Contemporary Anthropology, School of Life Sciences, Fudan University, Shanghai, China.

Jipeng Wang (J)

Department of infectious diseases, Huashan Hospital, State Key Laboratory of Genetic Engineering, Ministry of Education Key Laboratory for Biodiversity Science and Ecological Engineering, Ministry of Education Key Laboratory of Contemporary Anthropology, School of Life Sciences, Fudan University, Shanghai, China.

Yi Wang (Y)

Department of infectious diseases, Huashan Hospital, State Key Laboratory of Genetic Engineering, Ministry of Education Key Laboratory for Biodiversity Science and Ecological Engineering, Ministry of Education Key Laboratory of Contemporary Anthropology, School of Life Sciences, Fudan University, Shanghai, China.

Jian Li (J)

Department of infectious diseases, Huashan Hospital, State Key Laboratory of Genetic Engineering, Ministry of Education Key Laboratory for Biodiversity Science and Ecological Engineering, Ministry of Education Key Laboratory of Contemporary Anthropology, School of Life Sciences, Fudan University, Shanghai, China.

Ting Zhang (T)

National Institute of Parasitic Diseases, Chinese Center for Disease Control and Prevention, Key Laboratory of Parasite and Vector Biology of China Ministry of Health, WHO Collaborating Centre for Tropical Diseases, Joint Research Laboratory of Genetics and Ecology on Parasite-host Interaction, Chinese Center for Disease Control and Prevention & Fudan University, Shanghai, China.

Bin Xu (B)

National Institute of Parasitic Diseases, Chinese Center for Disease Control and Prevention, Key Laboratory of Parasite and Vector Biology of China Ministry of Health, WHO Collaborating Centre for Tropical Diseases, Joint Research Laboratory of Genetics and Ecology on Parasite-host Interaction, Chinese Center for Disease Control and Prevention & Fudan University, Shanghai, China.

Huajun Zheng (H)

Shanghai-MOST Key Laboratory of Health and Disease Genomics, Chinese National Human Genome Center at Shanghai, Shanghai, China.

Zheng Feng (Z)

National Institute of Parasitic Diseases, Chinese Center for Disease Control and Prevention, Key Laboratory of Parasite and Vector Biology of China Ministry of Health, WHO Collaborating Centre for Tropical Diseases, Joint Research Laboratory of Genetics and Ecology on Parasite-host Interaction, Chinese Center for Disease Control and Prevention & Fudan University, Shanghai, China.

Wei Hu (W)

Department of infectious diseases, Huashan Hospital, State Key Laboratory of Genetic Engineering, Ministry of Education Key Laboratory for Biodiversity Science and Ecological Engineering, Ministry of Education Key Laboratory of Contemporary Anthropology, School of Life Sciences, Fudan University, Shanghai, China.
National Institute of Parasitic Diseases, Chinese Center for Disease Control and Prevention, Key Laboratory of Parasite and Vector Biology of China Ministry of Health, WHO Collaborating Centre for Tropical Diseases, Joint Research Laboratory of Genetics and Ecology on Parasite-host Interaction, Chinese Center for Disease Control and Prevention & Fudan University, Shanghai, China.

Articles similaires

Genome, Chloroplast Phylogeny Genetic Markers Base Composition High-Throughput Nucleotide Sequencing

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH