Rapid protein evolution, organellar reductions, and invasive intronic elements in the marine aerobic parasite dinoflagellate Amoebophrya spp.
Dinoflagellate
Genome
Introner elements
Non-canonical introns
Parasite
Journal
BMC biology
ISSN: 1741-7007
Titre abrégé: BMC Biol
Pays: England
ID NLM: 101190720
Informations de publication
Date de publication:
06 01 2021
06 01 2021
Historique:
received:
24
05
2020
accepted:
12
11
2020
entrez:
7
1
2021
pubmed:
8
1
2021
medline:
1
10
2021
Statut:
epublish
Résumé
Dinoflagellates are aquatic protists particularly widespread in the oceans worldwide. Some are responsible for toxic blooms while others live in symbiotic relationships, either as mutualistic symbionts in corals or as parasites infecting other protists and animals. Dinoflagellates harbor atypically large genomes (~ 3 to 250 Gb), with gene organization and gene expression patterns very different from closely related apicomplexan parasites. Here we sequenced and analyzed the genomes of two early-diverging and co-occurring parasitic dinoflagellate Amoebophrya strains, to shed light on the emergence of such atypical genomic features, dinoflagellate evolution, and host specialization. We sequenced, assembled, and annotated high-quality genomes for two Amoebophrya strains (A25 and A120), using a combination of Illumina paired-end short-read and Oxford Nanopore Technology (ONT) MinION long-read sequencing approaches. We found a small number of transposable elements, along with short introns and intergenic regions, and a limited number of gene families, together contribute to the compactness of the Amoebophrya genomes, a feature potentially linked with parasitism. While the majority of Amoebophrya proteins (63.7% of A25 and 59.3% of A120) had no functional assignment, we found many orthologs shared with Dinophyceae. Our analyses revealed a strong tendency for genes encoded by unidirectional clusters and high levels of synteny conservation between the two genomes despite low interspecific protein sequence similarity, suggesting rapid protein evolution. Most strikingly, we identified a large portion of non-canonical introns, including repeated introns, displaying a broad variability of associated splicing motifs never observed among eukaryotes. Those introner elements appear to have the capacity to spread over their respective genomes in a manner similar to transposable elements. Finally, we confirmed the reduction of organelles observed in Amoebophrya spp., i.e., loss of the plastid, potential loss of a mitochondrial genome and functions. These results expand the range of atypical genome features found in basal dinoflagellates and raise questions regarding speciation and the evolutionary mechanisms at play while parastitism was selected for in this particular unicellular lineage.
Sections du résumé
BACKGROUND
Dinoflagellates are aquatic protists particularly widespread in the oceans worldwide. Some are responsible for toxic blooms while others live in symbiotic relationships, either as mutualistic symbionts in corals or as parasites infecting other protists and animals. Dinoflagellates harbor atypically large genomes (~ 3 to 250 Gb), with gene organization and gene expression patterns very different from closely related apicomplexan parasites. Here we sequenced and analyzed the genomes of two early-diverging and co-occurring parasitic dinoflagellate Amoebophrya strains, to shed light on the emergence of such atypical genomic features, dinoflagellate evolution, and host specialization.
RESULTS
We sequenced, assembled, and annotated high-quality genomes for two Amoebophrya strains (A25 and A120), using a combination of Illumina paired-end short-read and Oxford Nanopore Technology (ONT) MinION long-read sequencing approaches. We found a small number of transposable elements, along with short introns and intergenic regions, and a limited number of gene families, together contribute to the compactness of the Amoebophrya genomes, a feature potentially linked with parasitism. While the majority of Amoebophrya proteins (63.7% of A25 and 59.3% of A120) had no functional assignment, we found many orthologs shared with Dinophyceae. Our analyses revealed a strong tendency for genes encoded by unidirectional clusters and high levels of synteny conservation between the two genomes despite low interspecific protein sequence similarity, suggesting rapid protein evolution. Most strikingly, we identified a large portion of non-canonical introns, including repeated introns, displaying a broad variability of associated splicing motifs never observed among eukaryotes. Those introner elements appear to have the capacity to spread over their respective genomes in a manner similar to transposable elements. Finally, we confirmed the reduction of organelles observed in Amoebophrya spp., i.e., loss of the plastid, potential loss of a mitochondrial genome and functions.
CONCLUSION
These results expand the range of atypical genome features found in basal dinoflagellates and raise questions regarding speciation and the evolutionary mechanisms at play while parastitism was selected for in this particular unicellular lineage.
Identifiants
pubmed: 33407428
doi: 10.1186/s12915-020-00927-9
pii: 10.1186/s12915-020-00927-9
pmc: PMC7789003
doi:
Substances chimiques
DNA, Protozoan
0
Protozoan Proteins
0
Types de publication
Journal Article
Research Support, Non-U.S. Gov't
Langues
eng
Sous-ensembles de citation
IM
Pagination
1Commentaires et corrections
Type : ErratumIn
Références
PLoS Genet. 2014 Feb 06;10(2):e1004007
pubmed: 24516393
Bioessays. 2009 Feb;31(2):237-45
pubmed: 19204978
Science. 2010 Dec 3;330(6009):1381-5
pubmed: 21097902
PLoS One. 2011 Jan 31;6(1):e16526
pubmed: 21304975
Bioinformatics. 2006 Jul 1;22(13):1658-9
pubmed: 16731699
J Eukaryot Microbiol. 2007 Sep-Oct;54(5):427-35
pubmed: 17910687
Curr Biol. 2018 Aug 20;28(16):2570-2580.e6
pubmed: 30100341
Sci Rep. 2020 Feb 13;10(1):2531
pubmed: 32054950
J Eukaryot Microbiol. 2004 Mar-Apr;51(2):145-55
pubmed: 15134249
Science. 2004 Apr 16;304(5669):441-5
pubmed: 15044751
PLoS One. 2011;6(5):e19933
pubmed: 21629701
Curr Opin Microbiol. 2014 Aug;20:82-7
pubmed: 24934558
Trends Genet. 1997 Dec;13(12):497-8
pubmed: 9433140
Mol Biol Evol. 2013 Jan;30(1):123-39
pubmed: 22923466
Bioinformatics. 2012 Dec 15;28(24):3211-7
pubmed: 23071270
Gigascience. 2012 Dec 27;1(1):18
pubmed: 23587118
Front Microbiol. 2018 Oct 02;9:2251
pubmed: 30333799
J Comput Biol. 2006 Jun;13(5):1028-40
pubmed: 16796549
PLoS One. 2008 Aug 13;3(8):e2929
pubmed: 18698341
Bioinformatics. 2010 Mar 1;26(5):589-95
pubmed: 20080505
Elife. 2015 Jul 15;4:e06974
pubmed: 26175406
Genome Biol Evol. 2018 Jan 1;10(1):1-13
pubmed: 29202176
Nucleic Acids Res. 2003 Jan 1;31(1):234-6
pubmed: 12519989
PLoS One. 2015 Jun 01;10(6):e0127623
pubmed: 26030411
Nat Rev Genet. 2006 Mar;7(3):211-21
pubmed: 16485020
Genome Biol. 2004;5(2):R12
pubmed: 14759262
Bioinformatics. 2014 May 1;30(9):1236-40
pubmed: 24451626
Nucleic Acids Res. 2010 Jan;38(Database issue):D457-62
pubmed: 19843604
Mol Biochem Parasitol. 2004 Apr;134(2):183-91
pubmed: 15003838
Cytogenet Genome Res. 2005;110(1-4):462-7
pubmed: 16093699
Curr Biol. 2013 Aug 5;23(15):1399-408
pubmed: 23850284
Nat Rev Genet. 2016 Jul;17(7):407-421
pubmed: 27240813
Trends Genet. 2000 Jun;16(6):276-7
pubmed: 10827456
Bioinformatics. 2005 Jun;21 Suppl 1:i351-8
pubmed: 15961478
Genome Res. 2004 May;14(5):988-95
pubmed: 15123596
Genome Biol Evol. 2014 Mar;6(3):666-84
pubmed: 24572015
Nucleic Acids Res. 2011 May;39(9):3820-35
pubmed: 21245033
Science. 2015 Nov 6;350(6261):691-4
pubmed: 26542574
Science. 2008 Nov 21;322(5905):1254-7
pubmed: 19023082
BMC Biol. 2020 May 24;18(1):56
pubmed: 32448240
Proc Natl Acad Sci U S A. 2017 Jan 10;114(2):E171-E180
pubmed: 28028238
Wiley Interdiscip Rev RNA. 2013 Jan-Feb;4(1):61-76
pubmed: 23074130
Mol Biol Evol. 2010 Jan;27(1):7-10
pubmed: 19767348
Genome Res. 2011 Mar;21(3):487-93
pubmed: 21209072
Nucleic Acids Res. 1999 Jan 15;27(2):573-80
pubmed: 9862982
Proc Natl Acad Sci U S A. 2010 Jun 15;107(24):10949-54
pubmed: 20534454
Nucleic Acids Res. 2009 Jan;37(Database issue):D539-43
pubmed: 18957442
Curr Biol. 2019 Oct 7;29(19):3193-3199.e4
pubmed: 31543449
PLoS Comput Biol. 2005 Jul;1(2):166-75
pubmed: 16110336
Sci Rep. 2016 Dec 22;6:39734
pubmed: 28004835
Gigascience. 2017 Feb 1;6(2):1-13
pubmed: 28369459
Microorganisms. 2019 Jan 22;7(2):
pubmed: 30678153
Nucleic Acids Res. 2019 Jan 8;47(D1):D506-D515
pubmed: 30395287
Nat Commun. 2017 Nov 6;8(1):1326
pubmed: 29109544
PLoS One. 2013 Jun 11;8(6):e66347
pubmed: 23776661
Nat Genet. 2018 Apr;50(4):581-590
pubmed: 29507423
Environ Microbiol. 2008 Dec;10(12):3349-65
pubmed: 18771501
Comput Appl Biosci. 1997 Aug;13(4):477-8
pubmed: 9283765
Genome Biol Evol. 2012;4(1):59-72
pubmed: 22113794
Biol Rev Camb Philos Soc. 2018 Feb;93(1):201-222
pubmed: 28544184
J Phycol. 2020 Feb;56(1):6-10
pubmed: 31713873
Methods Mol Biol. 2019;1962:227-245
pubmed: 31020564
BMC Genomics. 2016 Mar 31;17:267
pubmed: 27029936
Proc Natl Acad Sci U S A. 2007 Mar 27;104(13):5608-13
pubmed: 17372224
J Eukaryot Microbiol. 2015 Sep-Oct;62(5):679-87
pubmed: 25963315
Nucleic Acids Res. 2016 Jan 4;44(D1):D457-62
pubmed: 26476454
Genome Res. 2003 Sep;13(9):2178-89
pubmed: 12952885
BMC Res Notes. 2017 Dec 4;10(1):667
pubmed: 29202864
Annu Rev Genet. 2007;41:331-68
pubmed: 18076328
PLoS Comput Biol. 2011 Sep;7(9):e1002150
pubmed: 21935348
BMC Bioinformatics. 2010 Aug 18;11:431
pubmed: 20718988
Bioinformatics. 2012 Apr 15;28(8):1086-92
pubmed: 22368243
Proc Natl Acad Sci U S A. 2007 Mar 13;104(11):4618-23
pubmed: 17360573
Genome Biol Evol. 2013;5(3):468-83
pubmed: 23395982
PLoS One. 2009 Sep 14;4(9):e6978
pubmed: 19750009
Genome Res. 2002 Apr;12(4):656-64
pubmed: 11932250
Cells. 2020 Feb 18;9(2):
pubmed: 32085510
Bioinformatics. 2014 Dec 1;30(23):3399-401
pubmed: 25143291
Proc Natl Acad Sci U S A. 2011 Jan 25;108(4):1513-8
pubmed: 21187386
Eukaryot Cell. 2006 Jun;5(6):924-34
pubmed: 16757740
Mol Biol Evol. 2015 May;32(5):1115-31
pubmed: 25660376
Nucleic Acids Res. 2002 Apr 1;30(7):1575-84
pubmed: 11917018
Nat Methods. 2015 Apr;12(4):357-60
pubmed: 25751142
Sci Rep. 2016 Jan 22;6:19688
pubmed: 26795595
Proc Natl Acad Sci U S A. 2015 Aug 18;112(33):10177-84
pubmed: 25814499
PLoS One. 2014 Nov 19;9(11):e112963
pubmed: 25409509
Sci Adv. 2019 Apr 24;5(4):eaav1110
pubmed: 31032404
Bioinformatics. 2011 Mar 15;27(6):764-70
pubmed: 21217122
BMC Biol. 2021 Jan 6;19(1):1
pubmed: 33407428
Plant Cell. 2017 Oct;29(10):2336-2348
pubmed: 29025960
Wiley Interdiscip Rev RNA. 2011 May-Jun;2(3):417-34
pubmed: 21957027
Commun Biol. 2018 Jul 17;1:95
pubmed: 30271976
BMC Genomics. 2012 Nov 09;13:603
pubmed: 23137308
PLoS Genet. 2018 Oct 26;14(10):e1007761
pubmed: 30365503
Genome. 2013 Sep;56(9):475-86
pubmed: 24168668
Nature. 2016 Oct 27;538(7626):533-536
pubmed: 27760113
Mob DNA. 2018 Jun 18;9:19
pubmed: 29946369
PLoS One. 2010 Mar 12;5(3):e9688
pubmed: 20300646
Trends Genet. 2008 Jul;24(7):328-35
pubmed: 18514360
Proc Natl Acad Sci U S A. 2015 May 5;112(18):5767-72
pubmed: 25902514
J Mol Biol. 1990 Oct 5;215(3):403-10
pubmed: 2231712
Nucleic Acids Res. 2004 Mar 19;32(5):1792-7
pubmed: 15034147
Curr Biol. 2008 Jul 8;18(13):R550-2
pubmed: 18606121
Proc Natl Acad Sci U S A. 2012 Sep 25;109(39):15793-8
pubmed: 23019363
J Hum Genet. 2016 May;61(5):463-6
pubmed: 26763876