Comparative genomic and transcriptomic analyses of transposable elements in polychaetous annelids highlight LTR retrotransposon diversity and evolution.

LTR-retrotransposons PIWI proteins Polychaetous annelids Transcriptoms Transposable elements

Journal

Mobile DNA
ISSN: 1759-8753
Titre abrégé: Mob DNA
Pays: England
ID NLM: 101519891

Informations de publication

Date de publication:
29 Oct 2021
Historique:
received: 10 06 2021
accepted: 08 10 2021
entrez: 30 10 2021
pubmed: 31 10 2021
medline: 31 10 2021
Statut: epublish

Résumé

With the expansion of high throughput sequencing, we now have access to a larger number of genome-wide studies analyzing the Transposable elements (TEs) composition in a wide variety of organisms. However, genomic analyses often remain too limited in number and diversity of species investigated to study in depth the dynamics and evolutionary success of the different types of TEs among metazoans. Therefore, we chose to investigate the use of transcriptomes to describe the diversity of TEs in phylogenetically related species by conducting the first comparative analysis of TEs in two groups of polychaetes and evaluate the diversity of TEs that might impact genomic evolution as a result of their mobility. We present a detailed analysis of TEs distribution in transcriptomes extracted from 15 polychaetes depending on the number of reads used during assembly, and also compare these results with additional TE scans on associated low-coverage genomes. We then characterized the clades defined by 1021 LTR-retrotransposon families identified in 26 species. Clade richness was highly dependent on the considered superfamily. Copia elements appear rare and are equally distributed in only three clades, GalEa, Hydra and CoMol. Among the eight BEL/Pao clades identified in annelids, two small clades within the Sailor lineage are new for science. We characterized 17 Gypsy clades of which only 4 are new; the C-clade largely dominates with a quarter of the families. Finally, all species also expressed for the majority two distinct transcripts encoding PIWI proteins, known to be involved in control of TEs mobilities. This study shows that the use of transcriptomes assembled from 40 million reads was sufficient to access to the diversity and proportion of the transposable elements compared to those obtained by low coverage sequencing. Among LTR-retrotransposons Gypsy elements were unequivocally dominant but results suggest that the number of Gypsy clades, although high, may be more limited than previously thought in metazoans. For BEL/Pao elements, the organization of clades within the Sailor lineage appears more difficult to establish clearly. The Copia elements remain rare and result from the evolutionary consistent success of the same three clades.

Sections du résumé

BACKGROUND BACKGROUND
With the expansion of high throughput sequencing, we now have access to a larger number of genome-wide studies analyzing the Transposable elements (TEs) composition in a wide variety of organisms. However, genomic analyses often remain too limited in number and diversity of species investigated to study in depth the dynamics and evolutionary success of the different types of TEs among metazoans. Therefore, we chose to investigate the use of transcriptomes to describe the diversity of TEs in phylogenetically related species by conducting the first comparative analysis of TEs in two groups of polychaetes and evaluate the diversity of TEs that might impact genomic evolution as a result of their mobility.
RESULTS RESULTS
We present a detailed analysis of TEs distribution in transcriptomes extracted from 15 polychaetes depending on the number of reads used during assembly, and also compare these results with additional TE scans on associated low-coverage genomes. We then characterized the clades defined by 1021 LTR-retrotransposon families identified in 26 species. Clade richness was highly dependent on the considered superfamily. Copia elements appear rare and are equally distributed in only three clades, GalEa, Hydra and CoMol. Among the eight BEL/Pao clades identified in annelids, two small clades within the Sailor lineage are new for science. We characterized 17 Gypsy clades of which only 4 are new; the C-clade largely dominates with a quarter of the families. Finally, all species also expressed for the majority two distinct transcripts encoding PIWI proteins, known to be involved in control of TEs mobilities.
CONCLUSIONS CONCLUSIONS
This study shows that the use of transcriptomes assembled from 40 million reads was sufficient to access to the diversity and proportion of the transposable elements compared to those obtained by low coverage sequencing. Among LTR-retrotransposons Gypsy elements were unequivocally dominant but results suggest that the number of Gypsy clades, although high, may be more limited than previously thought in metazoans. For BEL/Pao elements, the organization of clades within the Sailor lineage appears more difficult to establish clearly. The Copia elements remain rare and result from the evolutionary consistent success of the same three clades.

Identifiants

pubmed: 34715903
doi: 10.1186/s13100-021-00252-0
pii: 10.1186/s13100-021-00252-0
pmc: PMC8556966
doi:

Types de publication

Journal Article

Langues

eng

Pagination

24

Informations de copyright

© 2021. The Author(s).

Références

Mol Biol Evol. 2013 Apr;30(4):772-80
pubmed: 23329690
PLoS One. 2013;8(3):e57675
pubmed: 23469217
J Virol. 2004 Mar;78(6):2967-78
pubmed: 14990715
BMC Biol. 2019 Nov 18;17(1):91
pubmed: 31739792
Mol Biol Evol. 2011 Oct;28(10):2731-9
pubmed: 21546353
Nucleic Acids Res. 2009 Nov;37(21):e143
pubmed: 19762481
Evodevo. 2011 May 05;2:10
pubmed: 21545709
Genome Biol Evol. 2015 Mar 11;7(4):1192-205
pubmed: 25767248
Mol Biol Evol. 2002 Feb;19(2):149-62
pubmed: 11801743
Nucleic Acids Res. 2011 Jan;39(Database issue):D70-4
pubmed: 21036865
Mob DNA. 2017 Apr 27;8:7
pubmed: 28465726
Plant J. 2001 Oct;28(2):159-68
pubmed: 11722759
Science. 2007 May 4;316(5825):744-7
pubmed: 17446352
Plant Mol Biol. 2000 Jan;42(1):251-69
pubmed: 10688140
BMC Genomics. 2009 Apr 29;10:203
pubmed: 19402907
Genome Res. 2011 Dec;21(12):2213-23
pubmed: 21903743
Bioinformatics. 2004 Jul 22;20(11):1806-7
pubmed: 14988107
Biol Direct. 2009 Nov 02;4:41
pubmed: 19883502
Dev Genes Evol. 2013 Sep;223(5):329-34
pubmed: 23609434
Mob DNA. 2015 Jun 02;6:11
pubmed: 26045719
Nature. 2006 Oct 5;443(7111):521-4
pubmed: 17024082
Genetics. 2010 Dec;186(4):1085-93
pubmed: 21156958
Dev Genes Evol. 2015 Jun;225(3):129-38
pubmed: 25772273
Plant Physiol. 2001 Sep;127(1):212-21
pubmed: 11553749
Mol Biol Evol. 2016 Aug;33(8):1937-56
pubmed: 27189548
RNA. 2003 Dec;9(12):1422-30
pubmed: 14623998
Bioinformatics. 2010 Mar 1;26(5):589-95
pubmed: 20080505
BMC Genomics. 2017 May 25;18(1):410
pubmed: 28545447
Nat Rev Genet. 2010 Jan;11(1):31-46
pubmed: 19997069
Cytogenet Genome Res. 2005;110(1-4):462-7
pubmed: 16093699
Nucleic Acids Res. 2014 May;42(9):e80
pubmed: 24634442
Syst Biol. 2006 Dec;55(6):875-85
pubmed: 17345670
BMC Evol Biol. 2005 Feb 23;5:20
pubmed: 15725362
Mol Biol Evol. 1987 Jul;4(4):406-25
pubmed: 3447015
Nat Ecol Evol. 2018 Jan;2(1):174-181
pubmed: 29203920
Insect Mol Biol. 2015 Oct;24(5):503-16
pubmed: 26079156
Genes Genet Syst. 2020 Jan 30;94(6):233-252
pubmed: 30416149
Cell. 2007 Mar 23;128(6):1089-103
pubmed: 17346786
BMC Genomics. 2013 Mar 12;14:167
pubmed: 23496952
Science. 2004 Mar 12;303(5664):1626-32
pubmed: 15016989
Heredity (Edinb). 2000 Aug;85 ( Pt 2):101-6
pubmed: 11012710
Cytometry A. 2003 Feb;51(2):127-8; author reply 129
pubmed: 12541287
BMC Evol Biol. 2011 Jun 04;11:154
pubmed: 21639932
Mob DNA. 2013 Aug 27;4(1):19
pubmed: 23981484
PLoS One. 2011;6(12):e29425
pubmed: 22242120
Cytogenet Genome Res. 2005;110(1-4):229-41
pubmed: 16093677
Curr Biol. 2012 Jun 5;22(11):R432-7
pubmed: 22677280
Nature. 2013 Jan 24;493(7433):526-31
pubmed: 23254933
BMC Genomics. 2018 Nov 15;19(1):821
pubmed: 30442098
Genetica. 2017 Jun;145(3):275-293
pubmed: 28424974
Gene. 2009 Dec 15;448(2):207-13
pubmed: 19651192
Curr Protoc Cytom. 2001 May;Chapter 11:Unit 11.12
pubmed: 18770686
RNA Biol. 2017 Dec 2;14(12):1727-1741
pubmed: 28783426
Evolution. 1985 Jul;39(4):783-791
pubmed: 28561359
Mol Genet Genomics. 2008 Jan;279(1):63-73
pubmed: 17929059
Nat Rev Genet. 2007 Dec;8(12):973-82
pubmed: 17984973
Virus Res. 2008 Jun;134(1-2):221-34
pubmed: 18261821
Front Zool. 2012 Nov 29;9(1):33
pubmed: 23190771
Science. 2015 Jan 2;347(6217):1258522
pubmed: 25554792
BMC Genomics. 2021 Jun 23;22(1):466
pubmed: 34157969
Cytogenet Genome Res. 2005;110(1-4):575-88
pubmed: 16093711
Dev Biol. 2007 Jun 15;306(2):599-611
pubmed: 17467683
Comput Appl Biosci. 1992 Jun;8(3):275-82
pubmed: 1633570

Auteurs

Jonathan Filée (J)

Laboratoire Evolution, Genomes, Comportement, Ecologie CNRS, Université Paris-Sud, IRD, Université Paris-Saclay, Gif-sur-Yvette, France.

Sarah Farhat (S)

Marine Animal Disease Laboratory, School of Marine and Atmospheric Sciences, Stony Brook University, 100 Nicolls Road, Stony Brook, NY, 11794-5000, USA.

Dominique Higuet (D)

Institut de Systématique, Evolution, Biodiversité (ISYEB) - Sorbonne Université, Muséum National d'Histoire Naturel, CNRS, EPHE, Université des Antilles, 7 quai Saint Bernard, 75252, Paris Cedex 05, France.

Laure Teysset (L)

Sorbonne Université, CNRS, Institut de Biologie Paris-Seine, Laboratoire Biologie du Développement, UMR7622, "Transgenerational Epigenetics & small RNA Biology", F-75005, Paris, France.

Dominique Marie (D)

Sorbonne Université, CNRS, UMR 7144 AD2M, Station Biologique de Roscoff, Place Georges Teissier, 29688, Roscoff, France.

Camille Thomas-Bulle (C)

Sorbonne Université, CNRS, UMR 7144 AD2M, Station Biologique de Roscoff, Place Georges Teissier, 29688, Roscoff, France.

Stephane Hourdez (S)

UMR8222 LECOB CNRS-Sorbonne Université, Observatoire Océanologique de Banyuls, 1 avenue Pierre Fabre, 66650, Banyuls-sur-Mer, France.

Didier Jollivet (D)

Sorbonne Université, CNRS, UMR 7144 AD2M, Station Biologique de Roscoff, Place Georges Teissier, 29688, Roscoff, France.

Eric Bonnivard (E)

Sorbonne Université, CNRS, UMR 7144 AD2M, Station Biologique de Roscoff, Place Georges Teissier, 29688, Roscoff, France. ebonnivard@sb-roscoff.fr.

Classifications MeSH