The sterlet sturgeon genome sequence and the mechanisms of segmental rediploidization.
Journal
Nature ecology & evolution
ISSN: 2397-334X
Titre abrégé: Nat Ecol Evol
Pays: England
ID NLM: 101698577
Informations de publication
Date de publication:
06 2020
06 2020
Historique:
received:
19
08
2019
accepted:
27
02
2020
pubmed:
2
4
2020
medline:
25
8
2020
entrez:
2
4
2020
Statut:
ppublish
Résumé
Sturgeons seem to be frozen in time. The archaic characteristics of this ancient fish lineage place it in a key phylogenetic position at the base of the ~30,000 modern teleost fish species. Moreover, sturgeons are notoriously polyploid, providing unique opportunities to investigate the evolution of polyploid genomes. We assembled a high-quality chromosome-level reference genome for the sterlet, Acipenser ruthenus. Our analysis revealed a very low protein evolution rate that is at least as slow as in other deep branches of the vertebrate tree, such as that of the coelacanth. We uncovered a whole-genome duplication that occurred in the Jurassic, early in the evolution of the entire sturgeon lineage. Following this polyploidization, the rediploidization of the genome included the loss of whole chromosomes in a segmental deduplication process. While known adaptive processes helped conserve a high degree of structural and functional tetraploidy over more than 180 million years, the reduction of redundancy of the polyploid genome seems to have been remarkably random.
Identifiants
pubmed: 32231327
doi: 10.1038/s41559-020-1166-x
pii: 10.1038/s41559-020-1166-x
pmc: PMC7269910
mid: NIHMS1585560
doi:
Types de publication
Journal Article
Research Support, Non-U.S. Gov't
Langues
eng
Sous-ensembles de citation
IM
Pagination
841-852Subventions
Organisme : NIH HHS
ID : R01 OD011116
Pays : United States
Références
Van de Peer, Y., Maere, S. & Meyer, A. The evolutionary significance of ancient genome duplications. Nat. Rev. Genet. 10, 725–732 (2009).
pubmed: 19652647
doi: 10.1038/nrg2600
Lynch, M. & Conery, J. S. The evolutionary fate and consequences of duplicate genes. Science 290, 1151–1155 (2000).
pubmed: 11073452
doi: 10.1126/science.290.5494.1151
Dehal, P. & Boore, J. L. Two rounds of whole genome duplication in the ancestral vertebrate. PLoS Biol. 3, e314 (2005).
pubmed: 16128622
pmcid: 1197285
doi: 10.1371/journal.pbio.0030314
Meyer, A. & Van de Peer, Y. From 2R to 3R: evidence for a fish‐specific genome duplication (FSGD). Bioessays 27, 937–945 (2005).
pubmed: 16108068
doi: 10.1002/bies.20293
Braasch, I. & Postlethwait, J. H. in Polyploidy and Genome Evolution (eds Soltis P. & Soltis, D.) 341–383 (Springer, 2012).
Symonová, R. et al. Molecular cytogenetic differentiation of paralogs of Hox paralogs in duplicated and re-diploidized genome of the North American paddlefish (Polyodon spathula). BMC Genet. 18, 19 (2017).
pubmed: 28253860
pmcid: 5335500
doi: 10.1186/s12863-017-0484-8
Havelka, M., Hulák, M., Bailie, D., Prodöhl, P. & Flajšhans, M. Extensive genome duplications in sturgeons: new evidence from microsatellite data. J. Appl. Ichthyol. 29, 704–708 (2013).
doi: 10.1111/jai.12224
Otto, S. P. The evolutionary consequences of polyploidy. Cell 131, 452–462 (2007).
pubmed: 17981114
doi: 10.1016/j.cell.2007.10.022
Force, A. et al. Preservation of duplicate genes by complementary, degenerative mutations. Genetics 151, 1531–1545 (1999).
pubmed: 10101175
pmcid: 1460548
Ludwig, A., Belfiore, N. M., Pitra, C., Svirsky, V. & Jenneckens, I. Genome duplication events and functional reduction of ploidy levels in sturgeon (Acipenser, Huso and Scaphirhynchus). Genetics 158, 1203–1215 (2001).
pubmed: 11454768
pmcid: 1461728
Wolfe, K. H. Yesterday’s polyploids and the mystery of diploidization. Nat. Rev. Genet. 2, 333–341 (2001).
pubmed: 11331899
doi: 10.1038/35072009
Rajkov, J., Shao, Z. & Berrebi, P. Evolution of polyploidy and functional diploidization in sturgeons: microsatellite analysis in 10 sturgeon species. J. Heredity 105, 521–531 (2014).
doi: 10.1093/jhered/esu027
Crow, K. D., Smith, C. D., Cheng, J.-F., Wagner, G. P. & Amemiya, C. T. An independent genome duplication inferred from Hox paralogs in the American paddlefish—a representative basal ray-finned fish and important comparative reference. Genome Biol. Evol. 4, 937–953 (2012).
pubmed: 22851613
pmcid: 3509897
doi: 10.1093/gbe/evs067
Miller, M. J. in Sturgeons and Paddlefish of North America. Fish & Fisheries Series Vol. 27 (eds LeBreton, G. T. O., Beamish, F. W. H. & McKinley, R. S.) 87–101 (Springer, 2004).
Saito, T. et al. The origin and migration of primordial germ cells in sturgeons. PLoS ONE 9, e86861 (2014).
pubmed: 24505272
pmcid: 3914811
doi: 10.1371/journal.pone.0086861
Hochleithner, M. & Gessner, J. The Sturgeon and Paddlefishes of the World—Biology and Aquaculture Aquatech Publication 106 (Aquatech, 2001).
Allen, P. J., Cech, J. J. & Kültz, D. Mechanisms of seawater acclimation in a primitive, anadromous fish, the green sturgeon. J. Comp. Physiol. B 179, 903–920 (2009).
pubmed: 19517116
pmcid: 2745624
doi: 10.1007/s00360-009-0372-2
Haidvogl, G. et al. Typology of historical sources and the reconstruction of long‐term historical changes of riverine fish: a case study of the Austrian Danube and northern Russian rivers. Ecol. Freshw. Fish 23, 498–515 (2014).
pubmed: 25284959
doi: 10.1111/eff.12103
Saffron, I. Caviar: The Strange History and Uncertain Future of the World’s Most Coveted Delicacy (Broadway Books, 2002).
Cheng, P. et al. Draft genome and complete Hox-cluster characterization of the sterlet sturgeon (Acipenser ruthenus). Front. Genet. 10, 776 (2019).
pubmed: 31543900
pmcid: 6739705
doi: 10.3389/fgene.2019.00776
Bytyutskyy, D., Srp, J. & Flajšhans, M. Use of Feulgen image analysis densitometry to study the effect of genome size on nuclear size in polyploid sturgeons. J. Appl. Ichthyol. 28, 704–708 (2012).
doi: 10.1111/j.1439-0426.2012.02021.x
Fontana, F. et al. Fluorescent in situ hybridization with rDNA probes on chromosomes of Acipenser ruthenus and Acipenser naccarii (Osteichthyes Acipenseriformes). Genome 42, 1008–1012 (1999).
doi: 10.1139/g99-030
Valente, G. T. et al. B chromosomes: from cytogenetics to systems biology. Chromosoma 126, 73–81 (2017).
pubmed: 27558128
doi: 10.1007/s00412-016-0613-6
Du, K. et al. The genome of the arapaima (Arapaima gigas) provides insights into gigantism, fast growth and chromosomal sex determination system. Sci. Rep. 9, 5293 (2019).
pubmed: 30923320
pmcid: 6439221
doi: 10.1038/s41598-019-41457-x
Betancur-R, R. et al. Phylogenetic classification of bony fishes. BMC Evol. Biol. 17, 162 (2017).
pubmed: 28683774
pmcid: 5501477
doi: 10.1186/s12862-017-0958-3
Betancur-R, R. et al. The tree of life and a new classification of bony fishes. PLoS Curr. 5 http://doi.org/dpxx (2013).
Near, T. J. et al. Resolution of ray-finned fish phylogeny and timing of diversification. Proc. Natl Acad. Sci. USA 109, 13698–13703 (2012).
pubmed: 22869754
doi: 10.1073/pnas.1206625109
Hughes, L. C. et al. Comprehensive phylogeny of ray-finned fishes (Actinopterygii) based on transcriptomic and genomic data. Proc. Natl Acad. Sci. USA 115, 6249–6254 (2018).
pubmed: 29760103
doi: 10.1073/pnas.1719358115
Rabosky, D. L. et al. Rates of speciation and morphological evolution are correlated across the largest vertebrate radiation. Nat. Commun. 4, 1958 (2013).
pubmed: 23739623
doi: 10.1038/ncomms2958
Darwin, C. On the Origin of Species by Means of Natural Selection, or the Preservation of Favoured Races in the Struggle for Life (Oxford Univ. Press, 1859).
Krieger, J. & Fuerst, P. A. Evidence for a slowed rate of molecular evolution in the order Acipenseriformes. Mol. Biol. Evol. 19, 891–897 (2002).
pubmed: 12032245
doi: 10.1093/oxfordjournals.molbev.a004146
Chalopin, D., Naville, M., Plard, F., Galiana, D. & Volff, J.-N. Comparative analysis of transposable elements highlights mobilome diversity and evolution in vertebrates. Genome Biol. Evol. 7, 567–580 (2015).
pubmed: 25577199
pmcid: 4350176
doi: 10.1093/gbe/evv005
Amemiya, C. T. et al. The African coelacanth genome provides insights into tetrapod evolution. Nature 496, 311–316 (2013).
pubmed: 23598338
pmcid: 3633110
doi: 10.1038/nature12027
Berthelot, C. et al. The rainbow trout genome provides novel insights into evolution after whole-genome duplication in vertebrates. Nat. Commun. 5, 3657 (2014).
pubmed: 24755649
pmcid: 4071752
doi: 10.1038/ncomms4657
Lien, S. et al. The Atlantic salmon genome provides insights into rediploidization. Nature 533, 200–205 (2016).
pubmed: 27088604
doi: 10.1038/nature17164
pmcid: 27088604
Chen, Z. et al. De novo assembly of the goldfish (Carassius auratus) genome and the evolution of genes after whole-genome duplication. Sci. Adv. 5, eaav0547 (2019).
pubmed: 31249862
pmcid: 6594761
doi: 10.1126/sciadv.aav0547
Andreyushkova, D. et al. Next generation sequencing of chromosome-specific libraries sheds light on genome evolution in paleotetraploid sterlet (Acipenser ruthenus). Genes 8, 318 (2017).
pmcid: 5704231
doi: 10.3390/genes8110318
Romanenko, S. A. et al. Segmental paleotetraploidy revealed in sterlet (Acipenser ruthenus) genome by chromosome painting. Mol. Cytogenet. 8, 90 (2015).
pubmed: 26587056
pmcid: 4652396
doi: 10.1186/s13039-015-0194-8
Bennett, R. J., Uhl, M. A., Miller, M. G. & Johnson, A. D. Identification and characterization of a Candida albicans mating pheromone. Mol. Cell. Biol. 23, 8189–8201 (2003).
pubmed: 14585977
pmcid: 262406
doi: 10.1128/MCB.23.22.8189-8201.2003
Gerstein, A. C., Chun, H.-J. E., Grant, A. & Otto, S. P. Genomic convergence toward diploidy in Saccharomyces cerevisiae. PLoS Genet. 2, e145 (2006).
pubmed: 17002497
pmcid: 1570378
doi: 10.1371/journal.pgen.0020145
De Storme, N. & Mason, A. Plant speciation through chromosome instability and ploidy change: cellular mechanisms, molecular factors and evolutionary relevance. Curr. Plant Biol. 1, 10–33 (2014).
doi: 10.1016/j.cpb.2014.09.002
Rajaraman, R., Rajaraman, M. M., Rajaraman, S. R. & Guernsey, D. L. Neosis—a paradigm of self‐renewal in cancer. Cell Biol. Int. 29, 1084–1097 (2005).
pubmed: 16316756
doi: 10.1016/j.cellbi.2005.10.003
Session, A. M. et al. Genome evolution in the allotetraploid frog Xenopus laevis. Nature 538, 336–343 (2016).
pubmed: 27762356
pmcid: 5313049
doi: 10.1038/nature19840
Tang, H. et al. Synteny and collinearity in plant genomes. Science 320, 486–488 (2008).
pubmed: 18436778
doi: 10.1126/science.1153917
Dewey, C. N. Positional orthology: putting genomic evolutionary relationships into context. Brief. Bioinform. 12, 401–412 (2011).
pubmed: 21705766
pmcid: 3178058
doi: 10.1093/bib/bbr040
Freeling, M. Bias in plant gene content following different sorts of duplication: tandem, whole-genome, segmental, or by transposition. Annu. Rev. Plant Biol. 60, 433–453 (2009).
pubmed: 19575588
doi: 10.1146/annurev.arplant.043008.092122
Birchler, J. A. & Veitia, R. A. Gene balance hypothesis: connecting issues of dosage sensitivity across biological disciplines. Proc. Natl Acad. Sci. USA 109, 14746–14753 (2012).
pubmed: 22908297
doi: 10.1073/pnas.1207726109
Hrbek, T., Seckinger, J. & Meyer, A. A phylogenetic and biogeographic perspective on the evolution of poeciliid fishes. Mol. Phylogenet. Evol. 43, 986–998 (2007).
pubmed: 17185005
doi: 10.1016/j.ympev.2006.06.009
De Smet, R. et al. Convergent gene loss following gene and genome duplications creates single-copy families in flowering plants. Proc. Natl Acad. Sci. USA 110, 2898–2903 (2013).
pubmed: 23382190
doi: 10.1073/pnas.1300127110
Sémon, M. & Wolfe, K. H. Consequences of genome duplication. Curr. Opin. Genet. Dev. 17, 505–512 (2007).
pubmed: 18006297
doi: 10.1016/j.gde.2007.09.007
Conant, G. C., Birchler, J. A. & Pires, J. C. Dosage, duplication, and diploidization: clarifying the interplay of multiple models for duplicate gene evolution over time. Curr. Opin. Plant Biol. 19, 91–98 (2014).
pubmed: 24907529
doi: 10.1016/j.pbi.2014.05.008
Makino, T. & McLysaght, A. Positionally biased gene loss after whole genome duplication: evidence from human, yeast, and plant. Genome Res. 22, 2427–2435 (2012).
pubmed: 22835904
pmcid: 3514672
doi: 10.1101/gr.131953.111
Szklarczyk, D. et al. STRING v11: protein–protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets. Nucleic Acids Res. 47, D607–D613 (2019).
pubmed: 30476243
doi: 10.1093/nar/gky1131
pmcid: 30476243
Kikuta, H. et al. Genomic regulatory blocks encompass multiple neighboring genes and maintain conserved synteny in vertebrates. Genome Res. 17, 545–555 (2007).
pubmed: 17387144
pmcid: 1855176
doi: 10.1101/gr.6086307
Innan, H. & Kondrashov, F. The evolution of gene duplications: classifying and distinguishing between models. Nat. Rev. Genet. 11, 97–108 (2010).
pubmed: 20051986
doi: 10.1038/nrg2689
Ward, R. & Durrett, R. Subfunctionalization: how often does it occur? How long does it take? Theor. Popul. Biol. 66, 93–100 (2004).
pubmed: 15302219
doi: 10.1016/j.tpb.2004.03.004
Ohno, S. Evolution by Gene Duplication (Springer Science & Business Media, 2013).
Kjærner-Semb, E. et al. Atlantic salmon populations reveal adaptive divergence of immune related genes—a duplicated genome under selection. BMC Genomics 17, 610 (2016).
pubmed: 27515098
pmcid: 4982270
doi: 10.1186/s12864-016-2867-z
Wassarman, P. M. Zona pellucida glycoproteins. J. Biol. Chem. 283, 24285–24289 (2008).
pubmed: 18539589
pmcid: 2528931
doi: 10.1074/jbc.R800027200
Sano, K. et al. Comparison of egg envelope thickness in teleosts and its relationship to the sites of ZP protein synthesis. J. Exp. Zool. B 328, 240–258 (2017).
doi: 10.1002/jez.b.22729
Kim, B.-M. et al. Antarctic blackfin icefish genome reveals adaptations to extreme environments. Nat. Ecol. Evol. 3, 469–478 (2019).
pubmed: 30804520
doi: 10.1038/s41559-019-0812-7
pmcid: 7307600
Cao, L. et al. Neofunctionalization of zona pellucida proteins enhances freeze-prevention in the eggs of Antarctic notothenioids. Nat. Commun. 7, 12987 (2016).
pubmed: 27698404
pmcid: 5059455
doi: 10.1038/ncomms12987
Montavon, T. et al. A regulatory archipelago controls Hox genes transcription in digits. Cell 147, 1132–1145 (2011).
pubmed: 22118467
doi: 10.1016/j.cell.2011.10.023
Beccari, L. et al. A role for HOX13 proteins in the regulatory switch between TADs at the HoxD locus. Genes Dev. 30, 1172–1186 (2016).
pubmed: 27198226
pmcid: 4888838
Woltering, J. M., Noordermeer, D., Leleu, M. & Duboule, D. Conservation and divergence of regulatory strategies at Hox loci and the origin of tetrapod digits. PLoS Biol. 12, e1001773 (2014).
pubmed: 24465181
pmcid: 3897358
doi: 10.1371/journal.pbio.1001773
Braasch, I. et al. The spotted gar genome illuminates vertebrate evolution and facilitates human–teleost comparisons. Nat. Genet. 48, 427–437 (2016).
pubmed: 26950095
pmcid: 4817229
doi: 10.1038/ng.3526
Amores, A. et al. Zebrafish hox clusters and vertebrate genome evolution. Science 282, 1711–1714 (1998).
pubmed: 9831563
doi: 10.1126/science.282.5394.1711
Mungpakdee, S. et al. Differential evolution of the 13 Atlantic salmon Hox clusters. Mol. Biol. Evol. 25, 1333–1343 (2008).
pubmed: 18424774
doi: 10.1093/molbev/msn097
Martin, K. J. & Holland, P. W. Enigmatic orthology relationships between Hox clusters of the African butterfly fish and other teleosts following ancient whole-genome duplication. Mol. Biol. Evol. 31, 2592–2611 (2014).
pubmed: 24974377
pmcid: 4166920
doi: 10.1093/molbev/msu202
Kuraku, S. & Meyer, A. The evolution and maintenance of Hox gene clusters in vertebrates and the teleost-specific genome duplication. Int. J. Dev. Biol. 53, 765–773 (2009).
pubmed: 19557682
doi: 10.1387/ijdb.072533km
Woltering, J. M. & Durston, A. J. The zebrafish hoxdb cluster has been reduced to a single microRNA. Nat. Genet. 38, 601–602 (2006).
pubmed: 16736008
doi: 10.1038/ng0606-601
McClintock, J. M., Kheirbek, M. A. & Prince, V. E. Knockdown of duplicated zebrafish hoxb1 genes reveals distinct roles in hindbrain patterning and a novel mechanism of duplicate gene retention. Development 129, 2339–2354 (2002).
pubmed: 11973267
Takamatsu, N. et al. Duplicated Abd-B class genes in medaka hoxAa and hoxAb clusters exhibit differential expression patterns in pectoral fin buds. Dev. Genes Evol. 217, 263–273 (2007).
pubmed: 17333260
doi: 10.1007/s00427-007-0137-4
Schartl, M. et al. The genome of the platyfish, Xiphophorus maculatus, provides insights into evolutionary adaptation and several complex traits. Nat. Genet. 45, 567–572 (2013).
pubmed: 23542700
pmcid: 3677569
doi: 10.1038/ng.2604
Muller, H. Why polyploidy is rarer in animals than in plants. Am. Nat. 59, 346–353 (1925).
doi: 10.1086/280047
Mable, B. ‘Why polyploidy is rarer in animals than in plants’: myths and mechanisms. Biol. J. Linn. Soc. 82, 453–466 (2004).
doi: 10.1111/j.1095-8312.2004.00332.x
Orr, H. A. ‘Why polyploidy is rarer in animals than in plants’ revisited. Am. Nat. 136, 759–770 (1990).
doi: 10.1086/285130
Wertheim, B., Beukeboom, L. & Van de Zande, L. Polyploidy in animals: effects of gene expression on sex determination, evolution and ecology. Cytogenet. Genome Res. 140, 256–269 (2013).
pubmed: 23817224
doi: 10.1159/000351998
Fopp-Bayat, D., Kolman, R. & Woznicki, P. Induction of meiotic gynogenesis in sterlet (Acipenser ruthenus) using UV-irradiated bester sperm. Aquaculture 264, 54–58 (2007).
doi: 10.1016/j.aquaculture.2006.12.006
Havelka, M. & Arai, K. in Sex Control in Aquaculture (eds Wang H.-P. et al.) 669–687 (John Wiley & Sons Ltd., 2018).
Keyvanshokooh, S. & Gharaei, A. A review of sex determination and searches for sex‐specific markers in sturgeon. Aquac. Res. 41, e1–e7 (2010).
doi: 10.1111/j.1365-2109.2009.02463.x
Havelka, M., Kašpar, V., Hulák, M. & Flajšhans, M. Sturgeon genetics and cytogenetics: a review related to ploidy levels and interspecific hybridization. Folia Zool. 60, 93–104 (2011).
doi: 10.25225/fozo.v60.i2.a3.2011
Slotkin, R. K. & Martienssen, R. Transposable elements and the epigenetic regulation of the genome. Nat. Rev. Genet. 8, 272–285 (2007).
pubmed: 17363976
doi: 10.1038/nrg2072
Klein, S. J. & O’Neill, R. J. Transposable elements: genome innovation, chromosome diversity, and centromere conflict. Chromosome Res. 26, 5–23 (2018).
pubmed: 29332159
pmcid: 5857280
doi: 10.1007/s10577-017-9569-5
Foissac, S. et al. Transcriptome and chromatin structure annotation of liver, CD4+ and CD8+T cells from four livestock species. Preprint at bioRxiv https://doi.org/10.1101/316091 (2019).
SMRT Link v.5.0.1 (Pacific Biosciences of California, Inc. 2018).
Ruan, J. SMARTdenovo: Ultra-fast de novo assembler using long noisy reads (Github, accessed 10 January 2019); https://github.com/ruanjue/smartdenovo
Vaser, R., Sović, I., Nagarajan, N. & Šikić, M. Fast and accurate de novo genome assembly from long uncorrected reads. Genome Res. 27, 737–746 (2017).
pubmed: 28100585
pmcid: 5411768
doi: 10.1101/gr.214270.116
Li, H. Minimap2: pairwise alignment for nucleotide sequences. Bioinformatics 34, 3094–3100 (2018).
pubmed: 29750242
pmcid: 6137996
doi: 10.1093/bioinformatics/bty191
Walker, B. J. et al. Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement. PLoS ONE 9, e112963 (2014).
pubmed: 25409509
pmcid: 4237348
doi: 10.1371/journal.pone.0112963
Li, H. Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. Preprint at https://arxiv.org/abs/1303.3997 (2013).
Li, H. et al. The sequence alignment/map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).
pubmed: 19505943
pmcid: 19505943
doi: 10.1093/bioinformatics/btp352
Garrison, E. & Marth, G. Haplotype-based variant detection from short-read sequencing. Preprint at https://arxiv.org/abs/1207.3907 (2012).
Edge, P., Bafna, V. & Bansal, V. HapCUT2: robust and accurate haplotype assembly for diverse sequencing technologies. Genome Res. 27, 801–812 (2017).
pubmed: 27940952
pmcid: 5411775
doi: 10.1101/gr.213462.116
fgbio: Tools for working with genomic and high throughput sequencing data (Github, 2019); http://fulcrumgenomics.github.io/fgbio/
Li, H. A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data. Bioinformatics 27, 2987–2993 (2011).
pubmed: 21903627
pmcid: 3198575
doi: 10.1093/bioinformatics/btr509
Durand, N. C. et al. Juicer provides a one-click system for analyzing loop-resolution Hi-C experiments. Cell Syst. 3, 95–98 (2016).
pubmed: 27467249
pmcid: 5846465
doi: 10.1016/j.cels.2016.07.002
Dudchenko, O. et al. De novo assembly of the Aedes aegypti genome using Hi-C yields chromosome-length scaffolds. Science 356, 92–95 (2017).
pubmed: 28336562
pmcid: 5635820
doi: 10.1126/science.aal3327
Durand, N. C. et al. Juicebox provides a visualization system for Hi-C contact maps with unlimited zoom. Cell Syst. 3, 99–101 (2016).
pubmed: 27467250
pmcid: 5596920
doi: 10.1016/j.cels.2015.07.012
Smit, A., Hubley, R. & Green, P. RepeatMasker Open-4.0 (ISB, 2013–2015); http://www.repeatmasker.org
Ou, S. & Jiang, N. LTR_retriever: a highly accurate and sensitive program for identification of long terminal repeat retrotransposons. Plant Physiol. 176, 1410–1422 (2018).
pubmed: 29233850
doi: 10.1104/pp.17.01310
Ellinghaus, D., Kurtz, S. & Willhoeft, U. LTRharvest, an efficient and flexible software for de novo detection of LTR retrotransposons. BMC Bioinform. 9, 18 (2008).
doi: 10.1186/1471-2105-9-18
Xu, Z. & Wang, H. LTR_FINDER: an efficient tool for the prediction of full-length LTR retrotransposons. Nucleic Acids Res. 35, W265–W268 (2007).
pubmed: 17485477
pmcid: 1933203
doi: 10.1093/nar/gkm286
Smit, A. F., Hubley, R. & Green, P. RepeatMasker (ISB, 2019); http://www.repeatmasker.org/webrepeatmaskerhelp.html
Bairoch, A. & Apweiler, R. The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000. Nucleic Acids Res. 28, 45–48 (2000).
pubmed: 10592178
pmcid: 102476
doi: 10.1093/nar/28.1.45
Li, W. & Godzik, A. Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences. Bioinformatics 22, 1658–1659 (2006).
pubmed: 16731699
doi: 10.1093/bioinformatics/btl158
pmcid: 16731699
Bao, W., Kojima, K. K. & Kohany, O. Repbase Update, a database of repetitive elements in eukaryotic genomes. Mob. DNA 6, 11 (2015).
pubmed: 26045719
pmcid: 4455052
doi: 10.1186/s13100-015-0041-9
Shao, F., Wang, J., Xu, H. & Peng, Z. FishTEDB: a collective database of transposable elements identified in the complete genomes of fish. Database 2018, 1–9 (2018).
doi: 10.1093/database/bax106
Christiam, C. et al. BLAST+: architecture and applications. BMC Bioinform. 10, 421 (2009).
doi: 10.1186/1471-2105-10-421
Novák, P. et al. TAREAN: a computational tool for identification and characterization of satellite DNA from unassembled short reads. Nucleic Acids Res. 45, e111 (2017).
pubmed: 28402514
pmcid: 5499541
doi: 10.1093/nar/gkx257
Novák, P., Neumann, P., Pech, J., Steinhaisl, J. & Macas, J. RepeatExplorer: a Galaxy-based web server for genome-wide characterization of eukaryotic repetitive elements from next-generation sequence reads. Bioinformatics 29, 792–793 (2013).
pubmed: 23376349
doi: 10.1093/bioinformatics/btt054
Chen, S., Zhou, Y., Chen, Y. & Gu, J. fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics 34, i884–i890 (2018).
pubmed: 30423086
pmcid: 6129281
doi: 10.1093/bioinformatics/bty560
Neumann, P., Novák, P., Hoštáková, N. & Macas, J. Systematic survey of plant LTR-retrotransposons elucidates phylogenetic relationships of their polyprotein domains and provides a reference for element classification. Mob. DNA 10, 1 (2019).
pubmed: 30622655
pmcid: 6317226
doi: 10.1186/s13100-018-0144-1
Langmead, B. & Salzberg, S. L. Fast gapped-read alignment with Bowtie 2. Nat. Methods 9, 357–359 (2012).
pubmed: 22388286
pmcid: 22388286
doi: 10.1038/nmeth.1923
Wickham, H. ggplot2. Wiley Interdiscip. Rev. Comput. Stat. 3, 180–185 (2011).
doi: 10.1002/wics.147
Higham, D. J. & Higham, N. J. MATLAB Guide Vol. 150 (Society for Industrial and Applied Mathematics, 2016).
Simão, F. A., Waterhouse, R. M., Ioannidis, P., Kriventseva, E. V. & Zdobnov, E. M. BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics 31, 3210–3212 (2015).
pubmed: 26059717
pmcid: 26059717
doi: 10.1093/bioinformatics/btv351
Stanke, M. et al. AUGUSTUS: ab initio prediction of alternative transcripts. Nucleic Acids Res. 34, W435–W439 (2006).
pubmed: 16845043
pmcid: 1538822
doi: 10.1093/nar/gkl200
Slater, G. S. C. & Birney, E. Automated generation of heuristics for biological sequence comparison. BMC Bioinform. 6, 31 (2005).
doi: 10.1186/1471-2105-6-31
Birney, E., Clamp, M. & Durbin, R. GeneWise and genomewise. Genome Res. 14, 988–995 (2004).
pubmed: 15123596
pmcid: 479130
doi: 10.1101/gr.1865504
She, R., Chu, J. S.-C., Wang, K., Pei, J. & Chen, N. GenBlastA: enabling BLAST to identify homologous gene sequences. Genome Res. 19, 143–149 (2009).
pubmed: 18838612
pmcid: 2612959
doi: 10.1101/gr.082081.108
Ter-Hovhannisyan, V., Lomsadze, A., Chernoff, Y. O. & Borodovsky, M. Gene prediction in novel fungal genomes using an ab initio algorithm with unsupervised training. Genome Res. 18, 1979–1990 (2008).
pubmed: 18757608
pmcid: 2593577
doi: 10.1101/gr.081612.108
Trapnell, C. et al. Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks. Nat. Protoc. 7, 562–578 (2012).
pubmed: 22383036
pmcid: 22383036
doi: 10.1038/nprot.2012.016
Haas, B. J. et al. De novo transcript sequence reconstruction from RNA-seq using the Trinity platform for reference generation and analysis. Nat. Protoc. 8, 1494–1512 (2013).
pubmed: 23845962
doi: 10.1038/nprot.2013.084
pmcid: 23845962
Haas, B. J. et al. Improving the Arabidopsis genome annotation using maximal transcript alignment assemblies. Nucleic Acids Res. 31, 5654–5666 (2003).
pubmed: 14500829
pmcid: 206470
doi: 10.1093/nar/gkg770
Haas, B. J. et al. Automated eukaryotic gene structure annotation using evidencemodeler and the program to assemble spliced alignments. Genome Biol. 9, R7 (2008).
pubmed: 18190707
pmcid: 2395244
doi: 10.1186/gb-2008-9-1-r7
Camacho, C. et al. BLAST+: architecture and applications. BMC Bioinform. 10, 421 (2009).
doi: 10.1186/1471-2105-10-421
Mudunuri, U., Che, A., Yi, M. & Stephens, R. M. bioDBnet: the biological database network. Bioinformatics 25, 555–556 (2009).
pubmed: 19129209
pmcid: 2642638
doi: 10.1093/bioinformatics/btn654
Kulmanov, M., Khan, M. A. & Hoehndorf, R. DeepGO: predicting protein functions from sequence and interactions using a deep ontology-aware classifier. Bioinformatics 34, 660–668 (2017).
pmcid: 5860606
doi: 10.1093/bioinformatics/btx624
Chan, P. P. & Lowe, T. M. in Gene Prediction. Methods in Molecular Biology Vol. 1962 (ed. Kollmar, M.) 1–14 (Humana, 2019).
Lagesen, K. et al. RNammer: consistent annotation of rRNA genes in genomic sequences. Nucleic Acids Res. 35, 3100–3108 (2007).
pubmed: 1888812
pmcid: 1888812
doi: 10.1093/nar/gkm160
Nawrocki, E. P. & Eddy, S. R. Infernal 1.1: 100-fold faster RNA homology searches. Bioinformatics 29, 2933–2935 (2013).
pubmed: 24008419
pmcid: 3810854
doi: 10.1093/bioinformatics/btt509
Kalvari, I. et al. Rfam 13.0: shifting to a genome-centric resource for non-coding RNA families. Nucleic Acids Res. 46, D335–D342 (2017).
pmcid: 5753348
doi: 10.1093/nar/gkx1038
Cho, Y. S. et al. The tiger genome and comparative analysis with lion and snow leopard genomes. Nat. Commun. 4, 3433 (2013).
Ruan, J. et al. TreeFam: 2008 update. Nucleic Acids Res. 36, D735–D740 (2007).
pubmed: 18056084
pmcid: 2238856
doi: 10.1093/nar/gkm1005
Ponting, C. TreeBeST v.0.5 (SourceForge, 2007); http://treesoft.sourceforge.net/treebest.shtml
Edgar, R. C. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 32, 1792–1797 (2004).
pubmed: 15034147
pmcid: 15034147
doi: 10.1093/nar/gkh340
Capella-Gutiérrez, S., Silla-Martínez, J. M. & Gabaldón, T. trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics 25, 1972–1973 (2009).
pubmed: 19505945
pmcid: 2712344
doi: 10.1093/bioinformatics/btp348
Stamatakis, A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics 30, 1312–1313 (2014).
pubmed: 24451623
pmcid: 24451623
doi: 10.1093/bioinformatics/btu033
Ronquist, F. et al. MrBayes 3.2: efficient Bayesian phylogenetic inference and model choice across a large model space. Syst. Biol. 61, 539–542 (2012).
pubmed: 22357727
pmcid: 3329765
doi: 10.1093/sysbio/sys029
Yang, Z. PAML 4: phylogenetic analysis by maximum likelihood. Mol. Biol. Evol. 24, 1586–1591 (2007).
pubmed: 17483113
doi: 10.1093/molbev/msm088
Inoue, J., Dos Reis, M. & Yang, Z. A step-by-step tutorial: Divergence time estimation with approximate likelihood calculation using MCMCTREE in PAML (Citeseer, 2011).
Stamatakis, A., Hoover, P. & Rougemont, J. A rapid bootstrap algorithm for the RAxML web servers. Syst. Biol. 57, 758–771 (2008).
pubmed: 18853362
doi: 10.1080/10635150802429642
Lin, Q. et al. The seahorse genome and the evolution of its specialized morphology. Nature 540, 395–399 (2016).
pubmed: 27974754
doi: 10.1038/nature20595
Lundberg, J. G. & Chernoff, B. A Miocene fossil of the amazonian fish Arapaima (Teleostei, Arapaimidae) from the Magdalena River region of Colombia-biogeographic and evolutionary implications. Biotropica 24, 2–14 (1992).
doi: 10.2307/2388468
Kumazawa, Y. & Nishida, M. Molecular phylogeny of osteoglossoids: a new model for Gondwanian origin and plate tectonic transportation of the Asian arowana. Mol. Biol. Evol. 17, 1869–1878 (2000).
pubmed: 11110903
doi: 10.1093/oxfordjournals.molbev.a026288
O’Brien, K. P., Remm, M. & Sonnhammer, E. L. Inparanoid: a comprehensive database of eukaryotic orthologs. Nucleic Acids Res. 33, D476–D480 (2005).
pubmed: 15608241
doi: 10.1093/nar/gki107
Suyama, M., Torrents, D. & Bork, P. PAL2NAL: robust conversion of protein sequence alignments into the corresponding codon alignments. Nucleic Acids Res. 34, W609–W612 (2006).
pubmed: 16845082
pmcid: 1538804
doi: 10.1093/nar/gkl315
Talavera, G. & Castresana, J. Improvement of phylogenies after removing divergent and ambiguously aligned blocks from protein sequence alignments. Syst. Biol. 56, 564–577 (2007).
pubmed: 17654362
doi: 10.1080/10635150701472164
Retief, J. D. in Bioinformatics Methods and Protocols (eds Misener, S. & Krawetz, S. A.) 243–258 (Humana Press, 2000).
Huerta-Cepas, J., Serra, F. & Bork, P. ETE 3: reconstruction, analysis, and visualization of phylogenomic data. Mol. Biol. Evol. 33, 1635–1638 (2016).
pubmed: 26921390
pmcid: 4868116
doi: 10.1093/molbev/msw046
Dobin, A. et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics 29, 15–21 (2013).
doi: 10.1093/bioinformatics/bts635
Catchen, J. M., Amores, A., Hohenlohe, P., Cresko, W. & Postlethwait, J. H. Stacks: building and genotyping loci de novo from short-read sequences. G3 1, 171–182 (2011).
Catchen, J., Hohenlohe, P. A., Bassham, S., Amores, A. & Cresko, W. A. Stacks: an analysis tool set for population genomics. Mol. Ecol. 22, 3124–3140 (2013).
pubmed: 23701397
pmcid: 3936987
doi: 10.1111/mec.12354
Feron, R. et al. RADSex (Github, 2018); https://github.com/RomainFeron/RADSex