Dense sampling of bird diversity increases power of comparative genomics.
Journal
Nature
ISSN: 1476-4687
Abbreviated title: Nature
Country: England
NLM ID: 0410462
Publication information
Publication date: 11 2020
History:
received: 09 08 2019
accepted: 27 07 2020
entrez: 12 11 2020
pubmed: 13 11 2020
medline: 15 12 2020
Status: ppublish
Abstract
Whole-genome sequencing projects are increasingly populating the tree of life and characterizing biodiversity.
Identifiers
pubmed: 33177665
doi: 10.1038/s41586-020-2873-9
pii: 10.1038/s41586-020-2873-9
pmc: PMC7759463
Publication types
Journal Article
Research Support, N.I.H., Extramural
Research Support, Non-U.S. Gov't
Languages
eng
Citation subsets
IM
Pagination
252-257
Grants
Agency: NHLBI NIH HHS
ID: U01 HL137183
Country: United States
Agency: Howard Hughes Medical Institute
Country: United States
Agency: NHGRI NIH HHS
ID: T32 HG008345
Country: United States
Agency: NHGRI NIH HHS
ID: R01 HG010053
Country: United States
Agency: NHGRI NIH HHS
ID: R01 HG010485
Country: United States
Agency: NHGRI NIH HHS
ID: U54 HG007990
Country: United States
Agency: NIGMS NIH HHS
ID: R35 GM133412
Country: United States
Comments and corrections
Type: ErratumIn
References
Lewin, H. A. et al. Earth BioGenome project: sequencing life for the future of life. Proc. Natl Acad. Sci. USA 115, 4325–4333 (2018).
pubmed: 29686065
pmcid: 5924910
doi: 10.1073/pnas.1720115115
Genome 10K Community of Scientists. Genome 10K: a proposal to obtain whole-genome sequence for 10,000 vertebrate species. J. Hered. 100, 659–674 (2009).
pmcid: 2877544
doi: 10.1093/jhered/esp086
i5K Consortium. The i5K initiative: advancing arthropod genomics for knowledge, human health, agriculture, and the environment. J. Hered. 104, 595–600 (2013).
pmcid: 4046820
doi: 10.1093/jhered/est050
Cheng, S. et al. 10KP: a phylodiverse genome sequencing plan. Gigascience 7, 1–9 (2018).
pubmed: 29618049
doi: 10.1093/gigascience/giy013
Prum, R. O. et al. A comprehensive phylogeny of birds (Aves) using targeted next-generation DNA sequencing. Nature 526, 569–573 (2015).
pubmed: 26444237
doi: 10.1038/nature15697
Zhang, G. et al. Bird sequencing project takes off. Nature 522, 34 (2015).
pubmed: 26040883
doi: 10.1038/522034d
Boomsma, J. J. et al. The Global Ant Genomics Alliance (GAGA). Myrmecol. News 25, 61–66 (2017).
Chen, L. et al. Large-scale ruminant genome sequencing provides insights into their evolution and distinct traits. Science 364, eaav6202 (2019).
pubmed: 31221828
doi: 10.1126/science.aav6202
Jarvis, E. D. et al. Whole-genome analyses resolve early branches in the tree of life of modern birds. Science 346, 1320–1331 (2014).
pubmed: 25504713
pmcid: 4405904
doi: 10.1126/science.1253451
Zhang, G. et al. Comparative genomics reveals insights into avian genome evolution and adaptation. Science 346, 1311–1320 (2014).
pubmed: 25504712
pmcid: 4390078
doi: 10.1126/science.1251385
Dickinson, E. C. & Remsen, J. V. (eds) The Howard and Moore Complete Checklist of the Birds of the World Volume 1: Non-passerines 4th edn (Aves, 2013).
Dickinson, E. C. & Christidis, L. (eds) The Howard and Moore Complete Checklist of the Birds of the World Volume 2: Passerines 4th edn (Aves, 2014).
BirdLife International. Leucopsar rothschildi. https://doi.org/10.2305/IUCN.UK.2018-2.RLTS.T22710912A129874226.en (The IUCN Red List of Threatened Species, 2018).
Meredith, R. W., Zhang, G., Gilbert, M. T. P., Jarvis, E. D. & Springer, M. S. Evidence for a single loss of mineralized teeth in the common avian ancestor. Science 346, 1254390 (2014).
pubmed: 25504730
doi: 10.1126/science.1254390
Deutekom, E. S., Vosseberg, J., van Dam, T. J. P. & Snel, B. Measuring the impact of gene prediction on gene loss estimates in Eukaryotes by quantifying falsely inferred absences. PLOS Comput. Biol. 15, e1007301 (2019).
pubmed: 31461468
pmcid: 6736253
doi: 10.1371/journal.pcbi.1007301
Plotkin, J. B. & Kudla, G. Synonymous but not the same: the causes and consequences of codon bias. Nat. Rev. Genet. 12, 32–42 (2011).
pubmed: 21102527
doi: 10.1038/nrg2899
Armstrong, J. et al. Progressive Cactus is a multiple-genome aligner for the thousand-genome era. Nature https://doi.org/10.1038/s41586-020-2871-y (2020).
Armstrong, J. Enabling Comparative Genomics at the Scale of Hundreds of Species. PhD thesis, Univ. California Santa Cruz (2019).
Blanchette, M. et al. Aligning multiple genomic sequences with the threaded blockset aligner. Genome Res. 14, 708–715 (2004).
pubmed: 15060014
pmcid: 383317
doi: 10.1101/gr.1933104
Pegueroles, C., Laurie, S. & Albà, M. M. Accelerated evolution after gene duplication: a time-dependent process affecting just one copy. Mol. Biol. Evol. 30, 1830–1842 (2013).
pubmed: 23625888
doi: 10.1093/molbev/mst083
Yuri, T., Kimball, R. T., Braun, E. L. & Braun, M. J. Duplication of accelerated evolution and growth hormone gene in passerine birds. Mol. Biol. Evol. 25, 352–361 (2008).
pubmed: 18048401
doi: 10.1093/molbev/msm260
Armstrong, J., Fiddes, I. T., Diekhans, M. & Paten, B. Whole-genome alignment and comparative annotation. Annu. Rev. Anim. Biosci. 7, 41–64 (2019).
pubmed: 30379572
doi: 10.1146/annurev-animal-020518-115005
Schusdziarra, C., Blamowska, M., Azem, A. & Hell, K. Methylation-controlled J-protein MCJ acts in the import of proteins into human mitochondria. Hum. Mol. Genet. 22, 1348–1357 (2013).
pubmed: 23263864
doi: 10.1093/hmg/dds541
Zhang, B., Peñagaricano, F., Driver, A., Chen, H. & Khatib, H. Differential expression of heat shock protein genes and their splice variants in bovine preimplantation embryos. J. Dairy Sci. 94, 4174–4182 (2011).
pubmed: 21787952
doi: 10.3168/jds.2010-4137
Mlitz, V. et al. Trichohyalin-like proteins have evolutionarily conserved roles in the morphogenesis of skin appendages. J. Invest. Dermatol. 134, 2685–2692 (2014).
pubmed: 24780931
pmcid: 4260798
doi: 10.1038/jid.2014.204
Riede, T., Suthers, R. A., Fletcher, N. H. & Blevins, W. E. Songbirds tune their vocal tract to the fundamental frequency of their song. Proc. Natl Acad. Sci. USA 103, 5543–5548 (2006).
pubmed: 16567614
pmcid: 1459391
doi: 10.1073/pnas.0601262103
Drake, J. A. et al. Conserved noncoding sequences are selectively constrained and not mutation cold spots. Nat. Genet. 38, 223–227 (2006).
pubmed: 16380714
doi: 10.1038/ng1710
McLean, C. Y. et al. Human-specific loss of regulatory DNA and the evolution of human-specific traits. Nature 471, 216–219 (2011).
pubmed: 21390129
pmcid: 3071156
doi: 10.1038/nature09774
Mank, J. E., Axelsson, E. & Ellegren, H. Fast-X on the Z: rapid evolution of sex-linked genes in birds. Genome Res. 17, 618–624 (2007).
pubmed: 17416747
pmcid: 1855182
doi: 10.1101/gr.6031907
Axelsson, E., Webster, M. T., Smith, N. G. C., Burt, D. W. & Ellegren, H. Comparison of the chicken and turkey genomes reveals a higher rate of nucleotide divergence on microchromosomes than macrochromosomes. Genome Res. 15, 120–125 (2005).
pubmed: 15590944
pmcid: 540272
doi: 10.1101/gr.3021305
Haeussler, M. et al. The UCSC Genome Browser database: 2019 update. Nucleic Acids Res. 47, D853–D858 (2019).
pubmed: 30407534
doi: 10.1093/nar/gky1095
Cooper, G. M., Brudno, M., Green, E. D., Batzoglou, S. & Sidow, A. Quantitative estimates of sequence divergence for comparative analyses of mammalian genomes. Genome Res. 13, 813–820 (2003).
pubmed: 12727901
pmcid: 430923
doi: 10.1101/gr.1064503
Benjamini, Y. & Hochberg, Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J. R. Stat. Soc. B 57, 289–300 (1995).
Gelabert, P. et al. Evolutionary history, genomic adaptation to toxic diet, and extinction of the Carolina parakeet. Curr. Biol. 30, 108–114.e5 (2020).
pubmed: 31839456
doi: 10.1016/j.cub.2019.10.066
Feng, S. et al. The genomic footprints of the fall and recovery of the crested ibis. Curr. Biol. 29, 340–349.e7 (2019).
pubmed: 30639104
pmcid: 6345625
doi: 10.1016/j.cub.2018.12.008
Brown, J. W., Wang, N. & Smith, S. A. The development of scientific consensus: analyzing conflict and concordance among avian phylogenies. Mol. Phylogenet. Evol. 116, 69–77 (2017).
pubmed: 28797692
doi: 10.1016/j.ympev.2017.08.002
Luo, R. et al. SOAPdenovo2: an empirically improved memory-efficient short-read de novo assembler. Gigascience 1, 18 (2012).
pubmed: 23587118
pmcid: 3626529
doi: 10.1186/2047-217X-1-18
Gnerre, S. et al. High-quality draft assemblies of mammalian genomes from massively parallel sequence data. Proc. Natl Acad. Sci. USA 108, 1513–1518 (2011).
pubmed: 21187386
doi: 10.1073/pnas.1017351108
Simão, F. A., Waterhouse, R. M., Ioannidis, P., Kriventseva, E. V. & Zdobnov, E. M. BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics 31, 3210–3212 (2015).
pubmed: 26059717
doi: 10.1093/bioinformatics/btv351
Dierckxsens, N., Mardulyn, P. & Smits, G. NOVOPlasty: de novo assembly of organelle genomes from whole genome data. Nucleic Acids Res. 45, e18 (2017).
pubmed: 28204566
doi: 10.1093/nar/gkw1060
Meng, G., Li, Y., Yang, C. & Liu, S. MitoZ: a toolkit for animal mitochondrial genome assembly, annotation and visualization. Nucleic Acids Res. 47, e63 (2019).
pubmed: 30864657
pmcid: 6582343
doi: 10.1093/nar/gkz173
Benson, G. Tandem repeats finder: a program to analyze DNA sequences. Nucleic Acids Res. 27, 573–580 (1999).
pubmed: 9862982
pmcid: 148217
doi: 10.1093/nar/27.2.573
Smit, A. F. A., Hubley, R. & Green, P. RepeatMasker Open-4.0. http://www.repeatmasker.org/ (2013–2015).
Smit, A. F. A. & Hubley, R. RepeatModeler Open-1.0. http://www.repeatmasker.org/RepeatModeler/ (2008–2015).
Revell, L. J. phytools: an R package for phylogenetic comparative biology (and other things). Methods Ecol. Evol. 3, 217–223 (2012).
doi: 10.1111/j.2041-210X.2011.00169.x
Faircloth, B. C. et al. Ultraconserved elements anchor thousands of genetic markers spanning multiple evolutionary timescales. Syst. Biol. 61, 717–726 (2012).
pubmed: 22232343
doi: 10.1093/sysbio/sys004
Faircloth, B. C. PHYLUCE is a software package for the analysis of conserved genomic loci. Bioinformatics 32, 786–788 (2016).
pubmed: 26530724
doi: 10.1093/bioinformatics/btv646
Kozlov, A. M., Aberer, A. J. & Stamatakis, A. ExaML version 3: a tool for phylogenomic analyses on supercomputers. Bioinformatics 31, 2577–2579 (2015).
pubmed: 25819675
pmcid: 4514929
doi: 10.1093/bioinformatics/btv184
Fitch, W. M. Distinguishing homologous from analogous proteins. Syst. Zool. 19, 99–113 (1970).
pubmed: 5449325
doi: 10.2307/2412448
Fitch, W. M. Homology: a personal view on some of the problems. Trends Genet. 16, 227–231 (2000).
pubmed: 10782117
doi: 10.1016/S0168-9525(00)02005-9
Dewey, C. N. Positional orthology: putting genomic evolutionary relationships into context. Brief. Bioinform. 12, 401–412 (2011).
pubmed: 21705766
pmcid: 3178058
doi: 10.1093/bib/bbr040
Fernández, R., Gabaldon, T. & Dessimoz, C. in Phylogenetics in the Genomic Era (eds. Scornavacca, C. et al.) 2.4:1–2.4:14 (2020).
Jolliffe, I. T. & Greenacre, M. J. Theory and applications of correspondence analysis. Biometrics 42, 223 (1986).
doi: 10.2307/2531266
Wright, F. The ‘effective number of codons’ used in a gene. Gene 87, 23–29 (1990).
pubmed: 2110097
doi: 10.1016/0378-1119(90)90491-9
Bao, W., Kojima, K. K. & Kohany, O. Repbase update, a database of repetitive elements in eukaryotic genomes. Mob. DNA 6, 11 (2015).
pubmed: 26045719
pmcid: 4455052
doi: 10.1186/s13100-015-0041-9
Hubisz, M. J., Pollard, K. S. & Siepel, A. PHAST and RPHAST: phylogenetic analysis with space/time models. Brief. Bioinform. 12, 41–51 (2011).
pubmed: 21278375
doi: 10.1093/bib/bbq072
Charlesworth, B., Coyne, J. A. & Barton, N. H. The relative rates of evolution of sex chromosomes and autosomes. Am. Nat. 130, 113–146 (1987).
doi: 10.1086/284701
Pollard, K. S., Hubisz, M. J., Rosenbloom, K. R. & Siepel, A. Detection of nonneutral substitution rates on mammalian phylogenies. Genome Res. 20, 110–121 (2010).
pubmed: 19858363
pmcid: 2798823
doi: 10.1101/gr.097857.109
Zerbino, D. R., Johnson, N., Juettemann, T., Wilder, S. P. & Flicek, P. WiggleTools: parallel processing of large collections of genome-wide datasets for visualization and statistical analysis. Bioinformatics 30, 1008–1009 (2014).
pubmed: 24363377
doi: 10.1093/bioinformatics/btt737
Fang, S. et al. NONCODEV5: a comprehensive annotation database for long non-coding RNAs. Nucleic Acids Res. 46, D308–D314 (2018).
pubmed: 29140524
doi: 10.1093/nar/gkx1107
Fornes, O. et al. JASPAR 2020: update of the open-access database of transcription factor binding profiles. Nucleic Acids Res. 48, D87–D92 (2020).
pubmed: 31701148
doi: 10.1093/nar/gkaa516
Katoh, K. & Standley, D. M. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol. Biol. Evol. 30, 772–780 (2013).
pubmed: 23329690
pmcid: 3603318
doi: 10.1093/molbev/mst010
R Core Team. R: a language and environment for statistical computing. http://www.R-project.org/ (R Foundation for Statistical Computing, 2013).
Nguyen, L.-T., Schmidt, H. A., von Haeseler, A. & Minh, B. Q. IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol. Biol. Evol. 32, 268–274 (2015).
Slater, G. S. C. & Birney, E. Automated generation of heuristics for biological sequence comparison. BMC Bioinformatics 6, 31 (2005).
pubmed: 15713233
pmcid: 553969
doi: 10.1186/1471-2105-6-31