Towards complete and error-free genome assemblies of all vertebrate species.


Journal

Nature
ISSN: 1476-4687
Titre abrégé: Nature
Pays: England
ID NLM: 0410462

Informations de publication

Date de publication:
04 2021
Historique:
received: 22 05 2020
accepted: 12 03 2021
entrez: 29 4 2021
pubmed: 30 4 2021
medline: 11 1 2022
Statut: ppublish

Résumé

High-quality and complete reference genome assemblies are fundamental for the application of genomics to biology, disease, and biodiversity conservation. However, such assemblies are available for only a few non-microbial species

Identifiants

pubmed: 33911273
doi: 10.1038/s41586-021-03451-0
pii: 10.1038/s41586-021-03451-0
pmc: PMC8081667
doi:

Types de publication

Journal Article Research Support, N.I.H., Extramural Research Support, N.I.H., Intramural Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

737-746

Subventions

Organisme : NIDCD NIH HHS
ID : R21 DC014432
Pays : United States
Organisme : Wellcome Trust
Pays : United Kingdom
Organisme : NHGRI NIH HHS
ID : R01 HG010485
Pays : United States
Organisme : Intramural NIH HHS
ID : ZIA HG200398
Pays : United States
Organisme : Biotechnology and Biological Sciences Research Council
ID : BBS/E/T/000PR9817
Pays : United Kingdom
Organisme : NIGMS NIH HHS
ID : R01 GM130691
Pays : United States
Organisme : NHGRI NIH HHS
ID : R44 HG008118
Pays : United States
Organisme : Medical Research Council
ID : MR/T021985/1
Pays : United Kingdom

Commentaires et corrections

Type : CommentIn

Références

International Human Genome Sequencing Consortium. Initial sequencing and analysis of the human genome. Nature 409, 860–921 (2001).
doi: 10.1038/35057062
Sulston, J. et al. The C. elegans genome sequencing project: a beginning. Nature 356, 37–41 (1992).
pubmed: 1538779 doi: 10.1038/356037a0
Mouse Genome Sequencing Consortium. Initial sequencing and comparative analysis of the mouse genome. Nature 420, 520–562 (2002).
doi: 10.1038/nature01262
Howe, K. et al. The zebrafish reference genome sequence and its relationship to the human genome. Nature 496, 498–503 (2013).
pubmed: 23594743 pmcid: 3703927 doi: 10.1038/nature12111
Genome 10K Community of Scientists. Genome 10K: a proposal to obtain whole-genome sequence for 10,000 vertebrate species. J. Hered. 100, 659–674 (2009).
pmcid: 2877544 doi: 10.1093/jhered/esp086
Koepfli, K.-P., Paten, B., the Genome 10K Community of Scientists & O’Brien, S. J. The Genome 10K Project: a way forward. Annu. Rev. Anim. Biosci. 3, 57–111 (2015).
pubmed: 25689317 pmcid: 5837290 doi: 10.1146/annurev-animal-090414-014900
Venter, J. C. et al. The sequence of the human genome. Science 291, 1304–1351 (2001).
pubmed: 11181995 doi: 10.1126/science.1058040
Adams, M. D. et al. The genome sequence of Drosophila melanogaster. Science 287, 2185–2195 (2000).
pubmed: 10731132 doi: 10.1126/science.287.5461.2185
Shendure, J. & Ji, H. Next-generation DNA sequencing. Nat. Biotechnol. 26, 1135–1145 (2008).
pubmed: 18846087 doi: 10.1038/nbt1486
Yin, Z.-T. et al. Revisiting avian ‘missing’ genes from de novo assembled transcripts. BMC Genomics 20, 4 (2019).
pubmed: 30611188 pmcid: 6321700 doi: 10.1186/s12864-018-5407-1
Korlach, J. et al. De novo PacBio long-read and phased avian genome assemblies correct and add to reference genes generated with intermediate and short reads. Gigascience 6, 1–16 (2017).
pubmed: 29020750 pmcid: 5632298 doi: 10.1093/gigascience/gix085
Kelley, D. R. & Salzberg, S. L. Detection and correction of false segmental duplications caused by genome mis-assembly. Genome Biol. 11, R28 (2010).
pubmed: 20219098 pmcid: 2864568 doi: 10.1186/gb-2010-11-3-r28
Roach, M. J., Schmidt, S. A. & Borneman, A. R. Purge Haplotigs: allelic contig reassignment for third-gen diploid genome assemblies. BMC Bioinformatics 19, 460 (2018).
pubmed: 30497373 pmcid: 6267036 doi: 10.1186/s12859-018-2485-7
Guan, D. et al. Identifying and removing haplotypic duplication in primary genome assemblies. Bioinformatics 36, 2896–2898 (2020).
pubmed: 31971576 pmcid: 7203741 doi: 10.1093/bioinformatics/btaa025
Bradnam, K. R. et al. Assemblathon 2: evaluating de novo methods of genome assembly in three vertebrate species. Gigascience 2, 10 (2013).
pubmed: 23870653 pmcid: 3844414 doi: 10.1186/2047-217X-2-10
Zhang, G. et al. Comparative genomics reveals insights into avian genome evolution and adaptation. Science 346, 1311–1320 (2014).
pubmed: 25504712 pmcid: 4390078 doi: 10.1126/science.1251385
Chin, C.-S. et al. Phased diploid genome assembly with single-molecule real-time sequencing. Nat. Methods 13, 1050–1054 (2016).
pubmed: 27749838 pmcid: 5503144 doi: 10.1038/nmeth.4035
Bresler, G., Bresler, M. & Tse, D. Optimal assembly for high throughput shotgun sequencing. BMC Bioinformatics 14 (Suppl. 5), S18 (2013).
pubmed: 23902516 pmcid: 3706340 doi: 10.1186/1471-2105-14-S5-S18
Warren, W. C. et al. The genome of a songbird. Nature 464, 757–762 (2010).
pubmed: 20360741 pmcid: 3187626 doi: 10.1038/nature08819
Koren, S. et al. De novo assembly of haplotype-resolved genomes with trio binning. Nat. Biotechnol. (2018).
Koren, S., Phillippy, A. M., Simpson, J. T., Loman, N. J. & Loose, M. Reply to ‘Errors in long-read assemblies can critically affect protein prediction’. Nat. Biotechnol. 37, 127–128 (2019).
pubmed: 30670797 doi: 10.1038/s41587-018-0005-y
Vollger, M. R. et al. Long-read sequence and assembly of segmental duplications. Nat. Methods 16, 88–94 (2019).
pubmed: 30559433 doi: 10.1038/s41592-018-0236-3
Rhie, A., Walenz, B. P., Koren, S. & Phillippy, A. M. Merqury: reference-free quality, completeness, and phasing assessment for genome assemblies. Genome Biol. 21, 245 (2020).
pubmed: 32928274 pmcid: 7488777 doi: 10.1186/s13059-020-02134-9
Waterhouse, R. M. et al. BUSCO applications from quality assessments to gene prediction and phylogenomics. Mol. Biol. Evol. 35, 543–548 (2018).
pubmed: 29220515 doi: 10.1093/molbev/msx319
Howe, K. et al. Significantly improving the quality of genome assemblies through curation. Gigascience 10, giaa153 (2021).
pubmed: 33420778 pmcid: 7794651 doi: 10.1093/gigascience/giaa153
Zhou, Y. et al. Platypus and echidna genomes reveal mammalian biology and evolution. Nature https://doi.org/10.1038/s41586-020-03039-0 (2021).
Kim, J. et al. False gene and chromosome losses affected by assembly and sequence errors. Preprint at https://doi.org/10.1101/2021.04.09.438906 (2021).
Lewin, H. A., Graves, J. A. M., Ryder, O. A., Graphodatsky, A. S. & O’Brien, S. J. Precision nomenclature for the new genomics. Gigascience 8, giz086 (2019).
pubmed: 31437278 pmcid: 6705538 doi: 10.1093/gigascience/giz086
Kronenberg, Z. N. et al. Extended haplotype phasing of de novo genome assemblies with FALCON-Phase. Nat. Commun. https://doi.org/10.1038/s41467-020-20536-y (2021).
Ewing, B., Hillier, L., Wendl, M. C. & Green, P. Base-calling of automated sequencer traces using phred. I. Accuracy assessment. Genome Res. 8, 175–185 (1998).
pubmed: 9521921 doi: 10.1101/gr.8.3.175
Tomaszkiewicz, M., Medvedev, P. & Makova, K. D. Y and W chromosome assemblies: approaches and discoveries. Trends Genet. 33, 266–282 (2017).
pubmed: 28236503 doi: 10.1016/j.tig.2017.01.008
Kolesnikov, A. A. & Gerasimov, E. S. Diversity of mitochondrial genome organization. Biochem. (Mosc.) 77, 1424–1435 (2012).
doi: 10.1134/S0006297912130020
Formenti, G. et al. Complete vertebrate mitogenomes reveal widespread repeats and gene duplications. Genome Biol. (in the press).
Harrison, G. L. A. et al. Four new avian mitochondrial genomes help get to basic evolutionary questions in the late cretaceous. Mol. Biol. Evol. 21, 974–983 (2004).
pubmed: 14739240 doi: 10.1093/molbev/msh065
Zhao, H. et al. The complete mitochondrial genome of the Anabas testudineus (Perciformes, Anabantidae). Mitochondrial DNA A DNA Mapp. Seq. Anal. 27, 1005–1007 (2016).
pubmed: 24960569 doi: 10.3109/19401736.2014.926526
Suzuki, A. et al. How the kinetochore couples microtubule force and centromere stretch to move chromosomes. Nat. Cell Biol. 18, 382–392 (2016).
pubmed: 26974660 pmcid: 4814359 doi: 10.1038/ncb3323
Pfenning, A. R. et al. Convergent transcriptional specializations in the brains of humans and song-learning birds. Science 346, 1256846 (2014).
pubmed: 25504733 pmcid: 4385736 doi: 10.1126/science.1256846
Robinson, R. For mammals, loss of yolk and gain of milk went hand in hand. PLoS Biol. 6, e77 (2008).
pubmed: 20076706 pmcid: 2267822 doi: 10.1371/journal.pbio.0060077
Brandl, K. et al. Yip1 domain family, member 6 (Yipf6) mutation induces spontaneous intestinal inflammation in mice. Proc. Natl Acad. Sci. USA 109, 12650–12655 (2012).
pubmed: 22802641 pmcid: 3412000 doi: 10.1073/pnas.1210366109
Malmstrøm, M. et al. Evolution of the immune system influences speciation rates in teleost fishes. Nat. Genet. 48, 1204–1210 (2016).
pubmed: 27548311 doi: 10.1038/ng.3645
Japundžić-Žigon, N., Lozić, M., Šarenac, O. & Murphy, D. Vasopressin & oxytocin in control of the cardiovascular system: an updated review. Curr. Neuropharmacol. 18, 14–33 (2020).
pubmed: 31544693 pmcid: 7327933 doi: 10.2174/1570159X17666190717150501
Cataldo, I., Azhari, A. & Esposito, G. A review of oxytocin and arginine-vasopressin receptors and their modulation of autism spectrum disorder. Front. Mol. Neurosci. 11, 27 (2018).
pubmed: 29487501 pmcid: 5816822 doi: 10.3389/fnmol.2018.00027
Warren, W. C. et al. Genome analysis of the platypus reveals unique signatures of evolution. Nature 453, 175–183 (2008).
pubmed: 18464734 pmcid: 2803040 doi: 10.1038/nature06936
Ko, B. J. et al. Widespread false gene gains caused by duplication errors in genome assemblies. Preprint at https://doi.org/10.1101/2021.04.09.438957 (2021).
Lemaire, S. et al. Characterizing the interplay between gene nucleotide composition bias and splicing. Genome Biol. 20, 259 (2019).
pubmed: 31783898 pmcid: 6883713 doi: 10.1186/s13059-019-1869-y
Zhang, L., Kasif, S., Cantor, C. R. & Broude, N. E. GC/AT-content spikes as genomic punctuation marks. Proc. Natl Acad. Sci. USA 101, 16855–16860 (2004).
pubmed: 15548610 pmcid: 534751 doi: 10.1073/pnas.0407821101
Jarvis, E. D. et al. Global view of the functional molecular organization of the avian cerebrum: mirror images and functional columns. J. Comp. Neurol. 521, 3614–3665 (2013).
pubmed: 23818122 pmcid: 4145244 doi: 10.1002/cne.23404
Kubikova, L., Wada, K. & Jarvis, E. D. Dopamine receptors in a songbird brain. J. Comp. Neurol. 518, 741–769 (2010).
pubmed: 20058221 pmcid: 2904815 doi: 10.1002/cne.22255
Sémon, M. & Wolfe, K. H. Rearrangement rate following the whole-genome duplication in teleosts. Mol. Biol. Evol. 24, 860–867 (2007).
pubmed: 17218642 doi: 10.1093/molbev/msm003
Jebb, D. et al. Six reference-quality genomes reveal evolution of bat adaptations. Nature 583, 578–584 (2020).
pubmed: 32699395 pmcid: 8075899 doi: 10.1038/s41586-020-2486-3
Schneider, V. A. et al. Evaluation of GRCh38 and de novo haploid genome assemblies demonstrates the enduring quality of the reference assembly. Genome Res. 27, 849–864 (2017).
pubmed: 28396521 pmcid: 5411779 doi: 10.1101/gr.213611.116
Warren, W. C. et al. A new chicken genome assembly provides insight into avian genome structure. G3 (Bethesda) 7, 109–117 (2017).
doi: 10.1534/g3.116.035923
Meredith, R. W. et al. Impacts of the Cretaceous Terrestrial Revolution and KPg extinction on mammal diversification. Science 334, 521–524 (2011).
pubmed: 21940861 doi: 10.1126/science.1211028
Rodriguez-Agudo, D. et al. StarD5: an ER stress protein regulates plasma membrane and intracellular cholesterol homeostasis. J. Lipid Res. 60, 1087–1098 (2019).
pubmed: 31015253 pmcid: 6547630 doi: 10.1194/jlr.M091967
Kim, J. et al. Reconstruction and evolutionary history of eutherian chromosomes. Proc. Natl Acad. Sci. USA 114, E5379–E5388 (2017).
pubmed: 28630326 pmcid: 5502614 doi: 10.1073/pnas.1702012114
Lin, B., Dutta, B. & Fraser, I. D. C. Systematic investigation of multi-TLR sensing identifies regulators of sustained gene activation in macrophages. Cell Syst. 5, 25–37.e3 (2017).
pubmed: 28750197 pmcid: 5584636 doi: 10.1016/j.cels.2017.06.014
Theofanopoulou, C., Gedman, G. L., Cahill, J. A., Boeckx, C. & Jarvis, E. D. Universal nomenclature for oxytocin-vasotocin ligand and receptor families. Nature https://doi.org/10.1038/s41586-020-03040-7 (2021).
Ocampo Daza, D. & Haitina, T. Reconstruction of the carbohydrate 6-O sulfotransferase gene family evolution in vertebrates reveals novel member, CHST16, lost in amniotes. Genome Biol. Evol. 12, 993–1012 (2020).
pubmed: 32652010 doi: 10.1093/gbe/evz274
Damas, J. et al. Broad host range of SARS-CoV-2 predicted by comparative and structural analysis of ACE2 in vertebrates. Proc. Natl Acad. Sci. USA 117, 22311–22322 (2020).
pubmed: 32826334 pmcid: 7486773 doi: 10.1073/pnas.2010146117
Dussex, N. et al. Population genomics reveals the impact of long-term small population size in the critically endangered kākāpō. Cell Genom. (in the press).
Teeling, E. C. et al. Bat biology, genomes, and the Bat1K project: to generate chromosome-level genomes for all living bat species. Annu. Rev. Anim. Biosci. 6, 23–46 (2018).
pubmed: 29166127 doi: 10.1146/annurev-animal-022516-022811
Lewin, H. A. et al. Earth BioGenome Project: sequencing life for the future of life. Proc. Natl Acad. Sci. USA 115, 4325–4333 (2018).
pubmed: 29686065 pmcid: 5924910 doi: 10.1073/pnas.1720115115
Jarvis, E. D. et al. Whole-genome analyses resolve early branches in the tree of life of modern birds. Science 346, 1320–1331 (2014).
pubmed: 25504713 pmcid: 4405904 doi: 10.1126/science.1253451
Li, S. et al. Genomic signatures of near-extinction and rebirth of the crested ibis and other endangered bird species. Genome Biol. 15, 557 (2014).
pubmed: 25496777 pmcid: 4290368 doi: 10.1186/s13059-014-0557-1
Koren, S. & Phillippy, A. M. One chromosome, one contig: complete microbial genomes from long-read sequencing and assembly. Curr. Opin. Microbiol. 23, 110–120 (2015).
pubmed: 25461581 doi: 10.1016/j.mib.2014.11.014
Jenjaroenpun, P. et al. Complete genomic and transcriptional landscape analysis using third-generation sequencing: a case study of Saccharomyces cerevisiae CEN.PK113-7D. Nucleic Acids Res. 46, e38 (2018).
pubmed: 29346625 pmcid: 5909453 doi: 10.1093/nar/gky014
Tyson, J. R. et al. MinION-based long-read sequencing and assembly extends the Caenorhabditis elegans reference genome. Genome Res. 28, 266–274 (2018).
pubmed: 29273626 pmcid: 5793790 doi: 10.1101/gr.221184.117
Miga, K. H. et al. Telomere-to-telomere assembly of a complete human X chromosome. Nature 585, 79–84 (2020).
pubmed: 32663838 pmcid: 7484160 doi: 10.1038/s41586-020-2547-7
Logsdon, G. A. et al. The structure, function and evolution of a complete human chromosome 8. Nature https://doi.org/10.1038/s41586-021-03420-7 (2021).
Beçak, M. L., Beçak, W., Roberts, F. L., Shoffner, R. N. & Volpe, P. (eds.) Chromosome Atlas: Fish, Amphibians, Reptiles, and Birds Vol. 2 (Springer, 1973).
Vurture, G. W. et al. GenomeScope: fast reference-free genome profiling from short reads. Bioinformatics 33, 2202–2204 (2017).
pubmed: 28369201 pmcid: 5870704 doi: 10.1093/bioinformatics/btx153
Kumar, S., Stecher, G., Suleski, M. & Hedges, S. B. TimeTree: a resource for timelines, timetrees, and divergence times. Mol. Biol. Evol. 34, 1812–1819 (2017).
pubmed: 28387841 doi: 10.1093/molbev/msx116
Ondov, B. D. et al. Mash: fast genome and metagenome distance estimation using MinHash. Genome Biol. 17, 132 (2016).
pubmed: 27323842 pmcid: 4915045 doi: 10.1186/s13059-016-0997-x
Ning, Z. & Harry, E. Scaff10X https://github.com/wtsi-hpag/Scaff10X .
Morgulis, A., Gertz, E. M., Schäffer, A. A. & Agarwala, R. WindowMasker: window-based masker for sequenced genomes. Bioinformatics 22, 134–141 (2006).
pubmed: 16287941 doi: 10.1093/bioinformatics/bti774
Chin, C.-S. et al. Nonhybrid, finished microbial genome assemblies from long-read SMRT sequencing data. Nat. Methods 10, 563–569 (2013).
pubmed: 23644548 doi: 10.1038/nmeth.2474
Koren, S. et al. Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation. Genome Res. 27, 722–736 (2017).
pubmed: 28298431 pmcid: 5411767 doi: 10.1101/gr.215087.116
Weisenfeld, N. I., Kumar, V., Shah, P., Church, D. M. & Jaffe, D. B. Direct determination of diploid genome sequences. Genome Res. 27, 757–767 (2017).
pubmed: 28381613 pmcid: 5411770 doi: 10.1101/gr.214874.116
Ghurye, J. et al. Integrating Hi-C links with assembly graphs for chromosome-scale assembly. PLoS Comput. Biol. 15, e1007273 (2019).
pubmed: 31433799 pmcid: 6719893 doi: 10.1371/journal.pcbi.1007273
Lieberman-Aiden, E. et al. Comprehensive mapping of long-range interactions reveals folding principles of the human genome. Science 326, 289–293 (2009).
pubmed: 19815776 pmcid: 2858594 doi: 10.1126/science.1181369
Luo, R. et al. SOAPdenovo2: an empirically improved memory-efficient short-read de novo assembler. Gigascience 1, 18 (2012).
pubmed: 23587118 pmcid: 3626529 doi: 10.1186/2047-217X-1-18
English, A. C. et al. Mind the gap: upgrading genomes with Pacific Biosciences RS long-read sequencing technology. PLoS ONE 7, e47768 (2012).
pubmed: 23185243 pmcid: 3504050 doi: 10.1371/journal.pone.0047768
Bishara, A. et al. Read clouds uncover variation in complex regions of the human genome. Genome Res. 25, 1570–1580 (2015).
pubmed: 26286554 pmcid: 4579342 doi: 10.1101/gr.191189.115
Walker, B. J. et al. Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement. PLoS ONE 9, e112963 (2014).
pubmed: 25409509 pmcid: 4237348 doi: 10.1371/journal.pone.0112963
Garrison, E. & Marth, G. Haplotype-based variant detection from short-read sequencing. Preprint at http://arxiv.org/abs/1207.3907 (2012).
Jain, C., Koren, S., Dilthey, A., Phillippy, A. M. & Aluru, S. A fast adaptive algorithm for computing whole-genome homology maps. Bioinformatics 34, i748–i756 (2018).
pubmed: 30423094 pmcid: 6129286 doi: 10.1093/bioinformatics/bty597
Bionano Genomics, Inc. Bionano Software Downloads. https://bionanogenomics.com/support/software-downloads/ .
Arima Genomics, Inc. Arima Genomics Mapping Pipeline. https://github.com/ArimaGenomics/mapping_pipeline .
Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics 25, 1754–1760 (2009).
pubmed: 19451168 pmcid: 2705234 doi: 10.1093/bioinformatics/btp324
Li, H. Minimap2: pairwise alignment for nucleotide sequences. Bioinformatics 34, 3094–3100 (2018).
pubmed: 29750242 pmcid: 6137996 doi: 10.1093/bioinformatics/bty191
Chaisson, M. J. & Tesler, G. Mapping single molecule sequencing reads using basic local alignment with successive refinement (BLASR): application and theory. BMC Bioinformatics 13, 238 (2012).
pubmed: 22988817 pmcid: 3572422 doi: 10.1186/1471-2105-13-238
Li, H. et al. The sequence alignment/map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).
pubmed: 19505943 pmcid: 2723002 doi: 10.1093/bioinformatics/btp352
Dierckxsens, N., Mardulyn, P. & Smits, G. NOVOPlasty: de novo assembly of organelle genomes from whole genome data. Nucleic Acids Res. 45, e18 (2017).
pubmed: 28204566
Soorni, A., Haak, D., Zaitlin, D. & Bombarely, A. Organelle_PBA, a pipeline for assembling chloroplast and mitochondrial genomes from PacBio DNA sequencing data. BMC Genomics 18, 49 (2017).
pubmed: 28061749 pmcid: 5219736 doi: 10.1186/s12864-016-3412-9
Chow, W. et al. gEVAL — a web-based browser for evaluating genome assemblies. Bioinformatics 32, 2508–2510 (2016).
pubmed: 27153597 pmcid: 4978925 doi: 10.1093/bioinformatics/btw159
Durand, N. C. et al. Juicebox provides a visualization system for Hi-C contact maps with unlimited zoom. Cell Syst. 3, 99–101 (2016).
pubmed: 27467250 pmcid: 5596920 doi: 10.1016/j.cels.2015.07.012
Kerpedjiev, P. et al. HiGlass: web-based visual exploration and analysis of genome interaction maps. Genome Biol. 19, 125 (2018).
pubmed: 30143029 pmcid: 6109259 doi: 10.1186/s13059-018-1486-1
Camacho, C. et al. BLAST+: architecture and applications. BMC Bioinformatics 10, 421 (2009).
pubmed: 20003500 pmcid: 2803857 doi: 10.1186/1471-2105-10-421
Harris, R. S. Improved Pairwise Alignment of Genomic DNA. Thesis, Pennsylvania State Univ. (2007).
Kent, W. J., Baertsch, R., Hinrichs, A., Miller, W. & Haussler, D. Evolution’s cauldron: duplication, deletion, and rearrangement in the mouse and human genomes. Proc. Natl Acad. Sci. USA 100, 11484–11489 (2003).
pubmed: 14500911 pmcid: 208784 doi: 10.1073/pnas.1932072100
Kolmogorov, M., Raney, B., Paten, B. & Pham, S. Ragout—a reference-assisted assembly tool for bacterial genomes. Bioinformatics 30, i302–i309 (2014).
pubmed: 24931998 pmcid: 4058940 doi: 10.1093/bioinformatics/btu280
Farré, M. et al. Novel insights into chromosome evolution in birds, archosaurs, and reptiles. Genome Biol. Evol. 8, 2442–2451 (2016).
pubmed: 27401172 pmcid: 5010900 doi: 10.1093/gbe/evw166
Guan, D. Asset. https://github.com/dfguan/asset .
Tarailo-Graovac, M. & Chen, N. Using RepeatMasker to identify repetitive elements in genomic sequences. Curr. Protoc. Bioinformatics 25, 4.10.1–4.10.14 (2009).
doi: 10.1002/0471250953.bi0410s25
Krumsiek, J., Arnold, R. & Rattei, T. Gepard: a rapid and sensitive tool for creating dotplots on genome scale. Bioinformatics 23, 1026–1028 (2007).
pubmed: 17309896 doi: 10.1093/bioinformatics/btm039
Harry, E. PretextView. https://github.com/wtsi-hpag/PretextView .
Kurtz, S. et al. Versatile and open software for comparing large genomes. Genome Biol. 5, R12 (2004).
pubmed: 14759262 pmcid: 395750 doi: 10.1186/gb-2004-5-2-r12
Nattestad, M. Dot. https://github.com/MariaNattestad/dot .

Auteurs

Arang Rhie (A)

Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA.

Shane A McCarthy (SA)

Department of Genetics, University of Cambridge, Cambridge, UK.
Wellcome Sanger Institute, Cambridge, UK.

Olivier Fedrigo (O)

Vertebrate Genome Lab, The Rockefeller University, New York, NY, USA.

Joana Damas (J)

The Genome Center, University of California Davis, Davis, CA, USA.

Giulio Formenti (G)

Vertebrate Genome Lab, The Rockefeller University, New York, NY, USA.
Laboratory of Neurogenetics of Language, The Rockefeller University, New York, NY, USA.

Sergey Koren (S)

Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA.

Marcela Uliano-Silva (M)

Leibniz Institute for Zoo and Wildlife Research, Department of Evolutionary Genetics, Berlin, Germany.
Berlin Center for Genomics in Biodiversity Research, Berlin, Germany.

William Chow (W)

Wellcome Sanger Institute, Cambridge, UK.

Arkarachai Fungtammasan (A)

DNAnexus Inc., Mountain View, CA, USA.

Juwan Kim (J)

Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, Republic of Korea.

Chul Lee (C)

Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, Republic of Korea.

Byung June Ko (BJ)

Department of Agricultural Biotechnology and Research Institute of Agriculture and Life Sciences, Seoul National University, Seoul, Republic of Korea.

Mark Chaisson (M)

University of Southern California, Los Angeles, CA, USA.

Gregory L Gedman (GL)

Laboratory of Neurogenetics of Language, The Rockefeller University, New York, NY, USA.

Lindsey J Cantin (LJ)

Laboratory of Neurogenetics of Language, The Rockefeller University, New York, NY, USA.

Francoise Thibaud-Nissen (F)

National Center for Biotechnology Information, National Library of Medicine, NIH, Bethesda, MD, USA.

Leanne Haggerty (L)

European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, UK.

Iliana Bista (I)

Department of Genetics, University of Cambridge, Cambridge, UK.
Wellcome Sanger Institute, Cambridge, UK.

Michelle Smith (M)

Wellcome Sanger Institute, Cambridge, UK.

Bettina Haase (B)

Vertebrate Genome Lab, The Rockefeller University, New York, NY, USA.

Jacquelyn Mountcastle (J)

Vertebrate Genome Lab, The Rockefeller University, New York, NY, USA.

Sylke Winkler (S)

Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Germany.
DRESDEN-concept Genome Center, Dresden, Germany.

Sadye Paez (S)

Vertebrate Genome Lab, The Rockefeller University, New York, NY, USA.
Laboratory of Neurogenetics of Language, The Rockefeller University, New York, NY, USA.

Jason Howard (J)

Novogene, Durham, NC, USA.

Sonja C Vernes (SC)

Neurogenetics of Vocal Communication Group, Max Planck Institute for Psycholinguistics, Nijmegen, The Netherlands.
Donders Institute for Brain, Cognition and Behaviour, Nijmegen, The Netherlands.
School of Biology, University of St Andrews, St Andrews, UK.

Tanya M Lama (TM)

University of Massachusetts Cooperative Fish and Wildlife Research Unit, Amherst, MA, USA.

Frank Grutzner (F)

School of Biological Science, The Environment Institute, University of Adelaide, Adelaide, South Australia, Australia.

Wesley C Warren (WC)

Bond Life Sciences Center, University of Missouri, Columbia, MO, USA.

Christopher N Balakrishnan (CN)

Department of Biology, East Carolina University, Greenville, NC, USA.

Dave Burt (D)

UQ Genomics, University of Queensland, Brisbane, Queensland, Australia.

Julia M George (JM)

Department of Biological Sciences, Clemson University, Clemson, SC, USA.

Matthew T Biegler (MT)

Laboratory of Neurogenetics of Language, The Rockefeller University, New York, NY, USA.

David Iorns (D)

The Genetic Rescue Foundation, Wellington, New Zealand.

Andrew Digby (A)

Kākāpō Recovery, Department of Conservation, Invercargill, New Zealand.

Daryl Eason (D)

Kākāpō Recovery, Department of Conservation, Invercargill, New Zealand.

Bruce Robertson (B)

Department of Zoology, University of Otago, Dunedin, New Zealand.

Taylor Edwards (T)

University of Arizona Genetics Core, Tucson, AZ, USA.

Mark Wilkinson (M)

Department of Life Sciences, Natural History Museum, London, UK.

George Turner (G)

School of Natural Sciences, Bangor University, Gwynedd, UK.

Axel Meyer (A)

Department of Biology, University of Konstanz, Konstanz, Germany.

Andreas F Kautt (AF)

Department of Biology, University of Konstanz, Konstanz, Germany.
Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA, USA.

Paolo Franchini (P)

Department of Biology, University of Konstanz, Konstanz, Germany.

H William Detrich (HW)

Department of Marine and Environmental Sciences, Northeastern University Marine Science Center, Nahant, MA, USA.

Hannes Svardal (H)

Department of Biology, University of Antwerp, Antwerp, Belgium.
Naturalis Biodiversity Center, Leiden, The Netherlands.

Maximilian Wagner (M)

Institute of Biology, Karl-Franzens University of Graz, Graz, Austria.

Gavin J P Naylor (GJP)

Florida Museum of Natural History, University of Florida, Gainesville, FL, USA.

Martin Pippel (M)

Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Germany.
Center for Systems Biology, Dresden, Germany.

Milan Malinsky (M)

Wellcome Sanger Institute, Cambridge, UK.
Zoological Institute, University of Basel, Basel, Switzerland.

Mark Mooney (M)

Tag.bio, San Francisco, CA, USA.

Maria Simbirsky (M)

DNAnexus Inc., Mountain View, CA, USA.

Brett T Hannigan (BT)

DNAnexus Inc., Mountain View, CA, USA.

Trevor Pesout (T)

UC Santa Cruz Genomics Institute, University of California, Santa Cruz, CA, USA.

Marlys Houck (M)

San Diego Zoo Global, Escondido, CA, USA.

Ann Misuraca (A)

San Diego Zoo Global, Escondido, CA, USA.

Sarah B Kingan (SB)

Pacific Biosciences, Menlo Park, CA, USA.

Richard Hall (R)

Pacific Biosciences, Menlo Park, CA, USA.

Zev Kronenberg (Z)

Pacific Biosciences, Menlo Park, CA, USA.

Ivan Sović (I)

Pacific Biosciences, Menlo Park, CA, USA.
Digital BioLogic, Ivanić-Grad, Croatia.

Christopher Dunn (C)

Pacific Biosciences, Menlo Park, CA, USA.

Zemin Ning (Z)

Wellcome Sanger Institute, Cambridge, UK.

Alex Hastie (A)

Bionano Genomics, San Diego, CA, USA.

Joyce Lee (J)

Bionano Genomics, San Diego, CA, USA.

Siddarth Selvaraj (S)

Arima Genomics, San Diego, CA, USA.

Richard E Green (RE)

UC Santa Cruz Genomics Institute, University of California, Santa Cruz, CA, USA.
Dovetail Genomics, Santa Cruz, CA, USA.

Nicholas H Putnam (NH)

Independent Researcher, Santa Cruz, CA, USA.

Ivo Gut (I)

CNAG-CRG, Centre for Genomic Regulation, Barcelona Institute of Science and Technology, Barcelona, Spain.
Universitat Pompeu Fabra, Barcelona, Spain.

Jay Ghurye (J)

Dovetail Genomics, Santa Cruz, CA, USA.
Department of Computer Science, University of Maryland College Park, College Park, MD, USA.

Erik Garrison (E)

UC Santa Cruz Genomics Institute, University of California, Santa Cruz, CA, USA.

Ying Sims (Y)

Wellcome Sanger Institute, Cambridge, UK.

Joanna Collins (J)

Wellcome Sanger Institute, Cambridge, UK.

Sarah Pelan (S)

Wellcome Sanger Institute, Cambridge, UK.

James Torrance (J)

Wellcome Sanger Institute, Cambridge, UK.

Alan Tracey (A)

Wellcome Sanger Institute, Cambridge, UK.

Jonathan Wood (J)

Wellcome Sanger Institute, Cambridge, UK.

Robel E Dagnew (RE)

University of Southern California, Los Angeles, CA, USA.

Dengfeng Guan (D)

Department of Genetics, University of Cambridge, Cambridge, UK.
School of Computer Science and Technology, Center for Bioinformatics, Harbin Institute of Technology, Harbin, China.

Sarah E London (SE)

Department of Psychology, Institute for Mind and Biology, University of Chicago, Chicago, IL, USA.

David F Clayton (DF)

Department of Genetics and Biochemistry, Clemson University, Clemson, SC, USA.

Claudio V Mello (CV)

Department of Behavioral Neuroscience, Oregon Health and Science University, Portland, OR, USA.

Samantha R Friedrich (SR)

Department of Behavioral Neuroscience, Oregon Health and Science University, Portland, OR, USA.

Peter V Lovell (PV)

Department of Behavioral Neuroscience, Oregon Health and Science University, Portland, OR, USA.

Ekaterina Osipova (E)

Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Germany.
Center for Systems Biology, Dresden, Germany.
Max Planck Institute for the Physics of Complex Systems, Dresden, Germany.

Farooq O Al-Ajli (FO)

Monash University Malaysia Genomics Facility, School of Science, Selangor Darul Ehsan, Malaysia.
Tropical Medicine and Biology Multidisciplinary Platform, Monash University Malaysia, Selangor Darul Ehsan, Malaysia.
Qatar Falcon Genome Project, Doha, Qatar.

Simona Secomandi (S)

Department of Biosciences, University of Milan, Milan, Italy.

Heebal Kim (H)

Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, Republic of Korea.
Department of Agricultural Biotechnology and Research Institute of Agriculture and Life Sciences, Seoul National University, Seoul, Republic of Korea.
eGnome, Inc., Seoul, Republic of Korea.

Constantina Theofanopoulou (C)

Laboratory of Neurogenetics of Language, The Rockefeller University, New York, NY, USA.

Michael Hiller (M)

LOEWE Centre for Translational Biodiversity Genomics, Frankfurt, Germany.
Senckenberg Research Institute, Frankfurt, Germany.
Goethe-University, Faculty of Biosciences, Frankfurt, Germany.

Yang Zhou (Y)

BGI-Shenzhen, Shenzhen, China.

Robert S Harris (RS)

Department of Biology, Pennsylvania State University, University Park, PA, USA.

Kateryna D Makova (KD)

Department of Biology, Pennsylvania State University, University Park, PA, USA.
Center for Medical Genomics, Pennsylvania State University, University Park, PA, USA.
Center for Computational Biology and Bioinformatics, Pennsylvania State University, University Park, PA, USA.

Paul Medvedev (P)

Center for Medical Genomics, Pennsylvania State University, University Park, PA, USA.
Center for Computational Biology and Bioinformatics, Pennsylvania State University, University Park, PA, USA.
Department of Computer Science and Engineering, Pennsylvania State University, University Park, PA, USA.
Department of Biochemistry and Molecular Biology, Pennsylvania State University, University Park, PA, USA.

Jinna Hoffman (J)

National Center for Biotechnology Information, National Library of Medicine, NIH, Bethesda, MD, USA.

Patrick Masterson (P)

National Center for Biotechnology Information, National Library of Medicine, NIH, Bethesda, MD, USA.

Karen Clark (K)

National Center for Biotechnology Information, National Library of Medicine, NIH, Bethesda, MD, USA.

Fergal Martin (F)

European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, UK.

Kevin Howe (K)

European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, UK.

Paul Flicek (P)

European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, UK.

Brian P Walenz (BP)

Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA.

Woori Kwak (W)

eGnome, Inc., Seoul, Republic of Korea.
Hoonygen, Seoul, Korea.

Hiram Clawson (H)

UC Santa Cruz Genomics Institute, University of California, Santa Cruz, CA, USA.

Mark Diekhans (M)

UC Santa Cruz Genomics Institute, University of California, Santa Cruz, CA, USA.

Luis Nassar (L)

UC Santa Cruz Genomics Institute, University of California, Santa Cruz, CA, USA.

Benedict Paten (B)

UC Santa Cruz Genomics Institute, University of California, Santa Cruz, CA, USA.

Robert H S Kraus (RHS)

Department of Biology, University of Konstanz, Konstanz, Germany.
Department of Migration, Max Planck Institute of Animal Behavior, Radolfzell, Germany.

Andrew J Crawford (AJ)

Department of Biological Sciences, Universidad de los Andes, Bogotá, Colombia.

M Thomas P Gilbert (MTP)

Center for Evolutionary Hologenomics, The GLOBE Institute, University of Copenhagen, Copenhagen, Denmark.
University Museum, NTNU, Trondheim, Norway.

Guojie Zhang (G)

China National Genebank, BGI-Shenzhen, Shenzhen, China.
Villum Center for Biodiversity Genomics, Section for Ecology and Evolution, Department of Biology, University of Copenhagen, Copenhagen, Denmark.
State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, China.
Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming, China.

Byrappa Venkatesh (B)

Institute of Molecular and Cell Biology, A*STAR, Biopolis, Singapore, Singapore.

Robert W Murphy (RW)

Centre for Biodiversity, Royal Ontario Museum, Toronto, Ontario, Canada.

Klaus-Peter Koepfli (KP)

Smithsonian Conservation Biology Institute, Center for Species Survival, National Zoological Park, Washington, DC, USA.

Beth Shapiro (B)

Department of Ecology and Evolutionary Biology, University of California Santa Cruz, Santa Cruz, CA, USA.
Howard Hughes Medical Institute, Chevy Chase, MD, USA.

Warren E Johnson (WE)

Smithsonian Conservation Biology Institute, Center for Species Survival, National Zoological Park, Washington, DC, USA.
The Walter Reed Biosystematics Unit, Museum Support Center MRC-534, Smithsonian Institution, Suitland, MD, USA.
Walter Reed Army Institute of Research, Silver Spring, MD, USA.

Federica Di Palma (F)

Department of Biological Sciences, Earlham Institute, University of East Anglia, Norwich, UK.

Tomas Marques-Bonet (T)

Institute of Evolutionary Biology (UPF-CSIC), PRBB, Barcelona, Spain.
Catalan Institution of Research and Advanced Studies (ICREA), Barcelona, Spain.
Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Barcelona, Spain.
Institut Català de Paleontologia Miquel Crusafont, Universitat Autònoma de Barcelona, Barcelona, Spain.

Emma C Teeling (EC)

School of Biology and Environmental Science, University College Dublin, Dublin, Ireland.

Tandy Warnow (T)

Department of Computer Science, The University of Illinois at Urbana-Champaign, Urbana, IL, USA.

Jennifer Marshall Graves (JM)

School of Life Science, La Trobe University, Melbourne, Victoria, Australia.

Oliver A Ryder (OA)

San Diego Zoo Global, Escondido, CA, USA.
Department of Evolution, Behavior, and Ecology, University of California San Diego, La Jolla, CA, USA.

David Haussler (D)

UC Santa Cruz Genomics Institute, University of California, Santa Cruz, CA, USA.
Department of Ecology and Evolutionary Biology, University of California Santa Cruz, Santa Cruz, CA, USA.

Stephen J O'Brien (SJ)

Laboratory of Genomics Diversity-Center for Computer Technologies, ITMO University, St. Petersburg, Russian Federation.
Guy Harvey Oceanographic Center, Halmos College of Natural Sciences and Oceanography, Nova Southeastern University, Fort Lauderdale, FL, USA.

Jonas Korlach (J)

Pacific Biosciences, Menlo Park, CA, USA.

Harris A Lewin (HA)

The Genome Center, University of California Davis, Davis, CA, USA.
Department of Evolution and Ecology, University of California Davis, Davis, CA, USA.
John Muir Institute for the Environment, University of California Davis, Davis, CA, USA.

Kerstin Howe (K)

Wellcome Sanger Institute, Cambridge, UK. kj2@sanger.ac.uk.

Eugene W Myers (EW)

Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Germany. gene@mpi-cbg.de.
Center for Systems Biology, Dresden, Germany. gene@mpi-cbg.de.
Faculty of Computer Science, Technical University Dresden, Dresden, Germany. gene@mpi-cbg.de.

Richard Durbin (R)

Department of Genetics, University of Cambridge, Cambridge, UK. rd109@cam.ac.uk.
Wellcome Sanger Institute, Cambridge, UK. rd109@cam.ac.uk.

Adam M Phillippy (AM)

Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA. adam.phillippy@nih.gov.

Erich D Jarvis (ED)

Vertebrate Genome Lab, The Rockefeller University, New York, NY, USA. ejarvis@rockefeller.edu.
Laboratory of Neurogenetics of Language, The Rockefeller University, New York, NY, USA. ejarvis@rockefeller.edu.
Howard Hughes Medical Institute, Chevy Chase, MD, USA. ejarvis@rockefeller.edu.

Articles similaires

Genome, Chloroplast Phylogeny Genetic Markers Base Composition High-Throughput Nucleotide Sequencing
Robotic Surgical Procedures Animals Humans Telemedicine Models, Animal

Odour generalisation and detection dog training.

Lyn Caldicott, Thomas W Pike, Helen E Zulch et al.
1.00
Animals Odorants Dogs Generalization, Psychological Smell
Animals TOR Serine-Threonine Kinases Colorectal Neoplasms Colitis Mice

Classifications MeSH