Towards complete and error-free genome assemblies of all vertebrate species.
Journal
Nature
ISSN: 1476-4687
Titre abrégé: Nature
Pays: England
ID NLM: 0410462
Informations de publication
Date de publication:
04 2021
04 2021
Historique:
received:
22
05
2020
accepted:
12
03
2021
entrez:
29
4
2021
pubmed:
30
4
2021
medline:
11
1
2022
Statut:
ppublish
Résumé
High-quality and complete reference genome assemblies are fundamental for the application of genomics to biology, disease, and biodiversity conservation. However, such assemblies are available for only a few non-microbial species
Identifiants
pubmed: 33911273
doi: 10.1038/s41586-021-03451-0
pii: 10.1038/s41586-021-03451-0
pmc: PMC8081667
doi:
Types de publication
Journal Article
Research Support, N.I.H., Extramural
Research Support, N.I.H., Intramural
Research Support, Non-U.S. Gov't
Langues
eng
Sous-ensembles de citation
IM
Pagination
737-746Subventions
Organisme : NIDCD NIH HHS
ID : R21 DC014432
Pays : United States
Organisme : Wellcome Trust
Pays : United Kingdom
Organisme : NHGRI NIH HHS
ID : R01 HG010485
Pays : United States
Organisme : Intramural NIH HHS
ID : ZIA HG200398
Pays : United States
Organisme : Biotechnology and Biological Sciences Research Council
ID : BBS/E/T/000PR9817
Pays : United Kingdom
Organisme : NIGMS NIH HHS
ID : R01 GM130691
Pays : United States
Organisme : NHGRI NIH HHS
ID : R44 HG008118
Pays : United States
Organisme : Medical Research Council
ID : MR/T021985/1
Pays : United Kingdom
Commentaires et corrections
Type : CommentIn
Références
International Human Genome Sequencing Consortium. Initial sequencing and analysis of the human genome. Nature 409, 860–921 (2001).
doi: 10.1038/35057062
Sulston, J. et al. The C. elegans genome sequencing project: a beginning. Nature 356, 37–41 (1992).
pubmed: 1538779
doi: 10.1038/356037a0
Mouse Genome Sequencing Consortium. Initial sequencing and comparative analysis of the mouse genome. Nature 420, 520–562 (2002).
doi: 10.1038/nature01262
Howe, K. et al. The zebrafish reference genome sequence and its relationship to the human genome. Nature 496, 498–503 (2013).
pubmed: 23594743
pmcid: 3703927
doi: 10.1038/nature12111
Genome 10K Community of Scientists. Genome 10K: a proposal to obtain whole-genome sequence for 10,000 vertebrate species. J. Hered. 100, 659–674 (2009).
pmcid: 2877544
doi: 10.1093/jhered/esp086
Koepfli, K.-P., Paten, B., the Genome 10K Community of Scientists & O’Brien, S. J. The Genome 10K Project: a way forward. Annu. Rev. Anim. Biosci. 3, 57–111 (2015).
pubmed: 25689317
pmcid: 5837290
doi: 10.1146/annurev-animal-090414-014900
Venter, J. C. et al. The sequence of the human genome. Science 291, 1304–1351 (2001).
pubmed: 11181995
doi: 10.1126/science.1058040
Adams, M. D. et al. The genome sequence of Drosophila melanogaster. Science 287, 2185–2195 (2000).
pubmed: 10731132
doi: 10.1126/science.287.5461.2185
Shendure, J. & Ji, H. Next-generation DNA sequencing. Nat. Biotechnol. 26, 1135–1145 (2008).
pubmed: 18846087
doi: 10.1038/nbt1486
Yin, Z.-T. et al. Revisiting avian ‘missing’ genes from de novo assembled transcripts. BMC Genomics 20, 4 (2019).
pubmed: 30611188
pmcid: 6321700
doi: 10.1186/s12864-018-5407-1
Korlach, J. et al. De novo PacBio long-read and phased avian genome assemblies correct and add to reference genes generated with intermediate and short reads. Gigascience 6, 1–16 (2017).
pubmed: 29020750
pmcid: 5632298
doi: 10.1093/gigascience/gix085
Kelley, D. R. & Salzberg, S. L. Detection and correction of false segmental duplications caused by genome mis-assembly. Genome Biol. 11, R28 (2010).
pubmed: 20219098
pmcid: 2864568
doi: 10.1186/gb-2010-11-3-r28
Roach, M. J., Schmidt, S. A. & Borneman, A. R. Purge Haplotigs: allelic contig reassignment for third-gen diploid genome assemblies. BMC Bioinformatics 19, 460 (2018).
pubmed: 30497373
pmcid: 6267036
doi: 10.1186/s12859-018-2485-7
Guan, D. et al. Identifying and removing haplotypic duplication in primary genome assemblies. Bioinformatics 36, 2896–2898 (2020).
pubmed: 31971576
pmcid: 7203741
doi: 10.1093/bioinformatics/btaa025
Bradnam, K. R. et al. Assemblathon 2: evaluating de novo methods of genome assembly in three vertebrate species. Gigascience 2, 10 (2013).
pubmed: 23870653
pmcid: 3844414
doi: 10.1186/2047-217X-2-10
Zhang, G. et al. Comparative genomics reveals insights into avian genome evolution and adaptation. Science 346, 1311–1320 (2014).
pubmed: 25504712
pmcid: 4390078
doi: 10.1126/science.1251385
Chin, C.-S. et al. Phased diploid genome assembly with single-molecule real-time sequencing. Nat. Methods 13, 1050–1054 (2016).
pubmed: 27749838
pmcid: 5503144
doi: 10.1038/nmeth.4035
Bresler, G., Bresler, M. & Tse, D. Optimal assembly for high throughput shotgun sequencing. BMC Bioinformatics 14 (Suppl. 5), S18 (2013).
pubmed: 23902516
pmcid: 3706340
doi: 10.1186/1471-2105-14-S5-S18
Warren, W. C. et al. The genome of a songbird. Nature 464, 757–762 (2010).
pubmed: 20360741
pmcid: 3187626
doi: 10.1038/nature08819
Koren, S. et al. De novo assembly of haplotype-resolved genomes with trio binning. Nat. Biotechnol. (2018).
Koren, S., Phillippy, A. M., Simpson, J. T., Loman, N. J. & Loose, M. Reply to ‘Errors in long-read assemblies can critically affect protein prediction’. Nat. Biotechnol. 37, 127–128 (2019).
pubmed: 30670797
doi: 10.1038/s41587-018-0005-y
Vollger, M. R. et al. Long-read sequence and assembly of segmental duplications. Nat. Methods 16, 88–94 (2019).
pubmed: 30559433
doi: 10.1038/s41592-018-0236-3
Rhie, A., Walenz, B. P., Koren, S. & Phillippy, A. M. Merqury: reference-free quality, completeness, and phasing assessment for genome assemblies. Genome Biol. 21, 245 (2020).
pubmed: 32928274
pmcid: 7488777
doi: 10.1186/s13059-020-02134-9
Waterhouse, R. M. et al. BUSCO applications from quality assessments to gene prediction and phylogenomics. Mol. Biol. Evol. 35, 543–548 (2018).
pubmed: 29220515
doi: 10.1093/molbev/msx319
Howe, K. et al. Significantly improving the quality of genome assemblies through curation. Gigascience 10, giaa153 (2021).
pubmed: 33420778
pmcid: 7794651
doi: 10.1093/gigascience/giaa153
Zhou, Y. et al. Platypus and echidna genomes reveal mammalian biology and evolution. Nature https://doi.org/10.1038/s41586-020-03039-0 (2021).
Kim, J. et al. False gene and chromosome losses affected by assembly and sequence errors. Preprint at https://doi.org/10.1101/2021.04.09.438906 (2021).
Lewin, H. A., Graves, J. A. M., Ryder, O. A., Graphodatsky, A. S. & O’Brien, S. J. Precision nomenclature for the new genomics. Gigascience 8, giz086 (2019).
pubmed: 31437278
pmcid: 6705538
doi: 10.1093/gigascience/giz086
Kronenberg, Z. N. et al. Extended haplotype phasing of de novo genome assemblies with FALCON-Phase. Nat. Commun. https://doi.org/10.1038/s41467-020-20536-y (2021).
Ewing, B., Hillier, L., Wendl, M. C. & Green, P. Base-calling of automated sequencer traces using phred. I. Accuracy assessment. Genome Res. 8, 175–185 (1998).
pubmed: 9521921
doi: 10.1101/gr.8.3.175
Tomaszkiewicz, M., Medvedev, P. & Makova, K. D. Y and W chromosome assemblies: approaches and discoveries. Trends Genet. 33, 266–282 (2017).
pubmed: 28236503
doi: 10.1016/j.tig.2017.01.008
Kolesnikov, A. A. & Gerasimov, E. S. Diversity of mitochondrial genome organization. Biochem. (Mosc.) 77, 1424–1435 (2012).
doi: 10.1134/S0006297912130020
Formenti, G. et al. Complete vertebrate mitogenomes reveal widespread repeats and gene duplications. Genome Biol. (in the press).
Harrison, G. L. A. et al. Four new avian mitochondrial genomes help get to basic evolutionary questions in the late cretaceous. Mol. Biol. Evol. 21, 974–983 (2004).
pubmed: 14739240
doi: 10.1093/molbev/msh065
Zhao, H. et al. The complete mitochondrial genome of the Anabas testudineus (Perciformes, Anabantidae). Mitochondrial DNA A DNA Mapp. Seq. Anal. 27, 1005–1007 (2016).
pubmed: 24960569
doi: 10.3109/19401736.2014.926526
Suzuki, A. et al. How the kinetochore couples microtubule force and centromere stretch to move chromosomes. Nat. Cell Biol. 18, 382–392 (2016).
pubmed: 26974660
pmcid: 4814359
doi: 10.1038/ncb3323
Pfenning, A. R. et al. Convergent transcriptional specializations in the brains of humans and song-learning birds. Science 346, 1256846 (2014).
pubmed: 25504733
pmcid: 4385736
doi: 10.1126/science.1256846
Robinson, R. For mammals, loss of yolk and gain of milk went hand in hand. PLoS Biol. 6, e77 (2008).
pubmed: 20076706
pmcid: 2267822
doi: 10.1371/journal.pbio.0060077
Brandl, K. et al. Yip1 domain family, member 6 (Yipf6) mutation induces spontaneous intestinal inflammation in mice. Proc. Natl Acad. Sci. USA 109, 12650–12655 (2012).
pubmed: 22802641
pmcid: 3412000
doi: 10.1073/pnas.1210366109
Malmstrøm, M. et al. Evolution of the immune system influences speciation rates in teleost fishes. Nat. Genet. 48, 1204–1210 (2016).
pubmed: 27548311
doi: 10.1038/ng.3645
Japundžić-Žigon, N., Lozić, M., Šarenac, O. & Murphy, D. Vasopressin & oxytocin in control of the cardiovascular system: an updated review. Curr. Neuropharmacol. 18, 14–33 (2020).
pubmed: 31544693
pmcid: 7327933
doi: 10.2174/1570159X17666190717150501
Cataldo, I., Azhari, A. & Esposito, G. A review of oxytocin and arginine-vasopressin receptors and their modulation of autism spectrum disorder. Front. Mol. Neurosci. 11, 27 (2018).
pubmed: 29487501
pmcid: 5816822
doi: 10.3389/fnmol.2018.00027
Warren, W. C. et al. Genome analysis of the platypus reveals unique signatures of evolution. Nature 453, 175–183 (2008).
pubmed: 18464734
pmcid: 2803040
doi: 10.1038/nature06936
Ko, B. J. et al. Widespread false gene gains caused by duplication errors in genome assemblies. Preprint at https://doi.org/10.1101/2021.04.09.438957 (2021).
Lemaire, S. et al. Characterizing the interplay between gene nucleotide composition bias and splicing. Genome Biol. 20, 259 (2019).
pubmed: 31783898
pmcid: 6883713
doi: 10.1186/s13059-019-1869-y
Zhang, L., Kasif, S., Cantor, C. R. & Broude, N. E. GC/AT-content spikes as genomic punctuation marks. Proc. Natl Acad. Sci. USA 101, 16855–16860 (2004).
pubmed: 15548610
pmcid: 534751
doi: 10.1073/pnas.0407821101
Jarvis, E. D. et al. Global view of the functional molecular organization of the avian cerebrum: mirror images and functional columns. J. Comp. Neurol. 521, 3614–3665 (2013).
pubmed: 23818122
pmcid: 4145244
doi: 10.1002/cne.23404
Kubikova, L., Wada, K. & Jarvis, E. D. Dopamine receptors in a songbird brain. J. Comp. Neurol. 518, 741–769 (2010).
pubmed: 20058221
pmcid: 2904815
doi: 10.1002/cne.22255
Sémon, M. & Wolfe, K. H. Rearrangement rate following the whole-genome duplication in teleosts. Mol. Biol. Evol. 24, 860–867 (2007).
pubmed: 17218642
doi: 10.1093/molbev/msm003
Jebb, D. et al. Six reference-quality genomes reveal evolution of bat adaptations. Nature 583, 578–584 (2020).
pubmed: 32699395
pmcid: 8075899
doi: 10.1038/s41586-020-2486-3
Schneider, V. A. et al. Evaluation of GRCh38 and de novo haploid genome assemblies demonstrates the enduring quality of the reference assembly. Genome Res. 27, 849–864 (2017).
pubmed: 28396521
pmcid: 5411779
doi: 10.1101/gr.213611.116
Warren, W. C. et al. A new chicken genome assembly provides insight into avian genome structure. G3 (Bethesda) 7, 109–117 (2017).
doi: 10.1534/g3.116.035923
Meredith, R. W. et al. Impacts of the Cretaceous Terrestrial Revolution and KPg extinction on mammal diversification. Science 334, 521–524 (2011).
pubmed: 21940861
doi: 10.1126/science.1211028
Rodriguez-Agudo, D. et al. StarD5: an ER stress protein regulates plasma membrane and intracellular cholesterol homeostasis. J. Lipid Res. 60, 1087–1098 (2019).
pubmed: 31015253
pmcid: 6547630
doi: 10.1194/jlr.M091967
Kim, J. et al. Reconstruction and evolutionary history of eutherian chromosomes. Proc. Natl Acad. Sci. USA 114, E5379–E5388 (2017).
pubmed: 28630326
pmcid: 5502614
doi: 10.1073/pnas.1702012114
Lin, B., Dutta, B. & Fraser, I. D. C. Systematic investigation of multi-TLR sensing identifies regulators of sustained gene activation in macrophages. Cell Syst. 5, 25–37.e3 (2017).
pubmed: 28750197
pmcid: 5584636
doi: 10.1016/j.cels.2017.06.014
Theofanopoulou, C., Gedman, G. L., Cahill, J. A., Boeckx, C. & Jarvis, E. D. Universal nomenclature for oxytocin-vasotocin ligand and receptor families. Nature https://doi.org/10.1038/s41586-020-03040-7 (2021).
Ocampo Daza, D. & Haitina, T. Reconstruction of the carbohydrate 6-O sulfotransferase gene family evolution in vertebrates reveals novel member, CHST16, lost in amniotes. Genome Biol. Evol. 12, 993–1012 (2020).
pubmed: 32652010
doi: 10.1093/gbe/evz274
Damas, J. et al. Broad host range of SARS-CoV-2 predicted by comparative and structural analysis of ACE2 in vertebrates. Proc. Natl Acad. Sci. USA 117, 22311–22322 (2020).
pubmed: 32826334
pmcid: 7486773
doi: 10.1073/pnas.2010146117
Dussex, N. et al. Population genomics reveals the impact of long-term small population size in the critically endangered kākāpō. Cell Genom. (in the press).
Teeling, E. C. et al. Bat biology, genomes, and the Bat1K project: to generate chromosome-level genomes for all living bat species. Annu. Rev. Anim. Biosci. 6, 23–46 (2018).
pubmed: 29166127
doi: 10.1146/annurev-animal-022516-022811
Lewin, H. A. et al. Earth BioGenome Project: sequencing life for the future of life. Proc. Natl Acad. Sci. USA 115, 4325–4333 (2018).
pubmed: 29686065
pmcid: 5924910
doi: 10.1073/pnas.1720115115
Jarvis, E. D. et al. Whole-genome analyses resolve early branches in the tree of life of modern birds. Science 346, 1320–1331 (2014).
pubmed: 25504713
pmcid: 4405904
doi: 10.1126/science.1253451
Li, S. et al. Genomic signatures of near-extinction and rebirth of the crested ibis and other endangered bird species. Genome Biol. 15, 557 (2014).
pubmed: 25496777
pmcid: 4290368
doi: 10.1186/s13059-014-0557-1
Koren, S. & Phillippy, A. M. One chromosome, one contig: complete microbial genomes from long-read sequencing and assembly. Curr. Opin. Microbiol. 23, 110–120 (2015).
pubmed: 25461581
doi: 10.1016/j.mib.2014.11.014
Jenjaroenpun, P. et al. Complete genomic and transcriptional landscape analysis using third-generation sequencing: a case study of Saccharomyces cerevisiae CEN.PK113-7D. Nucleic Acids Res. 46, e38 (2018).
pubmed: 29346625
pmcid: 5909453
doi: 10.1093/nar/gky014
Tyson, J. R. et al. MinION-based long-read sequencing and assembly extends the Caenorhabditis elegans reference genome. Genome Res. 28, 266–274 (2018).
pubmed: 29273626
pmcid: 5793790
doi: 10.1101/gr.221184.117
Miga, K. H. et al. Telomere-to-telomere assembly of a complete human X chromosome. Nature 585, 79–84 (2020).
pubmed: 32663838
pmcid: 7484160
doi: 10.1038/s41586-020-2547-7
Logsdon, G. A. et al. The structure, function and evolution of a complete human chromosome 8. Nature https://doi.org/10.1038/s41586-021-03420-7 (2021).
Beçak, M. L., Beçak, W., Roberts, F. L., Shoffner, R. N. & Volpe, P. (eds.) Chromosome Atlas: Fish, Amphibians, Reptiles, and Birds Vol. 2 (Springer, 1973).
Vurture, G. W. et al. GenomeScope: fast reference-free genome profiling from short reads. Bioinformatics 33, 2202–2204 (2017).
pubmed: 28369201
pmcid: 5870704
doi: 10.1093/bioinformatics/btx153
Kumar, S., Stecher, G., Suleski, M. & Hedges, S. B. TimeTree: a resource for timelines, timetrees, and divergence times. Mol. Biol. Evol. 34, 1812–1819 (2017).
pubmed: 28387841
doi: 10.1093/molbev/msx116
Ondov, B. D. et al. Mash: fast genome and metagenome distance estimation using MinHash. Genome Biol. 17, 132 (2016).
pubmed: 27323842
pmcid: 4915045
doi: 10.1186/s13059-016-0997-x
Ning, Z. & Harry, E. Scaff10X https://github.com/wtsi-hpag/Scaff10X .
Morgulis, A., Gertz, E. M., Schäffer, A. A. & Agarwala, R. WindowMasker: window-based masker for sequenced genomes. Bioinformatics 22, 134–141 (2006).
pubmed: 16287941
doi: 10.1093/bioinformatics/bti774
Chin, C.-S. et al. Nonhybrid, finished microbial genome assemblies from long-read SMRT sequencing data. Nat. Methods 10, 563–569 (2013).
pubmed: 23644548
doi: 10.1038/nmeth.2474
Koren, S. et al. Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation. Genome Res. 27, 722–736 (2017).
pubmed: 28298431
pmcid: 5411767
doi: 10.1101/gr.215087.116
Weisenfeld, N. I., Kumar, V., Shah, P., Church, D. M. & Jaffe, D. B. Direct determination of diploid genome sequences. Genome Res. 27, 757–767 (2017).
pubmed: 28381613
pmcid: 5411770
doi: 10.1101/gr.214874.116
Ghurye, J. et al. Integrating Hi-C links with assembly graphs for chromosome-scale assembly. PLoS Comput. Biol. 15, e1007273 (2019).
pubmed: 31433799
pmcid: 6719893
doi: 10.1371/journal.pcbi.1007273
Lieberman-Aiden, E. et al. Comprehensive mapping of long-range interactions reveals folding principles of the human genome. Science 326, 289–293 (2009).
pubmed: 19815776
pmcid: 2858594
doi: 10.1126/science.1181369
Luo, R. et al. SOAPdenovo2: an empirically improved memory-efficient short-read de novo assembler. Gigascience 1, 18 (2012).
pubmed: 23587118
pmcid: 3626529
doi: 10.1186/2047-217X-1-18
English, A. C. et al. Mind the gap: upgrading genomes with Pacific Biosciences RS long-read sequencing technology. PLoS ONE 7, e47768 (2012).
pubmed: 23185243
pmcid: 3504050
doi: 10.1371/journal.pone.0047768
Bishara, A. et al. Read clouds uncover variation in complex regions of the human genome. Genome Res. 25, 1570–1580 (2015).
pubmed: 26286554
pmcid: 4579342
doi: 10.1101/gr.191189.115
Walker, B. J. et al. Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement. PLoS ONE 9, e112963 (2014).
pubmed: 25409509
pmcid: 4237348
doi: 10.1371/journal.pone.0112963
Garrison, E. & Marth, G. Haplotype-based variant detection from short-read sequencing. Preprint at http://arxiv.org/abs/1207.3907 (2012).
Jain, C., Koren, S., Dilthey, A., Phillippy, A. M. & Aluru, S. A fast adaptive algorithm for computing whole-genome homology maps. Bioinformatics 34, i748–i756 (2018).
pubmed: 30423094
pmcid: 6129286
doi: 10.1093/bioinformatics/bty597
Bionano Genomics, Inc. Bionano Software Downloads. https://bionanogenomics.com/support/software-downloads/ .
Arima Genomics, Inc. Arima Genomics Mapping Pipeline. https://github.com/ArimaGenomics/mapping_pipeline .
Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics 25, 1754–1760 (2009).
pubmed: 19451168
pmcid: 2705234
doi: 10.1093/bioinformatics/btp324
Li, H. Minimap2: pairwise alignment for nucleotide sequences. Bioinformatics 34, 3094–3100 (2018).
pubmed: 29750242
pmcid: 6137996
doi: 10.1093/bioinformatics/bty191
Chaisson, M. J. & Tesler, G. Mapping single molecule sequencing reads using basic local alignment with successive refinement (BLASR): application and theory. BMC Bioinformatics 13, 238 (2012).
pubmed: 22988817
pmcid: 3572422
doi: 10.1186/1471-2105-13-238
Li, H. et al. The sequence alignment/map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).
pubmed: 19505943
pmcid: 2723002
doi: 10.1093/bioinformatics/btp352
Dierckxsens, N., Mardulyn, P. & Smits, G. NOVOPlasty: de novo assembly of organelle genomes from whole genome data. Nucleic Acids Res. 45, e18 (2017).
pubmed: 28204566
Soorni, A., Haak, D., Zaitlin, D. & Bombarely, A. Organelle_PBA, a pipeline for assembling chloroplast and mitochondrial genomes from PacBio DNA sequencing data. BMC Genomics 18, 49 (2017).
pubmed: 28061749
pmcid: 5219736
doi: 10.1186/s12864-016-3412-9
Chow, W. et al. gEVAL — a web-based browser for evaluating genome assemblies. Bioinformatics 32, 2508–2510 (2016).
pubmed: 27153597
pmcid: 4978925
doi: 10.1093/bioinformatics/btw159
Durand, N. C. et al. Juicebox provides a visualization system for Hi-C contact maps with unlimited zoom. Cell Syst. 3, 99–101 (2016).
pubmed: 27467250
pmcid: 5596920
doi: 10.1016/j.cels.2015.07.012
Kerpedjiev, P. et al. HiGlass: web-based visual exploration and analysis of genome interaction maps. Genome Biol. 19, 125 (2018).
pubmed: 30143029
pmcid: 6109259
doi: 10.1186/s13059-018-1486-1
Camacho, C. et al. BLAST+: architecture and applications. BMC Bioinformatics 10, 421 (2009).
pubmed: 20003500
pmcid: 2803857
doi: 10.1186/1471-2105-10-421
Harris, R. S. Improved Pairwise Alignment of Genomic DNA. Thesis, Pennsylvania State Univ. (2007).
Kent, W. J., Baertsch, R., Hinrichs, A., Miller, W. & Haussler, D. Evolution’s cauldron: duplication, deletion, and rearrangement in the mouse and human genomes. Proc. Natl Acad. Sci. USA 100, 11484–11489 (2003).
pubmed: 14500911
pmcid: 208784
doi: 10.1073/pnas.1932072100
Kolmogorov, M., Raney, B., Paten, B. & Pham, S. Ragout—a reference-assisted assembly tool for bacterial genomes. Bioinformatics 30, i302–i309 (2014).
pubmed: 24931998
pmcid: 4058940
doi: 10.1093/bioinformatics/btu280
Farré, M. et al. Novel insights into chromosome evolution in birds, archosaurs, and reptiles. Genome Biol. Evol. 8, 2442–2451 (2016).
pubmed: 27401172
pmcid: 5010900
doi: 10.1093/gbe/evw166
Guan, D. Asset. https://github.com/dfguan/asset .
Tarailo-Graovac, M. & Chen, N. Using RepeatMasker to identify repetitive elements in genomic sequences. Curr. Protoc. Bioinformatics 25, 4.10.1–4.10.14 (2009).
doi: 10.1002/0471250953.bi0410s25
Krumsiek, J., Arnold, R. & Rattei, T. Gepard: a rapid and sensitive tool for creating dotplots on genome scale. Bioinformatics 23, 1026–1028 (2007).
pubmed: 17309896
doi: 10.1093/bioinformatics/btm039
Harry, E. PretextView. https://github.com/wtsi-hpag/PretextView .
Kurtz, S. et al. Versatile and open software for comparing large genomes. Genome Biol. 5, R12 (2004).
pubmed: 14759262
pmcid: 395750
doi: 10.1186/gb-2004-5-2-r12
Nattestad, M. Dot. https://github.com/MariaNattestad/dot .