Gamete binning: chromosome-level and haplotype-resolved genome assembly enabled by high-throughput single-cell sequencing of gamete genomes.


Journal

Genome biology
ISSN: 1474-760X
Titre abrégé: Genome Biol
Pays: England
ID NLM: 100960660

Informations de publication

Date de publication:
29 12 2020
Historique:
received: 15 05 2020
accepted: 11 12 2020
entrez: 29 12 2020
pubmed: 30 12 2020
medline: 1 12 2021
Statut: epublish

Résumé

Generating chromosome-level, haplotype-resolved assemblies of heterozygous genomes remains challenging. To address this, we developed gamete binning, a method based on single-cell sequencing of haploid gametes enabling separation of the whole-genome sequencing reads into haplotype-specific reads sets. After assembling the reads of each haplotype, the contigs are scaffolded to chromosome level using a genetic map derived from the gametes. We assemble the two genomes of a diploid apricot tree based on whole-genome sequencing of 445 individual pollen grains. The two haplotype assemblies (N50: 25.5 and 25.8 Mb) feature a haplotyping precision of greater than 99% and are accurately scaffolded to chromosome-level.

Identifiants

pubmed: 33372615
doi: 10.1186/s13059-020-02235-5
pii: 10.1186/s13059-020-02235-5
pmc: PMC7771071
doi:

Types de publication

Journal Article Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

306

Références

Nucleic Acids Res. 2006 Jul 1;34(Web Server issue):W435-9
pubmed: 16845043
Nat Genet. 2018 Nov;50(11):1565-1573
pubmed: 30297971
Comput Struct Biotechnol J. 2019 Dec 09;18:66-72
pubmed: 31908732
Mol Biol Evol. 2020 Dec 16;37(12):3684-3698
pubmed: 32668004
Science. 2014 Jul 18;345(6194):1251788
pubmed: 25035500
Nat Biotechnol. 2015 Mar;33(3):290-5
pubmed: 25690850
Genome Res. 2017 May;27(5):722-736
pubmed: 28298431
Bioinformatics. 2018 Feb 15;34(4):550-557
pubmed: 29444236
PLoS Comput Biol. 2019 Aug 21;15(8):e1007273
pubmed: 31433799
PLoS Comput Biol. 2018 Jan 26;14(1):e1005944
pubmed: 29373581
Bioinformatics. 2018 Sep 15;34(18):3094-3100
pubmed: 29750242
BMC Res Notes. 2018 Aug 13;11(1):584
pubmed: 30103816
Nucleic Acids Res. 2019 Jan 8;47(D1):D807-D811
pubmed: 30395283
Bioinformatics. 2020 Jun 1;36(12):3669-3679
pubmed: 32167530
Nat Plants. 2019 Aug;5(8):833-845
pubmed: 31383970
Nat Biotechnol. 2020 Dec 7;:
pubmed: 33288905
Genome Biol. 2009;10(3):R25
pubmed: 19261174
Genome Res. 2013 May;23(5):826-32
pubmed: 23282328
Nat Genet. 2019 Mar;51(3):541-547
pubmed: 30804557
Gigascience. 2019 Dec 1;8(12):
pubmed: 31816089
Nat Commun. 2020 Feb 20;11(1):989
pubmed: 32080174
BMC Bioinformatics. 2005 Feb 15;6:31
pubmed: 15713233
Hortic Res. 2019 Nov 18;6:128
pubmed: 31754435
Genome Biol. 2019 Dec 16;20(1):277
pubmed: 31842948
Genome Biol. 2020 Feb 7;21(1):30
pubmed: 32033565
Genome Res. 2019 Nov;29(11):1889-1899
pubmed: 31649061
Genome Biol. 2020 Sep 14;21(1):245
pubmed: 32928274
Bioinformatics. 2008 Dec 15;24(24):2938-9
pubmed: 18974171
Ann Bot. 2011 Sep;108(4):617-25
pubmed: 21474504
Bioinformatics. 2015 Oct 1;31(19):3210-2
pubmed: 26059717
Semin Cell Dev Biol. 2013 Aug-Sep;24(8-9):643-52
pubmed: 23665005
Nat Biotechnol. 2019 May;37(5):540-546
pubmed: 30936562
Bioinformatics. 2017 Sep 1;33(17):2759-2761
pubmed: 28472236
BMC Bioinformatics. 2018 Nov 29;19(1):460
pubmed: 30497373
Bioinformatics. 2017 Feb 15;33(4):574-576
pubmed: 27797770
Nat Biotechnol. 2018 Oct 22;:
pubmed: 30346939
Biotechnol Adv. 2014 Jan-Feb;32(1):122-36
pubmed: 24406816
Gigascience. 2017 Oct 1;6(10):1-16
pubmed: 29020750
Bioinformatics. 2011 Nov 1;27(21):2987-93
pubmed: 21903627
Proc Natl Acad Sci U S A. 2011 Jan 4;108(1):12-7
pubmed: 21169219
Nat Commun. 2020 May 19;11(1):2494
pubmed: 32427850
Genome Biol. 2019 Nov 14;20(1):238
pubmed: 31727128
Nat Biotechnol. 2013 Dec;31(12):1111-8
pubmed: 24185094
Nat Commun. 2019 Sep 20;10(1):4310
pubmed: 31541084
Nat Genet. 2013 May;45(5):487-94
pubmed: 23525075
Nat Methods. 2015 Apr;12(4):357-60
pubmed: 25751142
Nat Plants. 2017 Sep;3(9):742-748
pubmed: 28848243
Nat Commun. 2019 Apr 16;10(1):1784
pubmed: 30992455
Ann Bot. 2012 Oct;110(5):1067-78
pubmed: 22875815
Ann Bot. 2007 Oct;100(4):875-88
pubmed: 17684025
PLoS One. 2014 Nov 19;9(11):e112963
pubmed: 25409509
Bioinformatics. 2011 Mar 15;27(6):764-70
pubmed: 21217122
Genome Res. 2013 Feb;23(2):396-408
pubmed: 23149293
G3 (Bethesda). 2015 Jan 13;5(3):385-98
pubmed: 25585881
Genome Biol. 2008 Jan 11;9(1):R7
pubmed: 18190707
Nat Commun. 2019 Sep 20;10(1):4309
pubmed: 31541091
Bioinformatics. 2009 Aug 15;25(16):2078-9
pubmed: 19505943
Genome Biol. 2019 Oct 28;20(1):224
pubmed: 31661016
BMC Genomics. 2019 Apr 8;20(1):275
pubmed: 30961563
Nat Genet. 2017 Apr;49(4):643-650
pubmed: 28263316
Bioinformatics. 2010 Mar 15;26(6):841-2
pubmed: 20110278
BMC Bioinformatics. 2008 Jun 13;9:278
pubmed: 18554390
Bioinformatics. 2004 Nov 1;20(16):2878-9
pubmed: 15145805
J Mol Biol. 1990 Oct 5;215(3):403-10
pubmed: 2231712

Auteurs

José A Campoy (JA)

Department of Chromosome Biology, Max Planck Institute for Plant Breeding Research, Carl-von-Linné-Weg 10, 50829, Cologne, Germany.

Hequan Sun (H)

Department of Chromosome Biology, Max Planck Institute for Plant Breeding Research, Carl-von-Linné-Weg 10, 50829, Cologne, Germany.
Faculty of Biology, LMU Munich, Großhaderner Str. 2, 82152, Planegg-Martinsried, Germany.

Manish Goel (M)

Department of Chromosome Biology, Max Planck Institute for Plant Breeding Research, Carl-von-Linné-Weg 10, 50829, Cologne, Germany.

Wen-Biao Jiao (WB)

Department of Chromosome Biology, Max Planck Institute for Plant Breeding Research, Carl-von-Linné-Weg 10, 50829, Cologne, Germany.

Kat Folz-Donahue (K)

FACS & Imaging Core Facility, Max Planck Institute for Biology of Ageing, 50931, Cologne, Germany.

Nan Wang (N)

Center for Plant Molecular Biology (ZMBP), University of Tübingen, Auf der Morgenstelle 32, 72076, Tübingen, Germany.

Manuel Rubio (M)

Departament of Plant Breeding, CEBAS-CSIC, PO Box 164, E-30100 Espinardo, Murcia, Spain.

Chang Liu (C)

Center for Plant Molecular Biology (ZMBP), University of Tübingen, Auf der Morgenstelle 32, 72076, Tübingen, Germany.
Institute of Biology, University of Hohenheim, Garbenstraße 30, 70599, Stuttgart, Germany.

Christian Kukat (C)

FACS & Imaging Core Facility, Max Planck Institute for Biology of Ageing, 50931, Cologne, Germany.

David Ruiz (D)

Departament of Plant Breeding, CEBAS-CSIC, PO Box 164, E-30100 Espinardo, Murcia, Spain.

Bruno Huettel (B)

Max Planck-Genome-center Cologne, Carl-von-Linné-Weg 10, 50829, Cologne, Germany.

Korbinian Schneeberger (K)

Department of Chromosome Biology, Max Planck Institute for Plant Breeding Research, Carl-von-Linné-Weg 10, 50829, Cologne, Germany. schneeberger@mpipz.mpg.de.
Faculty of Biology, LMU Munich, Großhaderner Str. 2, 82152, Planegg-Martinsried, Germany. schneeberger@mpipz.mpg.de.

Articles similaires

Genome, Chloroplast Phylogeny Genetic Markers Base Composition High-Throughput Nucleotide Sequencing
Humans Macular Degeneration Mendelian Randomization Analysis Life Style Genome-Wide Association Study
Coal Metagenome Phylogeny Bacteria Genome, Bacterial

Classifications MeSH