Fully phased human genome assembly without parental data using single-cell strand sequencing and long reads.


Journal

Nature biotechnology
ISSN: 1546-1696
Titre abrégé: Nat Biotechnol
Pays: United States
ID NLM: 9604648

Informations de publication

Date de publication:
03 2021
Historique:
received: 22 11 2019
accepted: 16 09 2020
pubmed: 9 12 2020
medline: 15 4 2021
entrez: 8 12 2020
Statut: ppublish

Résumé

Human genomes are typically assembled as consensus sequences that lack information on parental haplotypes. Here we describe a reference-free workflow for diploid de novo genome assembly that combines the chromosome-wide phasing and scaffolding capabilities of single-cell strand sequencing

Identifiants

pubmed: 33288906
doi: 10.1038/s41587-020-0719-5
pii: 10.1038/s41587-020-0719-5
pmc: PMC7954704
mid: NIHMS1663376
doi:

Types de publication

Journal Article Research Support, N.I.H., Extramural Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

302-308

Subventions

Organisme : NHGRI NIH HHS
ID : R01 HG002898
Pays : United States
Organisme : NHGRI NIH HHS
ID : R01 HG010169
Pays : United States
Organisme : NHGRI NIH HHS
ID : T32 HG008345
Pays : United States
Organisme : NHGRI NIH HHS
ID : U01 HG010971
Pays : United States
Organisme : NHGRI NIH HHS
ID : R01 HG010485
Pays : United States
Organisme : NHGRI NIH HHS
ID : U24 HG007497
Pays : United States
Organisme : NHGRI NIH HHS
ID : R01 HG002385
Pays : United States
Organisme : Howard Hughes Medical Institute
Pays : United States
Organisme : NCI NIH HHS
ID : P30 CA034196
Pays : United States

Références

Bioinformatics. 2018 Jul 1;34(13):i115-i123
pubmed: 29949971
Elife. 2017 Dec 12;6:
pubmed: 29231811
Genome Res. 2017 May;27(5):677-685
pubmed: 27895111
Genome Res. 2017 May;27(5):757-767
pubmed: 28381613
Nat Biotechnol. 2014 Mar;32(3):246-51
pubmed: 24531798
Nat Methods. 2012 Nov;9(11):1107-12
pubmed: 23042453
Nat Genet. 2020 Aug;52(8):849-858
pubmed: 32541924
Nature. 2015 Jan 29;517(7536):608-11
pubmed: 25383537
Bioinformatics. 2020 Feb 15;36(4):1260-1261
pubmed: 31504176
Bioinformatics. 2009 Aug 15;25(16):2078-9
pubmed: 19505943
Bioinformatics. 2010 Mar 15;26(6):841-2
pubmed: 20110278
Bioinformatics. 2012 Jul 15;28(14):1919-20
pubmed: 22576172
Nat Biotechnol. 2020 Mar;38(3):343-354
pubmed: 31873213
Science. 2017 Apr 7;356(6333):92-95
pubmed: 28336562
Cell Syst. 2016 Jul;3(1):95-8
pubmed: 27467249
Cell. 2019 Jan 24;176(3):663-675.e19
pubmed: 30661756
Brief Bioinform. 2018 Jan 1;19(1):118-135
pubmed: 27769991
Genome Biol. 2019 Jun 3;20(1):116
pubmed: 31159868
Nat Commun. 2019 Oct 11;10(1):4660
pubmed: 31604920
Nat Biotechnol. 2018 Nov;36(10):983-987
pubmed: 30247488
Bioinformatics. 2012 Oct 1;28(19):2520-2
pubmed: 22908215
Nat Commun. 2017 Nov 3;8(1):1293
pubmed: 29101320
Science. 2018 Jun 8;360(6393):
pubmed: 29880660
Genome Res. 2017 May;27(5):737-746
pubmed: 28100585
Nat Protoc. 2017 Jun;12(6):1151-1176
pubmed: 28492527
Nucleic Acids Res. 1999 Jan 15;27(2):573-80
pubmed: 9862982
Nature. 2020 Sep;585(7823):79-84
pubmed: 32663838
J Comput Biol. 2015 Jun;22(6):498-509
pubmed: 25658651
Genome Res. 2016 Nov;26(11):1565-1574
pubmed: 27646535
PLoS Biol. 2007 Sep 4;5(10):e254
pubmed: 17803354
Nat Methods. 2019 Jan;16(1):88-94
pubmed: 30559433
Ann Hum Genet. 2020 Mar;84(2):125-140
pubmed: 31711268
Nucleic Acids Res. 2012 May;40(9):e69
pubmed: 22302147
Nat Biotechnol. 2019 May;37(5):540-546
pubmed: 30936562
Genome Res. 2016 Nov;26(11):1575-1587
pubmed: 27472961
Nat Biotechnol. 2019 Oct;37(10):1155-1162
pubmed: 31406327
Int J Mol Sci. 2021 Mar 31;22(7):
pubmed: 33807210
Genome Res. 2017 May;27(5):722-736
pubmed: 28298431
Nat Biotechnol. 2018 Oct 22;:
pubmed: 30346939
Genome Res. 2017 May;27(5):849-864
pubmed: 28396521
Nat Biotechnol. 2020 Sep;38(9):1044-1053
pubmed: 32686750
Nat Biotechnol. 2021 Mar;39(3):309-312
pubmed: 33288905
Bioinformatics. 2017 Sep 01;33(17):2737-2739
pubmed: 28475666
Nat Methods. 2020 Feb;17(2):155-158
pubmed: 31819265
Nature. 2015 Oct 1;526(7571):68-74
pubmed: 26432245
Genome Med. 2013 Sep 13;5(9):82
pubmed: 24028793
Nucleic Acids Res. 2016 Aug 19;44(14):6787-93
pubmed: 27185886
Nat Commun. 2019 Apr 16;10(1):1784
pubmed: 30992455
Nat Methods. 2016 Dec;13(12):1050-1054
pubmed: 27749838
Nat Rev Genet. 2015 Nov;16(11):627-40
pubmed: 26442640
Bioinformatics. 2018 Sep 15;34(18):3094-3100
pubmed: 29750242

Auteurs

David Porubsky (D)

Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA.

Peter Ebert (P)

Heinrich Heine University Düsseldorf, Medical Faculty, Institute for Medical Biometry and Bioinformatics, Düsseldorf, Germany.

Peter A Audano (PA)

Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA.

Mitchell R Vollger (MR)

Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA.

William T Harvey (WT)

Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA.

Pierre Marijon (P)

Heinrich Heine University Düsseldorf, Medical Faculty, Institute for Medical Biometry and Bioinformatics, Düsseldorf, Germany.

Jana Ebler (J)

Heinrich Heine University Düsseldorf, Medical Faculty, Institute for Medical Biometry and Bioinformatics, Düsseldorf, Germany.

Katherine M Munson (KM)

Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA.

Melanie Sorensen (M)

Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA.

Arvis Sulovari (A)

Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA.

Marina Haukness (M)

UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA, USA.

Maryam Ghareghani (M)

Heinrich Heine University Düsseldorf, Medical Faculty, Institute for Medical Biometry and Bioinformatics, Düsseldorf, Germany.
Center for Bioinformatics, Saarland University, and Max Planck Institute for Informatics, Saarbrücken, Germany.

Peter M Lansdorp (PM)

Terry Fox Laboratory, BC Cancer Agency, Vancouver, British Columbia, Canada.
Department of Medical Genetics, University of British Columbia, Vancouver, British Columbia, Canada.

Benedict Paten (B)

UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA, USA.

Scott E Devine (SE)

Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD, USA.

Ashley D Sanders (AD)

European Molecular Biology Laboratory, Genome Biology Unit, Heidelberg, Germany.

Charles Lee (C)

The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA.
The First Affiliated Hospital of Xi'an Jiaotong University, Xi'an, China.
Department of Life Science, Ewha Womans University, Seoul, Republic of Korea.

Mark J P Chaisson (MJP)

Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA.

Jan O Korbel (JO)

European Molecular Biology Laboratory, Genome Biology Unit, Heidelberg, Germany.

Evan E Eichler (EE)

Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA. eee@gs.washington.edu.
Howard Hughes Medical Institute, University of Washington, Seattle, WA, USA. eee@gs.washington.edu.

Tobias Marschall (T)

Heinrich Heine University Düsseldorf, Medical Faculty, Institute for Medical Biometry and Bioinformatics, Düsseldorf, Germany. tobias.marschall@hhu.de.

Articles similaires

Genome, Chloroplast Phylogeny Genetic Markers Base Composition High-Throughput Nucleotide Sequencing

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C

Classifications MeSH