Resolving the full spectrum of human genome variation using Linked-Reads.
Journal
Genome research
ISSN: 1549-5469
Titre abrégé: Genome Res
Pays: United States
ID NLM: 9518021
Informations de publication
Date de publication:
04 2019
04 2019
Historique:
received:
09
01
2018
accepted:
21
02
2019
pubmed:
22
3
2019
medline:
14
6
2019
entrez:
22
3
2019
Statut:
ppublish
Résumé
Large-scale population analyses coupled with advances in technology have demonstrated that the human genome is more diverse than originally thought. To date, this diversity has largely been uncovered using short-read whole-genome sequencing. However, these short-read approaches fail to give a complete picture of a genome. They struggle to identify structural events, cannot access repetitive regions, and fail to resolve the human genome into haplotypes. Here, we describe an approach that retains long range information while maintaining the advantages of short reads. Starting from ∼1 ng of high molecular weight DNA, we produce barcoded short-read libraries. Novel informatic approaches allow for the barcoded short reads to be associated with their original long molecules producing a novel data type known as "Linked-Reads". This approach allows for simultaneous detection of small and large variants from a single library. In this manuscript, we show the advantages of Linked-Reads over standard short-read approaches for reference-based analysis. Linked-Reads allow mapping to 38 Mb of sequence not accessible to short reads, adding sequence in 423 difficult-to-sequence genes including disease-relevant genes
Identifiants
pubmed: 30894395
pii: gr.234443.118
doi: 10.1101/gr.234443.118
pmc: PMC6442396
doi:
Substances chimiques
Intercellular Signaling Peptides and Proteins
0
Membrane Proteins
0
SMN1 protein, human
0
SMN2 protein, human
0
STRC protein, human
0
Survival of Motor Neuron 1 Protein
0
Survival of Motor Neuron 2 Protein
0
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Pagination
635-645Subventions
Organisme : NCATS NIH HHS
ID : UL1 TR001863
Pays : United States
Informations de copyright
© 2019 Marks et al.; Published by Cold Spring Harbor Laboratory Press.
Références
BMC Genomics. 2012 Aug 05;13:375
pubmed: 22863213
Genome Res. 2017 May;27(5):677-685
pubmed: 27895111
Hum Cell. 2017 Jul;30(3):149-161
pubmed: 28364362
Genome Res. 2017 May;27(5):849-864
pubmed: 28396521
Bioinformatics. 2010 Mar 15;26(6):841-2
pubmed: 20110278
Nat Methods. 2018 Aug;15(8):595-597
pubmed: 30013044
Genome Res. 2017 May;27(5):757-767
pubmed: 28381613
Nature. 2015 Oct 1;526(7571):75-81
pubmed: 26432246
Genome Biol. 2020 Mar 19;21(1):72
pubmed: 32192518
Nat Methods. 2017 Sep;14(9):915-920
pubmed: 28714986
PLoS Genet. 2013;9(8):e1003709
pubmed: 23990802
Nucleic Acids Res. 2018 Feb 28;46(4):e19
pubmed: 29186506
Nat Biotechnol. 2014 Mar;32(3):246-51
pubmed: 24531798
Bioinformatics. 2018 Jan 15;34(2):353-360
pubmed: 29112732
Genet Med. 2016 Dec;18(12):1282-1289
pubmed: 27228465
Genome Biol. 2017 Mar 6;18(1):36
pubmed: 28260531
Genome Res. 2017 Nov;27(11):1950-1960
pubmed: 29021291
Genome Res. 2017 Jan;27(1):157-164
pubmed: 27903644
Nature. 2016 Aug 17;536(7616):285-91
pubmed: 27535533
Genome Biol. 2014 Jun 26;15(6):R84
pubmed: 24970577
PLoS One. 2012;7(1):e30377
pubmed: 22276185
Nature. 2015 Jan 29;517(7536):608-11
pubmed: 25383537
Nat Biotechnol. 2016 Mar;34(3):303-11
pubmed: 26829319
Genome Res. 2015 Oct;25(10):1570-80
pubmed: 26286554
Genome Med. 2017 Jun 19;9(1):57
pubmed: 28629429
Nat Commun. 2017 Jan 24;8:14061
pubmed: 28117401
BMC Genomics. 2016 Jan 16;17:64
pubmed: 26772178
PLoS Genet. 2018 Apr 5;14(4):e1007308
pubmed: 29621242
Nature. 2015 Oct 1;526(7571):68-74
pubmed: 26432245
Nat Genet. 2014 Dec;46(12):1343-9
pubmed: 25326703
Sci Data. 2016 Jun 07;3:160025
pubmed: 27271295
Nat Commun. 2019 Apr 16;10(1):1784
pubmed: 30992455
Nat Rev Genet. 2011 May;12(5):363-76
pubmed: 21358748
Genetics. 2016 Apr;202(4):1251-4
pubmed: 27053122
Sci Rep. 2020 Feb 13;10(1):2512
pubmed: 32054878
Cell. 2018 Jul 12;174(2):433-447.e19
pubmed: 29909985