Low-coverage sequencing cost-effectively detects known and novel variation in underrepresented populations.
Africa
GWAS
GWAS arrays
cost comparison
low-coverage sequencing
study design
whole-genome sequencing
Journal
American journal of human genetics
ISSN: 1537-6605
Titre abrégé: Am J Hum Genet
Pays: United States
ID NLM: 0370475
Informations de publication
Date de publication:
01 04 2021
01 04 2021
Historique:
received:
19
11
2020
accepted:
05
03
2021
pubmed:
27
3
2021
medline:
8
5
2021
entrez:
26
3
2021
Statut:
ppublish
Résumé
Genetic studies in underrepresented populations identify disproportionate numbers of novel associations. However, most genetic studies use genotyping arrays and sequenced reference panels that best capture variation most common in European ancestry populations. To compare data generation strategies best suited for underrepresented populations, we sequenced the whole genomes of 91 individuals to high coverage as part of the Neuropsychiatric Genetics of African Population-Psychosis (NeuroGAP-Psychosis) study with participants from Ethiopia, Kenya, South Africa, and Uganda. We used a downsampling approach to evaluate the quality of two cost-effective data generation strategies, GWAS arrays versus low-coverage sequencing, by calculating the concordance of imputed variants from these technologies with those from deep whole-genome sequencing data. We show that low-coverage sequencing at a depth of ≥4× captures variants of all frequencies more accurately than all commonly used GWAS arrays investigated and at a comparable cost. Lower depths of sequencing (0.5-1×) performed comparably to commonly used low-density GWAS arrays. Low-coverage sequencing is also sensitive to novel variation; 4× sequencing detects 45% of singletons and 95% of common variants identified in high-coverage African whole genomes. Low-coverage sequencing approaches surmount the problems induced by the ascertainment of common genotyping arrays, effectively identify novel variation particularly in underrepresented populations, and present opportunities to enhance variant discovery at a cost similar to traditional approaches.
Identifiants
pubmed: 33770507
pii: S0002-9297(21)00096-3
doi: 10.1016/j.ajhg.2021.03.012
pmc: PMC8059370
pii:
doi:
Types de publication
Journal Article
Research Support, N.I.H., Extramural
Research Support, Non-U.S. Gov't
Langues
eng
Sous-ensembles de citation
IM
Pagination
656-668Subventions
Organisme : NIMH NIH HHS
ID : R01 MH120642
Pays : United States
Organisme : NIMH NIH HHS
ID : K01 MH121659
Pays : United States
Organisme : NIMH NIH HHS
ID : T32 MH017119
Pays : United States
Organisme : NIMH NIH HHS
ID : U01 MH125045
Pays : United States
Organisme : Medical Research Council
ID : MR/M025470/1
Pays : United Kingdom
Organisme : NIMH NIH HHS
ID : R00 MH117229
Pays : United States
Informations de copyright
Copyright © 2021 American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.
Références
PLoS One. 2017 Nov 29;12(11):e0188466
pubmed: 29186155
Nature. 2020 May;581(7809):434-443
pubmed: 32461654
Genome Biol. 2020 Jul 27;21(1):184
pubmed: 32718320
Pharmgenomics Pers Med. 2018 Apr 10;11:59-66
pubmed: 29692621
BMC Med Genomics. 2020 Jan 29;13(1):11
pubmed: 31996208
G3 (Bethesda). 2018 Oct 3;8(10):3255-3267
pubmed: 30131328
BMJ Open. 2019 Feb 19;9(2):e025469
pubmed: 30782936
Nat Genet. 2021 Jan;53(1):120-126
pubmed: 33414550
Nature. 2015 Jul 30;523(7562):588-91
pubmed: 26176920
PLoS Genet. 2019 Dec 23;15(12):e1008500
pubmed: 31869403
Am J Hum Genet. 2009 Feb;84(2):235-50
pubmed: 19215730
Nat Genet. 2017 Feb;49(2):186-192
pubmed: 28067910
PLoS Genet. 2014 Jun 12;10(6):e1004393
pubmed: 24921250
Arch Gen Psychiatry. 2007 Aug;64(8):966-74
pubmed: 17679641
Nat Genet. 2016 Oct;48(10):1279-83
pubmed: 27548312
Am J Hum Genet. 2007 Nov;81(5):1084-97
pubmed: 17924348
PLoS Genet. 2012 Jan;8(1):e1002397
pubmed: 22253600
Genome Med. 2019 Nov 26;11(1):74
pubmed: 31771638
Am J Hum Genet. 2018 Sep 6;103(3):338-348
pubmed: 30100085
Am J Hum Genet. 2015 Jun 4;96(6):986-91
pubmed: 26027499
Nat Genet. 2016 Oct;48(10):1284-1287
pubmed: 27571263
Bioinformatics. 2019 Aug 1;35(15):2555-2561
pubmed: 30576415
Nature. 2014 Jul 24;511(7510):421-7
pubmed: 25056061
BMC Genomics. 2021 Mar 20;22(1):197
pubmed: 33743587
Mol Ecol. 2013 Jun;22(11):3028-35
pubmed: 23174005
Genomics. 2011 Dec;98(6):422-30
pubmed: 21903159
Nat Rev Genet. 2010 Jul;11(7):499-511
pubmed: 20517342
Nature. 2020 Oct;586(7831):741-748
pubmed: 33116287
Nature. 2015 Oct 1;526(7571):68-74
pubmed: 26432245
Curr Opin Genet Dev. 2018 Dec;53:113-120
pubmed: 30240950
Nat Genet. 2019 Dec;51(12):1670-1678
pubmed: 31740837
Nat Genet. 2012 May 20;44(6):631-5
pubmed: 22610117
Nature. 2021 Feb;590(7845):290-299
pubmed: 33568819
Bioessays. 2013 Sep;35(9):780-6
pubmed: 23836388