Accuracy of haplotype estimation and whole genome imputation affects complex trait analyses in complex biobanks.


Journal

Communications biology
ISSN: 2399-3642
Titre abrégé: Commun Biol
Pays: England
ID NLM: 101719179

Informations de publication

Date de publication:
26 01 2023
Historique:
received: 26 07 2022
accepted: 12 01 2023
entrez: 25 1 2023
pubmed: 26 1 2023
medline: 28 1 2023
Statut: epublish

Résumé

Sample recruitment for research consortia, biobanks, and personal genomics companies span years, necessitating genotyping in batches, using different technologies. As marker content on genotyping arrays varies, integrating such datasets is non-trivial and its impact on haplotype estimation (phasing) and whole genome imputation, necessary steps for complex trait analysis, remains under-evaluated. Using the iPSYCH dataset, comprising 130,438 individuals, genotyped in two stages, on different arrays, we evaluated phasing and imputation performance across multiple phasing methods and data integration protocols. While phasing accuracy varied by choice of method and data integration protocol, imputation accuracy varied mostly between data integration protocols. We demonstrate an attenuation in imputation accuracy within samples of non-European origin, highlighting challenges to studying complex traits in diverse populations. Finally, imputation errors can bias association tests, reduce predictive utility of polygenic scores. Carefully optimized data integration strategies enhance accuracy and replicability of complex trait analyses in complex biobanks.

Identifiants

pubmed: 36697501
doi: 10.1038/s42003-023-04477-y
pii: 10.1038/s42003-023-04477-y
pmc: PMC9876938
doi:

Types de publication

Journal Article Research Support, N.I.H., Extramural Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

101

Subventions

Organisme : U.S. Department of Health & Human Services | NIH | National Institute of Mental Health (NIMH)
ID : R01MH130581
Organisme : U.S. Department of Health & Human Services | NIH | National Institute on Aging (U.S. National Institute on Aging)
ID : UH2 AG064706, U19 AG023122

Informations de copyright

© 2023. The Author(s).

Références

Am J Hum Genet. 2012 Jan 13;90(1):7-24
pubmed: 22243964
Am J Hum Genet. 2017 Jul 6;101(1):5-22
pubmed: 28686856
Nat Genet. 2016 Oct;48(10):1279-83
pubmed: 27548312
Bioinformatics. 2011 Nov 1;27(21):2987-93
pubmed: 21903627
Bioinformatics. 2010 Nov 15;26(22):2867-73
pubmed: 20926424
PLoS Genet. 2019 Dec 23;15(12):e1008500
pubmed: 31869403
Nat Commun. 2019 Nov 28;10(1):5436
pubmed: 31780650
Nat Rev Genet. 2011 Sep 16;12(10):703-14
pubmed: 21921926
Sci Data. 2019 Oct 31;6(1):257
pubmed: 31672996
J Inherit Metab Dis. 2007 Aug;30(4):530-6
pubmed: 17632694
Annu Rev Genomics Hum Genet. 2013;14:441-65
pubmed: 23724904
Hum Genet. 2012 Jan;131(1):111-9
pubmed: 21735171
Genetics. 2015 Aug;200(4):1285-95
pubmed: 26092716
Gigascience. 2019 Jul 1;8(7):
pubmed: 31307061
Biol Psychiatry. 2021 Nov 1;90(9):611-620
pubmed: 34304866
Annu Rev Genomics Hum Genet. 2018 Aug 31;19:73-96
pubmed: 29799802
Pharmacogenomics. 2009 Feb;10(2):191-201
pubmed: 19207020
Eur J Hum Genet. 2012 May;20(5):572-6
pubmed: 22189269
Hum Mol Genet. 2018 Oct 15;27(20):3641-3649
pubmed: 30124842
Am J Hum Genet. 2018 Sep 6;103(3):338-348
pubmed: 30100085
Nat Genet. 2016 Oct;48(10):1284-1287
pubmed: 27571263
Gigascience. 2015 Feb 25;4:7
pubmed: 25722852
Genet Epidemiol. 2010 Sep;34(6):537-42
pubmed: 20717975
Gigascience. 2021 Feb 16;10(2):
pubmed: 33590861
Am J Hum Genet. 2017 Apr 6;100(4):635-649
pubmed: 28366442
Dan Med Bull. 1997 Feb;44(1):82-4
pubmed: 9062767
Nat Genet. 2019 Apr;51(4):584-591
pubmed: 30926966
Annu Rev Genomics Hum Genet. 2009;10:387-406
pubmed: 19715440
Am J Hum Genet. 2011 Jan 7;88(1):76-82
pubmed: 21167468
PLoS Genet. 2020 Nov 16;16(11):e1009049
pubmed: 33196638
Genetics. 2003 Dec;165(4):2213-33
pubmed: 14704198
BMC Res Notes. 2014 Dec 11;7:901
pubmed: 25495213
Eur J Epidemiol. 2014 Aug;29(8):541-9
pubmed: 24965263
Nat Genet. 2016 Jul;48(7):811-6
pubmed: 27270109
Nature. 2018 Oct;562(7726):203-209
pubmed: 30305743
Nat Genet. 2018 Aug;50(8):1112-1121
pubmed: 30038396
Nat Rev Genet. 2010 Jul;11(7):499-511
pubmed: 20517342
J Dairy Sci. 2015 Jun;98(6):4131-8
pubmed: 25841966
PLoS Genet. 2018 Apr 5;14(4):e1007308
pubmed: 29621242
Scand J Public Health. 2011 Jul;39(7 Suppl):54-7
pubmed: 21775352
Nature. 2015 Oct 1;526(7571):68-74
pubmed: 26432245
Genome Med. 2020 Nov 23;12(1):100
pubmed: 33225976
Scand J Public Health. 2011 Jul;39(7 Suppl):22-5
pubmed: 21775345
Am J Hum Genet. 2007 Sep;81(3):559-75
pubmed: 17701901
Am J Hum Genet. 2008 Jul;83(1):132-5; author reply 135-9
pubmed: 18606306
Nat Genet. 2018 Sep;50(9):1219-1224
pubmed: 30104762
Genome Biol. 2016 Mar 23;17:53
pubmed: 27009100
Nat Genet. 2016 Nov;48(11):1443-1448
pubmed: 27694958
Sci Data. 2016 Jun 07;3:160025
pubmed: 27271295
Mol Psychiatry. 2018 Jan;23(1):6-14
pubmed: 28924187
Nat Genet. 2006 Aug;38(8):904-9
pubmed: 16862161
Hum Genet. 2013 May;132(5):509-22
pubmed: 23334152
Cell. 2020 Feb 6;180(3):568-584.e23
pubmed: 31981491

Auteurs

Vivek Appadurai (V)

Institute of Biological Psychiatry, Mental Health Center Sankt Hans, Roskilde, 4000, Denmark. vivek.appadurai@regionh.dk.
The Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCH, Aarhus, Denmark. vivek.appadurai@regionh.dk.

Jonas Bybjerg-Grauholm (J)

The Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCH, Aarhus, Denmark.
Danish Center for Neonatal Screening, Statens Serum Institut, Copenhagen, Denmark.

Morten Dybdahl Krebs (MD)

Institute of Biological Psychiatry, Mental Health Center Sankt Hans, Roskilde, 4000, Denmark.
The Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCH, Aarhus, Denmark.

Anders Rosengren (A)

Institute of Biological Psychiatry, Mental Health Center Sankt Hans, Roskilde, 4000, Denmark.
The Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCH, Aarhus, Denmark.

Alfonso Buil (A)

Institute of Biological Psychiatry, Mental Health Center Sankt Hans, Roskilde, 4000, Denmark.
The Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCH, Aarhus, Denmark.

Andrés Ingason (A)

Institute of Biological Psychiatry, Mental Health Center Sankt Hans, Roskilde, 4000, Denmark.
The Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCH, Aarhus, Denmark.

Ole Mors (O)

The Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCH, Aarhus, Denmark.
Psychosis Research Unit, Aarhus University Hospital - Psychiatry, Aarhus, Denmark.

Anders D Børglum (AD)

The Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCH, Aarhus, Denmark.
Department of Biomedicine and Center for Integrative Sequencing, iSEQ, Aarhus University, Aarhus, Denmark.
Center for Genomics and Personalized Medicine, CGPM, Aarhus University, Aarhus, Denmark.

David M Hougaard (DM)

The Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCH, Aarhus, Denmark.
Danish Center for Neonatal Screening, Statens Serum Institut, Copenhagen, Denmark.

Merete Nordentoft (M)

The Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCH, Aarhus, Denmark.
Mental Health Services in the Capital Region of Denmark, Copenhagen, Denmark.
Department of Clinical Medicine, Faculty of Health Sciences, University of Copenhagen, Copenhagen, Denmark.

Preben B Mortensen (PB)

The Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCH, Aarhus, Denmark.
NCRR - National Center for Register-Based Research, Business and Social Sciences, Aarhus University, Aarhus, Denmark.
CIRRAU - Centre for Integrated Register-Based Research, Aarhus University, Aarhus, Denmark.

Olivier Delaneau (O)

Department of Computational Biology, University of Lausanne, Lausanne, Switzerland.

Thomas Werge (T)

Institute of Biological Psychiatry, Mental Health Center Sankt Hans, Roskilde, 4000, Denmark.
The Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCH, Aarhus, Denmark.

Andrew J Schork (AJ)

Institute of Biological Psychiatry, Mental Health Center Sankt Hans, Roskilde, 4000, Denmark. andrew.joseph.schork@regionh.dk.
The Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCH, Aarhus, Denmark. andrew.joseph.schork@regionh.dk.
The Translational Genomics Research Institute, Phoenix, AZ, USA. andrew.joseph.schork@regionh.dk.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH