Challenges of Adjusting Single-Nucleotide Polymorphism Effect Sizes for Linkage Disequilibrium.

Linkage disequilibrium Polygenic risk score Tikhonov regularisation

Journal

Human heredity
ISSN: 1423-0062
Titre abrégé: Hum Hered
Pays: Switzerland
ID NLM: 0200525

Informations de publication

Date de publication:
12 Feb 2021
Historique:
received: 14 08 2020
accepted: 19 11 2020
entrez: 14 2 2021
pubmed: 15 2 2021
medline: 15 2 2021
Statut: aheadofprint

Résumé

Genome-wide association studies (GWAS) were successful in identifying SNPs showing association with disease, but their individual effect sizes are small and require large sample sizes to achieve statistical significance. Methods of post-GWAS analysis, including gene-based, gene-set and polygenic risk scores, combine the SNP effect sizes in an attempt to boost the power of the analyses. To avoid giving undue weight to SNPs in linkage disequilibrium (LD), the LD needs to be taken into account in these analyses. We review methods that attempt to adjust the effect sizes (β-coefficients) of summary statistics, instead of simple LD pruning. We subject LD adjustment approaches to a mathematical analysis, recognising Tikhonov regularisation as a framework for comparison. Observing the similarity of the processes involved with the more straightforward Tikhonov-regularised ordinary least squares estimate for multivariate regression coefficients, we note that current methods based on a Bayesian model for the effect sizes effectively provide an implicit choice of the regularisation parameter, which is convenient, but at the price of reduced transparency and, especially in smaller LD blocks, a risk of incomplete LD correction. There is no simple answer to the question which method is best, but where interpretability of the LD adjustment is essential, as in research aiming at identifying the genomic aetiology of disorders, our study suggests that a more direct choice of mild regularisation in the correction of effect sizes may be preferable.

Sections du résumé

BACKGROUND BACKGROUND
Genome-wide association studies (GWAS) were successful in identifying SNPs showing association with disease, but their individual effect sizes are small and require large sample sizes to achieve statistical significance. Methods of post-GWAS analysis, including gene-based, gene-set and polygenic risk scores, combine the SNP effect sizes in an attempt to boost the power of the analyses. To avoid giving undue weight to SNPs in linkage disequilibrium (LD), the LD needs to be taken into account in these analyses.
OBJECTIVES OBJECTIVE
We review methods that attempt to adjust the effect sizes (β-coefficients) of summary statistics, instead of simple LD pruning.
METHODS METHODS
We subject LD adjustment approaches to a mathematical analysis, recognising Tikhonov regularisation as a framework for comparison.
RESULTS RESULTS
Observing the similarity of the processes involved with the more straightforward Tikhonov-regularised ordinary least squares estimate for multivariate regression coefficients, we note that current methods based on a Bayesian model for the effect sizes effectively provide an implicit choice of the regularisation parameter, which is convenient, but at the price of reduced transparency and, especially in smaller LD blocks, a risk of incomplete LD correction.
CONCLUSIONS CONCLUSIONS
There is no simple answer to the question which method is best, but where interpretability of the LD adjustment is essential, as in research aiming at identifying the genomic aetiology of disorders, our study suggests that a more direct choice of mild regularisation in the correction of effect sizes may be preferable.

Identifiants

pubmed: 33582669
pii: 000513303
doi: 10.1159/000513303
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

1-11

Subventions

Organisme : Medical Research Council
ID : MR/T033371/1
Pays : United Kingdom
Organisme : Medical Research Council
ID : MR/L010305/1
Pays : United Kingdom
Organisme : Medical Research Council
ID : G0801418
Pays : United Kingdom
Organisme : Medical Research Council
ID : MR/T04604X/1
Pays : United Kingdom
Organisme : Medical Research Council
ID : MR/P005748/1
Pays : United Kingdom
Organisme : Medical Research Council
ID : MR/L023784/2
Pays : United Kingdom

Informations de copyright

© 2021 The Author(s) Published by S. Karger AG, Basel.

Auteurs

Valentina Escott-Price (V)

Division of Psychiatry and Clinical Neurosciences, Cardiff University, Cardiff, United Kingdom, EscottPriceV@cf.ac.uk.
Dementia Research Institute, Cardiff University, Cardiff, United Kingdom, EscottPriceV@cf.ac.uk.

Karl Michael Schmidt (KM)

School of Mathematics, Cardiff University, Cardiff, United Kingdom.

Classifications MeSH