sumSTAAR: A flexible framework for gene-based association studies using GWAS summary statistics.
Journal
PLoS computational biology
ISSN: 1553-7358
Titre abrégé: PLoS Comput Biol
Pays: United States
ID NLM: 101238922
Informations de publication
Date de publication:
06 2022
06 2022
Historique:
received:
01
11
2021
accepted:
05
05
2022
revised:
14
06
2022
pubmed:
3
6
2022
medline:
18
6
2022
entrez:
2
6
2022
Statut:
epublish
Résumé
Gene-based association analysis is an effective gene-mapping tool. Many gene-based methods have been proposed recently. However, their power depends on the underlying genetic architecture, which is rarely known in complex traits, and so it is likely that a combination of such methods could serve as a universal approach. Several frameworks combining different gene-based methods have been developed. However, they all imply a fixed set of methods, weights and functional annotations. Moreover, most of them use individual phenotypes and genotypes as input data. Here, we introduce sumSTAAR, a framework for gene-based association analysis using summary statistics obtained from genome-wide association studies (GWAS). It is an extended and modified version of STAAR framework proposed by Li and colleagues in 2020. The sumSTAAR framework offers a wider range of gene-based methods to combine. It allows the user to arbitrarily define a set of these methods, weighting functions and probabilities of genetic variants being causal. The methods used in the framework were adapted to analyse genes with large number of SNPs to decrease the running time. The framework includes the polygene pruning procedure to guard against the influence of the strong GWAS signals outside the gene. We also present new improved matrices of correlations between the genotypes of variants within genes. These matrices estimated on a sample of 265,000 individuals are a state-of-the-art replacement of widely used matrices based on the 1000 Genomes Project data.
Identifiants
pubmed: 35653402
doi: 10.1371/journal.pcbi.1010172
pii: PCOMPBIOL-D-21-01971
pmc: PMC9197066
doi:
Types de publication
Journal Article
Research Support, Non-U.S. Gov't
Langues
eng
Sous-ensembles de citation
IM
Pagination
e1010172Déclaration de conflit d'intérêts
The authors have declared that no competing interests exist.
Références
Genetics. 2017 Nov;207(3):883-891
pubmed: 28878002
PLoS Genet. 2020 Dec 15;16(12):e1009060
pubmed: 33320851
Nat Genet. 2012 Mar 18;44(4):369-75, S1-3
pubmed: 22426310
Nat Genet. 2018 Sep;50(9):1335-1341
pubmed: 30104761
Am J Hum Genet. 2019 Mar 7;104(3):410-421
pubmed: 30849328
PLoS One. 2020 Jun 19;15(6):e0233847
pubmed: 32559184
Nat Commun. 2018 Mar 2;9(1):905
pubmed: 29500382
Sci Rep. 2021 Jan 28;11(1):2484
pubmed: 33510330
Am J Hum Genet. 2012 Aug 10;91(2):224-37
pubmed: 22863193
Am J Hum Genet. 2017 Oct 5;101(4):539-551
pubmed: 28942963
Nat Genet. 2021 Nov;53(11):1616-1621
pubmed: 34737426
Am J Hum Genet. 2008 Sep;83(3):311-21
pubmed: 18691683
Genet Epidemiol. 2008 Feb;32(2):108-18
pubmed: 17849491
Front Genet. 2020 May 19;11:437
pubmed: 32508874
Sci Rep. 2019 Apr 2;9(1):5461
pubmed: 30940856
Nat Rev Genet. 2010 Jun;11(6):446-50
pubmed: 20479774
Bioinformatics. 2019 Oct 1;35(19):3701-3708
pubmed: 30860568
Genet Epidemiol. 2020 Jun;44(4):352-367
pubmed: 32100372
Genet Epidemiol. 2018 Sep;42(6):516-527
pubmed: 29932245
Nat Genet. 2020 Sep;52(9):969-983
pubmed: 32839606
Genet Epidemiol. 2013 Nov;37(7):726-42
pubmed: 24130119
Nat Commun. 2020 Jun 5;11(1):2850
pubmed: 32503972
Nature. 2018 Oct;562(7726):203-209
pubmed: 30305743
Am J Epidemiol. 2017 Nov 1;186(9):1026-1034
pubmed: 28641372
PLoS One. 2015 Jun 25;10(6):e0128999
pubmed: 26111046
PLoS One. 2018 Jan 8;13(1):e0190486
pubmed: 29309409
PLoS Med. 2015 Mar 31;12(3):e1001779
pubmed: 25826379