A fully adjusted two-stage procedure for rank-normalization in genetic association studies.
rank-normalization
rare variants
whole-genome sequencing.
Journal
Genetic epidemiology
ISSN: 1098-2272
Titre abrégé: Genet Epidemiol
Pays: United States
ID NLM: 8411723
Informations de publication
Date de publication:
04 2019
04 2019
Historique:
received:
29
06
2018
accepted:
11
12
2018
pubmed:
18
1
2019
medline:
11
5
2019
entrez:
18
1
2019
Statut:
ppublish
Résumé
When testing genotype-phenotype associations using linear regression, departure of the trait distribution from normality can impact both Type I error rate control and statistical power, with worse consequences for rarer variants. Because genotypes are expected to have small effects (if any) investigators now routinely use a two-stage method, in which they first regress the trait on covariates, obtain residuals, rank-normalize them, and then use the rank-normalized residuals in association analysis with the genotypes. Potential confounding signals are assumed to be removed at the first stage, so in practice, no further adjustment is done in the second stage. Here, we show that this widely used approach can lead to tests with undesirable statistical properties, due to both combination of a mis-specified mean-variance relationship and remaining covariate associations between the rank-normalized residuals and genotypes. We demonstrate these properties theoretically, and also in applications to genome-wide and whole-genome sequencing association studies. We further propose and evaluate an alternative fully adjusted two-stage approach that adjusts for covariates both when residuals are obtained and in the subsequent association test. This method can reduce excess Type I errors and improve statistical power.
Identifiants
pubmed: 30653739
doi: 10.1002/gepi.22188
pmc: PMC6416071
mid: NIHMS1004041
doi:
Substances chimiques
Hemoglobins
0
Types de publication
Journal Article
Research Support, N.I.H., Extramural
Langues
eng
Sous-ensembles de citation
IM
Pagination
263-275Subventions
Organisme : NHLBI NIH HHS
ID : HHSN268201100037C
Pays : United States
Organisme : NHLBI NIH HHS
ID : HHSN268201300005C
Pays : United States
Organisme : NHLBI NIH HHS
ID : R01 HL120393
Pays : United States
Organisme : NHLBI NIH HHS
ID : HHSN268201300001C
Pays : United States
Organisme : NHLBI NIH HHS
ID : R01 HL092577
Pays : United States
Organisme : NHLBI NIH HHS
ID : N01HC65236
Pays : United States
Organisme : NIDDK NIH HHS
ID : P30 DK072488
Pays : United States
Organisme : NHLBI NIH HHS
ID : HHSN268201500001I
Pays : United States
Organisme : NHLBI NIH HHS
ID : N01HC25195
Pays : United States
Organisme : NHLBI NIH HHS
ID : N01-HC-65235
Pays : United States
Organisme : NHLBI NIH HHS
ID : HHSN268201300003I
Pays : United States
Organisme : NHLBI NIH HHS
ID : HHSN268201300004C
Pays : United States
Organisme : NHLBI NIH HHS
ID : N01-HC-65236
Pays : United States
Organisme : NHLBI NIH HHS
ID : HHSN268201500001C
Pays : United States
Organisme : NHLBI NIH HHS
ID : 1R35HL135818, 3R01HL-117626-02S1, 3R01HL-120393-02S1, R01 HL092577-06S1, R01HL120393-03S1, T32 HL129982, U01 HL072515, U01 HL137181, U01 HL84756
Pays : United States
Organisme : NHLBI NIH HHS
ID : HHSN268201300048C
Pays : United States
Organisme : NHLBI NIH HHS
ID : HHSN268201300004I
Pays : United States
Organisme : NHLBI NIH HHS
ID : N01HC65235
Pays : United States
Organisme : NHLBI NIH HHS
ID : U01 HL137181
Pays : United States
Organisme : NHLBI NIH HHS
ID : HHSN268201300005I
Pays : United States
Organisme : NHLBI NIH HHS
ID : N01HC65234
Pays : United States
Organisme : NHLBI NIH HHS
ID : T32 HL129982
Pays : United States
Organisme : NHLBI NIH HHS
ID : U01 HL084756
Pays : United States
Organisme : NHLBI NIH HHS
ID : R35 HL135818
Pays : United States
Organisme : NIDDK NIH HHS
ID : P30 DK72488
Pays : United States
Organisme : NHLBI NIH HHS
ID : N01HC65233
Pays : United States
Organisme : NHLBI NIH HHS
ID : HHSN268201300049C
Pays : United States
Organisme : NHLBI NIH HHS
ID : N01HC65237
Pays : United States
Organisme : NHLBI NIH HHS
ID : HHSN268201300001I/N01-HC-65233
Pays : United States
Organisme : NHLBI NIH HHS
ID : HHSN268201300047C
Pays : United States
Organisme : NHLBI NIH HHS
ID : HHSN268201300050C
Pays : United States
Organisme : NHLBI NIH HHS
ID : U01 HL072515
Pays : United States
Organisme : NHGRI NIH HHS
ID : HHSN268201300003C
Pays : United States
Organisme : NHLBI NIH HHS
ID : N01-HC-65237
Pays : United States
Organisme : NHLBI NIH HHS
ID : U01 HL137162
Pays : United States
Organisme : NHLBI NIH HHS
ID : HHSN268201500014C
Pays : United States
Organisme : NHLBI NIH HHS
ID : HHSN268201300046C
Pays : United States
Organisme : NHGRI NIH HHS
ID : R01 HG005827
Pays : United States
Organisme : NHLBI NIH HHS
ID : U01 HL84756
Pays : United States
Organisme : NHGRI NIH HHS
ID : R01HG005827
Pays : United States
Organisme : NHLBI NIH HHS
ID : N01-HC-65234
Pays : United States
Organisme : NHLBI NIH HHS
ID : HHSN268201300002I
Pays : United States
Organisme : NHLBI NIH HHS
ID : R01 HL117626
Pays : United States
Informations de copyright
© 2019 Wiley Periodicals, Inc.
Références
Eur J Hum Genet. 2016 Aug;24(8):1188-94
pubmed: 26733287
Nature. 2015 Feb 12;518(7538):187-196
pubmed: 25673412
Methods Mol Biol. 2013;1019:215-36
pubmed: 23756893
Behav Genet. 1987 May;17(3):243-56
pubmed: 3632560
Am J Hum Genet. 2015 Jul 2;97(1):35-53
pubmed: 26094574
Genet Epidemiol. 2012 Dec;36(8):890-4
pubmed: 22941732
Am J Hum Genet. 2016 Jul 7;99(1):22-39
pubmed: 27346689
Eur J Hum Genet. 2018 Aug;26(8):1194-1201
pubmed: 29706643
Am J Hum Genet. 2002 May;70(5):1247-56
pubmed: 11923912
Genet Epidemiol. 2011 Nov;35(7):592-6
pubmed: 21769934
Nat Genet. 2012 Feb 19;44(3):307-11
pubmed: 22344219
Behav Genet. 2009 Sep;39(5):580-95
pubmed: 19526352
Biometrics. 1999 Dec;55(4):997-1004
pubmed: 11315092
Nat Genet. 2017 Jan;49(1):54-64
pubmed: 27841878
Am J Hum Genet. 2011 Jul 15;89(1):82-93
pubmed: 21737059
Am J Hum Genet. 2014 Feb 6;94(2):233-45
pubmed: 24507775