Leveraging summary statistics to make inferences about complex phenotypes in large biobanks.


Journal

Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing
ISSN: 2335-6936
Titre abrégé: Pac Symp Biocomput
Pays: United States
ID NLM: 9711271

Informations de publication

Date de publication:
2019
Historique:
entrez: 10 4 2019
pubmed: 10 4 2019
medline: 2 1 2020
Statut: ppublish

Résumé

As genetic sequencing becomes less expensive and data sets linking genetic data and medical records (e.g., Biobanks) become larger and more common, issues of data privacy and computational challenges become more necessary to address in order to realize the benefits of these datasets. One possibility for alleviating these issues is through the use of already-computed summary statistics (e.g., slopes and standard errors from a regression model of a phenotype on a genotype). If groups share summary statistics from their analyses of biobanks, many of the privacy issues and computational challenges concerning the access of these data could be bypassed. In this paper we explore the possibility of using summary statistics from simple linear models of phenotype on genotype in order to make inferences about more complex phenotypes (those that are derived from two or more simple phenotypes). We provide exact formulas for the slope, intercept, and standard error of the slope for linear regressions when combining phenotypes. Derived equations are validated via simulation and tested on a real data set exploring the genetics of fatty acids.

Identifiants

pubmed: 30963077
pmc: PMC6417828
mid: NIHMS999825

Substances chimiques

Fatty Acids 0

Types de publication

Journal Article Research Support, N.I.H., Extramural Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

391-402

Subventions

Organisme : NHGRI NIH HHS
ID : R15 HG006915
Pays : United States

Références

J Law Med Ethics. 2016 Mar;44(1):156-60
pubmed: 27256131
Prostaglandins Leukot Essent Fatty Acids. 2015 Mar;94:65-72
pubmed: 25500335
Biometrics. 1997 Dec;53(4):1253-61
pubmed: 9423247
PLoS Med. 2015 Mar 31;12(3):e1001779
pubmed: 25826379
Atherosclerosis. 2012 Dec;225(2):425-31
pubmed: 22727409
PLoS One. 2018 Apr 13;13(4):e0194882
pubmed: 29652918

Auteurs

Angela Gasdaska (A)

Department of Mathematics and Computer Science and Department of Quantitative Theory and Methods, Emory University, Atlanta, GA 30322, USA, aegasdaska@gmail.com.

Derek Friend (D)

Department of Geography, University of Nevada, Reno, NV 89557, USA, derekfriend@outlook.com.

Rachel Chen (R)

Department of Statistics, North Carolina State University, Raleigh, NC 27695, USA, rschen@ncsu.edu.

Jason Westra (J)

Department of Math, Computer Science, and Statistics, Dordt College, Sioux Center, IA 51250, USA, westrajason@hotmail.com.

Matthew Zawistowski (M)

Department of Biostatistics, University of Michigan, Ann Arbor, MI 48109, USA, mattz@umich.edu.

William Lindsey (W)

Department of Math, Computer Science, and Statistics, Dordt College, Sioux Center, IA 51250, USA William.Lindsey@dordt.edu.

Nathan Tintle (N)

Department of Math, Computer Science, and Statistics, Dordt College, Sioux Center, IA 51250, USA Nathan.Tintle@dordt.edu.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH