Kinpute: using identity by descent to improve genotype imputation.


Journal

Bioinformatics (Oxford, England)
ISSN: 1367-4811
Titre abrégé: Bioinformatics
Pays: England
ID NLM: 9808944

Informations de publication

Date de publication:
01 11 2019
Historique:
received: 23 08 2018
revised: 21 02 2019
accepted: 26 03 2019
pubmed: 29 3 2019
medline: 1 7 2020
entrez: 29 3 2019
Statut: ppublish

Résumé

Genotype imputation, though generally accurate, often results in many genotypes being poorly imputed, particularly in studies where the individuals are not well represented by standard reference panels. When individuals in the study share regions of the genome identical by descent (IBD), it is possible to use this information in combination with a study-specific reference panel (SSRP) to improve the imputation results. Kinpute uses IBD information-due to recent, familial relatedness or distant, unknown ancestors-in conjunction with the output from linkage disequilibrium (LD) based imputation methods to compute more accurate genotype probabilities. Kinpute uses a novel method for IBD imputation, which works even in the absence of a pedigree, and results in substantially improved imputation quality. Given initial estimates of average IBD between subjects in the study sample, Kinpute uses a novel algorithm to select an optimal set of individuals to sequence and use as an SSRP. Kinpute is designed to use as input both this SSRP and the genotype probabilities output from other LD-based imputation software, and uses a new method to combine the LD imputed genotype probabilities with IBD configurations to substantially improve imputation. We tested Kinpute on a human population isolate where 98 individuals have been sequenced. In half of this sample, whose sequence data was masked, we used Impute2 to perform LD-based imputation and Kinpute was used to obtain higher accuracy genotype probabilities. Measures of imputation accuracy improved significantly, particularly for those genotypes that Impute2 imputed with low certainty. Kinpute is an open-source and freely available C++ software package that can be downloaded from. Supplementary data are available at Bioinformatics online.

Identifiants

pubmed: 30918937
pii: 5421157
doi: 10.1093/bioinformatics/btz221
pmc: PMC6821425
doi:

Types de publication

Journal Article Research Support, N.I.H., Extramural

Langues

eng

Sous-ensembles de citation

IM

Pagination

4321-4326

Subventions

Organisme : NHGRI NIH HHS
ID : R01 HG002899
Pays : United States

Informations de copyright

© The Author(s) 2019. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Références

Genet Epidemiol. 2018 Mar;42(2):201-213
pubmed: 29319195
Genet Epidemiol. 2011 Dec;35(8):853-60
pubmed: 22006673
Nat Genet. 2016 Oct;48(10):1279-83
pubmed: 27548312
Nat Genet. 2006 Sep;38(9):1002-4
pubmed: 16921375
Nat Genet. 2008 Sep;40(9):1068-75
pubmed: 19165921
PLoS Comput Biol. 2015 Mar 03;11(3):e1004139
pubmed: 25735005
Genet Epidemiol. 2014 Nov;38(7):579-90
pubmed: 25132070
Sci Rep. 2017 Jul 20;7(1):6079
pubmed: 28729679
Am J Hum Genet. 2018 Sep 6;103(3):338-348
pubmed: 30100085
Nat Genet. 2012 Jul 22;44(8):955-9
pubmed: 22820512
Nat Genet. 2016 Oct;48(10):1284-1287
pubmed: 27571263
BMC Proc. 2014 Jun 17;8(Suppl 1 Genetic Analysis Workshop 18Vanessa Olmo):S19
pubmed: 25519371
PLoS Genet. 2014 Apr 17;10(4):e1004234
pubmed: 24743097
Genet Epidemiol. 2014 Sep;38(6):531-41
pubmed: 25044249
Eur J Hum Genet. 2015 Jul;23(7):975-83
pubmed: 25293720
Am J Hum Genet. 2016 Jan 7;98(1):116-26
pubmed: 26748515
Bioinformatics. 2015 Mar 1;31(5):782-4
pubmed: 25338720
Nat Genet. 2015 Nov;47(11):1272-1281
pubmed: 26366554
Genet Epidemiol. 2017 Dec;41(8):744-755
pubmed: 28861891
Genet Epidemiol. 2011 Sep;35(6):557-67
pubmed: 21769932
Genetics. 2012 Feb;190(2):679-89
pubmed: 22135348
Nature. 2015 Oct 1;526(7571):68-74
pubmed: 26432245
Genome Res. 2019 Jan;29(1):125-134
pubmed: 30514702
Genet Epidemiol. 2012 May;36(4):312-9
pubmed: 22460724
Biometrics. 1974 Dec;30(4):667-80
pubmed: 4429760
Eur J Hum Genet. 2013 Feb;21(2):205-11
pubmed: 22781100
Am J Hum Genet. 2013 Apr 4;92(4):504-16
pubmed: 23561844
Eur J Hum Genet. 2017 Jun;25(7):869-876
pubmed: 28401899
Eur J Hum Genet. 2014 Nov;22(11):1321-6
pubmed: 24896149

Auteurs

Mark Abney (M)

Department of Human Genetics, University of Chicago, Chicago, IL, USA.

Aisha ElSherbiny (A)

Department of Human Genetics, University of Chicago, Chicago, IL, USA.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH