The hazards of genotype imputation in chromosomal regions under selection: A case study using the Lactase gene region.


Journal

Annals of human genetics
ISSN: 1469-1809
Abbreviated title: Ann Hum Genet
Country: England
NLM ID: 0416661

Publication information

Publication date:
01 2022
History:
received: 15 05 2021
revised: 15 07 2021
accepted: 11 08 2021
pubmed: 16 9 2021
medline: 22 1 2022
entrez: 15 9 2021
Status: ppublish

Abstract

Although imputation of missing SNP results has been widely used in genetic studies, claims about the quality and usefulness of imputation have outnumbered the few studies that have questioned its limitations. But it is becoming clear that these limitations are real: for example, disease association signals can be missed in regions of LD breakdown. Here, as a case study, using the chromosomal region of the well-known lactase gene, LCT, we address the issue of imputation in the context of variants that have become frequent in a limited number of modern population groups only recently, due to selection. We study SNPs in a 500 bp region covering the enhancer of LCT, and compare imputed genotypes with directly genotyped data. We examine the haplotype pairs of all individuals with discrepant and missing genotypes. We highlight the nonrandom nature of the allelic errors and show that most incorrect imputations and missing data result from long haplotypes that are evolutionarily closely related to those carrying the derived alleles, while some relate to rare and recombinant haplotypes. We conclude that bias of incorrectly imputed and missing genotypes can decrease the accuracy of imputed results substantially.
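
The comparison of imputed with directly genotyped calls described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' pipeline: the function names, the data layout (one dictionary of sample ID to genotype per call set) and the toy genotypes are all assumptions made for illustration. It tallies concordance, the rate of missing imputed calls, and the direction of each discrepancy, which is the kind of tally that would reveal nonrandom allelic errors.

```python
from collections import Counter


def normalise(genotype):
    """Return an order-independent genotype ('TC' -> 'CT'), or None if missing."""
    if genotype is None:
        return None
    return "".join(sorted(genotype))


def compare_calls(typed, imputed):
    """Compare imputed calls with direct genotypes for a single SNP.

    typed, imputed: dicts mapping sample ID -> genotype string or None.
    Only samples that have a direct genotype are assessed.
    """
    concordant = 0
    missing = 0
    errors = Counter()  # (direct genotype, imputed genotype) -> count
    for sample, truth in typed.items():
        truth = normalise(truth)
        if truth is None:
            continue
        call = normalise(imputed.get(sample))
        if call is None:
            missing += 1
        elif call == truth:
            concordant += 1
        else:
            errors[(truth, call)] += 1
    called = concordant + sum(errors.values())
    assessed = called + missing
    return {
        "concordance": concordant / called if called else float("nan"),
        "missing_rate": missing / assessed if assessed else float("nan"),
        "errors": errors,
    }


# Toy data invented for illustration only (hypothetical samples and alleles).
typed = {"s1": "CC", "s2": "CT", "s3": "TC", "s4": "TT", "s5": "CT", "s6": "CC"}
imputed = {"s1": "CC", "s2": "CC", "s3": "CC", "s4": "TT", "s5": None, "s6": "CC"}

result = compare_calls(typed, imputed)
print(result["concordance"], result["missing_rate"], dict(result["errors"]))
```

In this toy set every discrepancy runs in the same direction (a heterozygote imputed as a homozygote), so the error counter is concentrated in one cell rather than spread evenly. A skew of that kind, rather than the overall concordance figure, is what indicates biased imputation.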

Identifiers

pubmed: 34523124
doi: 10.1111/ahg.12444

Chemical substances

Lactase EC 3.2.1.108

Publication types

Journal Article; Research Support, Non-U.S. Gov't

Languages

eng

Citation subsets

IM

Pagination

24-33

Grants

Agency: Wellcome Trust
ID: 209106/Z/17/Z
Country: United Kingdom

Copyright information

© 2021 The Authors. Annals of Human Genetics published by University College London (UCL) and John Wiley & Sons Ltd.

Authors

Aminah T Ali (AT)

University College London Research Department of Genetics Evolution and Environment, London, UK.

Anke Liebert (A)

University College London Research Department of Genetics Evolution and Environment, London, UK.

Winston Lau (W)

University College London Research Department of Genetics Evolution and Environment, London, UK.

Nikolas Maniatis (N)

University College London Research Department of Genetics Evolution and Environment, London, UK.

Dallas M Swallow (DM)

University College London Research Department of Genetics Evolution and Environment, London, UK.
