Impact of the inaccessible genome on genotype imputation and genome-wide association studies.

GWAS NGS accessibility genotyping chips web tool

Journal

Human molecular genetics
ISSN: 1460-2083
Titre abrégé: Hum Mol Genet
Pays: England
ID NLM: 9208958

Informations de publication

Date de publication:
20 Apr 2024
Historique:
received: 22 12 2023
revised: 03 03 2024
accepted: 25 03 2024
medline: 21 4 2024
pubmed: 21 4 2024
entrez: 20 4 2024
Statut: aheadofprint

Résumé

Genotype imputation is widely used in genome-wide association studies (GWAS). However, both the genotyping chips and imputation reference panels are dependent on next-generation sequencing (NGS). Due to the nature of NGS, some regions of the genome are inaccessible to sequencing. To date, there has been no complete evaluation of these regions and their impact on the identification of associations in GWAS remains unclear. In this study, we systematically assess the extent to which variants in inaccessible regions are underrepresented on genotyping chips and imputation reference panels, in GWAS results and in variant databases. We also determine the proportion of genes located in inaccessible regions and compare the results across variant masks defined by the 1000 Genomes Project and the TOPMed program. Overall, fewer variants were observed in inaccessible regions in all categories analyzed. Depending on the mask used and normalized for region size, only 4%-17% of the genotyped variants are located in inaccessible regions and 52 to 581 genes were almost completely inaccessible. From the Cooperative Health Research in South Tyrol (CHRIS) study, we present a case study of an association located in an inaccessible region that is driven by genotyped variants and cannot be reproduced by imputation in GRCh37. We conclude that genotyping, NGS, genotype imputation and downstream analyses such as GWAS and fine mapping are systematically biased in inaccessible regions, due to missed variants and spurious associations. To help researchers assess gene and variant accessibility, we provide an online application (https://gab.gm.eurac.edu).

Identifiants

pubmed: 38643062
pii: 7655475
doi: 10.1093/hmg/ddae062
pii:
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Informations de copyright

© The Author(s) 2024. Published by Oxford University Press.

Auteurs

Eva König (E)

Institute for Biomedicine (affiliated to the University of Lübeck), Eurac Research, Via Volta 21, Bolzano 39100, Italy.

Jonathan Stewart Mitchell (JS)

Institute for Biomedicine (affiliated to the University of Lübeck), Eurac Research, Via Volta 21, Bolzano 39100, Italy.

Michele Filosi (M)

Institute for Biomedicine (affiliated to the University of Lübeck), Eurac Research, Via Volta 21, Bolzano 39100, Italy.

Christian Fuchsberger (C)

Institute for Biomedicine (affiliated to the University of Lübeck), Eurac Research, Via Volta 21, Bolzano 39100, Italy.

Classifications MeSH