MagicalRsq-X: A cross-cohort transferable genotype imputation quality metric.
cross-cohort
genome-wide association studies
genotype imputation
imputation quality
machine learning
quality control
rare variants
variant filtering
whole-genome sequencing
Journal
American journal of human genetics
ISSN: 1537-6605
Titre abrégé: Am J Hum Genet
Pays: United States
ID NLM: 0370475
Informations de publication
Date de publication:
09 Apr 2024
09 Apr 2024
Historique:
received:
25
01
2024
revised:
29
03
2024
accepted:
01
04
2024
medline:
19
4
2024
pubmed:
19
4
2024
entrez:
18
4
2024
Statut:
aheadofprint
Résumé
Since genotype imputation was introduced, researchers have been relying on the estimated imputation quality from imputation software to perform post-imputation quality control (QC). However, this quality estimate (denoted as Rsq) performs less well for lower-frequency variants. We recently published MagicalRsq, a machine-learning-based imputation quality calibration, which leverages additional typed markers from the same cohort and outperforms Rsq as a QC metric. In this work, we extended the original MagicalRsq to allow cross-cohort model training and named the new model MagicalRsq-X. We removed the cohort-specific estimated minor allele frequency and included linkage disequilibrium scores and recombination rates as additional features. Leveraging whole-genome sequencing data from TOPMed, specifically participants in the BioMe, JHS, WHI, and MESA studies, we performed comprehensive cross-cohort evaluations for predominantly European and African ancestral individuals based on their inferred global ancestry with the 1000 Genomes and Human Genome Diversity Project data as reference. Our results suggest MagicalRsq-X outperforms Rsq in almost every setting, with 7.3%-14.4% improvement in squared Pearson correlation with true R
Identifiants
pubmed: 38636510
pii: S0002-9297(24)00116-2
doi: 10.1016/j.ajhg.2024.04.001
pii:
doi:
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Informations de copyright
Copyright © 2024. Published by Elsevier Inc.
Déclaration de conflit d'intérêts
Declaration of interests The authors declare no competing interests.