Cluster Buster: A Machine Learning Algorithm for Genotyping SNPs from Raw Data.


Journal

bioRxiv : the preprint server for biology
ISSN: 2692-8205
Titre abrégé: bioRxiv
Pays: United States
ID NLM: 101680187

Informations de publication

Date de publication:
26 Aug 2024
Historique:
medline: 10 9 2024
pubmed: 10 9 2024
entrez: 10 9 2024
Statut: epublish

Résumé

Genotyping single nucleotide polymorphisms (SNPs) is fundamental to disease research, as researchers seek to establish links between genetic variation and disease. Although significant advances in genome technology have been made with the development of bead-based SNP genotyping and Genome Studio software, some SNPs still fail to be genotyped, resulting in "no-calls" that impede downstream analyses. To recover these genotypes, we introduce Cluster Buster, a genotyping neural network and visual inspection system designed to improve the quality of neurodegenerative disease (NDD) research. Concordance analysis with whole genome sequencing (WGS) and imputed genotypes validated the reliability of predicted genotypes, with dozens of high-performing SNPs across

Identifiants

pubmed: 39253512
doi: 10.1101/2024.08.23.609429
pmc: PMC11383048
pii:
doi:

Types de publication

Journal Article Preprint

Langues

eng

Auteurs

Classifications MeSH