Cluster Buster: A Machine Learning Algorithm for Genotyping SNPs from Raw Data.
Journal
bioRxiv : the preprint server for biology
ISSN: 2692-8205
Titre abrégé: bioRxiv
Pays: United States
ID NLM: 101680187
Informations de publication
Date de publication:
26 Aug 2024
26 Aug 2024
Historique:
medline:
10
9
2024
pubmed:
10
9
2024
entrez:
10
9
2024
Statut:
epublish
Résumé
Genotyping single nucleotide polymorphisms (SNPs) is fundamental to disease research, as researchers seek to establish links between genetic variation and disease. Although significant advances in genome technology have been made with the development of bead-based SNP genotyping and Genome Studio software, some SNPs still fail to be genotyped, resulting in "no-calls" that impede downstream analyses. To recover these genotypes, we introduce Cluster Buster, a genotyping neural network and visual inspection system designed to improve the quality of neurodegenerative disease (NDD) research. Concordance analysis with whole genome sequencing (WGS) and imputed genotypes validated the reliability of predicted genotypes, with dozens of high-performing SNPs across
Identifiants
pubmed: 39253512
doi: 10.1101/2024.08.23.609429
pmc: PMC11383048
pii:
doi:
Types de publication
Journal Article
Preprint
Langues
eng