Genotype imputation in F2 crosses of inbred lines.


Journal

Bioinformatics advances
ISSN: 2635-0041
Titre abrégé: Bioinform Adv
Pays: England
ID NLM: 9918282081306676

Informations de publication

Date de publication:
2024
Historique:
received: 22 03 2024
revised: 04 06 2024
accepted: 22 07 2024
medline: 30 7 2024
pubmed: 30 7 2024
entrez: 30 7 2024
Statut: epublish

Résumé

Crosses among inbred lines are a fundamental tool for the discovery of genetic loci associated with phenotypes of interest. In organisms for which large reference panels or SNP chips are not available, imputation from low-pass whole-genome sequencing is an effective method for obtaining genotype data from a large number of individuals. To date, a structured analysis of the conditions required for optimal genotype imputation has not been performed. We report a systematic exploration of the effect of several design variables on imputation performance in F2 crosses of inbred medaka lines using the imputation software STITCH. We determined that, depending on the number of samples, imputation performance reaches a plateau when increasing the per-sample sequencing coverage. We also systematically explored the trade-offs between cost, imputation accuracy, and sample numbers. We developed a computational pipeline to streamline the process, enabling other researchers to perform a similar cost-benefit analysis on their population of interest. The source code for the pipeline is available at https://github.com/birneylab/stitchimpute. While our pipeline has been developed and tested for an F2 population, the software can also be used to analyse populations with a different structure.

Identifiants

pubmed: 39077633
doi: 10.1093/bioadv/vbae107
pii: vbae107
pmc: PMC11286293
doi:

Types de publication

Journal Article

Langues

eng

Pagination

vbae107

Informations de copyright

© The Author(s) 2024. Published by Oxford University Press.

Déclaration de conflit d'intérêts

None declared.

Auteurs

Saul Pierotti (S)

European Bioinformatics Institute (EMBL-EBI), European Molecular Biology Laboratory, Hinxton, Cambridge CB101SD, United Kingdom.

Bettina Welz (B)

Centre for Organismal Studies (COS), Heidelberg University, Heidelberg 69120, Germany.

Mireia Osuna-López (M)

Genomics Core Facility, European Molecular Biology Laboratory (EMBL), Heidelberg 69117, Germany.

Tomas Fitzgerald (T)

European Bioinformatics Institute (EMBL-EBI), European Molecular Biology Laboratory, Hinxton, Cambridge CB101SD, United Kingdom.

Joachim Wittbrodt (J)

Centre for Organismal Studies (COS), Heidelberg University, Heidelberg 69120, Germany.

Ewan Birney (E)

European Bioinformatics Institute (EMBL-EBI), European Molecular Biology Laboratory, Hinxton, Cambridge CB101SD, United Kingdom.

Classifications MeSH