Smooth Descent: A ploidy-aware algorithm to improve linkage mapping in the presence of genotyping errors.

genotyping error identity by descent imputation linkage mapping polyploidy

Journal

Frontiers in genetics
ISSN: 1664-8021
Titre abrégé: Front Genet
Pays: Switzerland
ID NLM: 101560621

Informations de publication

Date de publication:
2023
Historique:
received: 21 09 2022
accepted: 20 02 2023
entrez: 20 3 2023
pubmed: 21 3 2023
medline: 21 3 2023
Statut: epublish

Résumé

Linkage mapping is an approach to order markers based on recombination events. Mapping algorithms cannot easily handle genotyping errors, which are common in high-throughput genotyping data. To solve this issue, strategies have been developed, aimed mostly at identifying and eliminating these errors. One such strategy is SMOOTH, an iterative algorithm to detect genotyping errors. Unlike other approaches, SMOOTH can also be used to impute the most probable alternative genotypes, but its application is limited to diploid species and to markers heterozygous in only one of the parents. In this study we adapted SMOOTH to expand its use to any marker type and to autopolyploids with the use of identity-by-descent probabilities, naming the updated algorithm Smooth Descent (SD). We applied SD to real and simulated data, showing that in the presence of genotyping errors this method produces better genetic maps in terms of marker order and map length. SD is particularly useful for error rates between 5% and 20% and when error rates are not homogeneous among markers or individuals. With a starting error rate of 10%, SD reduced it to ∼5% in diploids, ∼7% in tetraploids and ∼8.5% in hexaploids. Conversely, the correlation between true and estimated genetic maps increased by 0.03 in tetraploids and by 0.2 in hexaploids, while worsening slightly in diploids (∼0.0011). We also show that the combination of genotype curation and map re-estimation allowed us to obtain better genetic maps while correcting wrong genotypes. We have implemented this algorithm in the R package Smooth Descent.

Identifiants

pubmed: 36936433
doi: 10.3389/fgene.2023.1049988
pii: 1049988
pmc: PMC10014611
doi:

Types de publication

Journal Article

Langues

eng

Pagination

1049988

Informations de copyright

Copyright © 2023 Thérèse Navarro, Bourke, van de Weg, Clot, Arens, Finkers and Maliepaard.

Déclaration de conflit d'intérêts

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Références

PLoS Genet. 2008 Oct;4(10):e1000212
pubmed: 18846212
Brief Bioinform. 2021 Jul 20;22(4):
pubmed: 33236761
Plant Biotechnol J. 2012 Feb;10(2):125-38
pubmed: 21831136
PLoS Comput Biol. 2014 Mar 27;10(3):e1003502
pubmed: 24675685
PLoS Biol. 2014 Jun 10;12(6):e1001883
pubmed: 24914810
Genome Biol. 2020 Sep 21;21(1):252
pubmed: 32951599
Brief Bioinform. 2018 May 1;19(3):387-403
pubmed: 28065918
Theor Appl Genet. 2016 Nov;129(11):2117-2132
pubmed: 27502200
BMC Bioinformatics. 2020 Jun 18;21(1):253
pubmed: 32552661
PLoS One. 2014 Jun 06;9(6):e98855
pubmed: 24905985
Trends Plant Sci. 2016 Jul;21(7):609-621
pubmed: 27021699
Front Genet. 2014 Jul 07;5:208
pubmed: 25071835
Genetics. 2016 May;203(1):119-31
pubmed: 26920758
G3 (Bethesda). 2020 Jan 7;10(1):281-292
pubmed: 31732504
Genome Res. 2017 May;27(5):801-812
pubmed: 27940952
Nat Rev Genet. 2011 Sep 16;12(10):703-14
pubmed: 21921926
Proc Natl Acad Sci U S A. 1959 Nov;45(11):1607-20
pubmed: 16590553
Genetics. 2007 Aug;176(4):2521-7
pubmed: 17277374
BMC Bioinformatics. 2012 Sep 26;13:248
pubmed: 23013469
Genome Biol. 2021 Apr 12;22(1):101
pubmed: 33845884
Genetics. 1989 Jan;121(1):185-99
pubmed: 2563713
Am J Hum Genet. 2018 Sep 6;103(3):338-348
pubmed: 30100085
Bioinformatics. 2003 May 1;19(7):889-90
pubmed: 12724300
PLoS One. 2011 May 04;6(5):e19379
pubmed: 21573248
Genomics. 1992 Nov;14(3):604-10
pubmed: 1427888
J Comput Biol. 2012 Jun;19(6):577-90
pubmed: 22697235
BMC Genomics. 2021 Apr 1;22(1):230
pubmed: 33794798
Bioinformatics. 2021 Aug 06;:
pubmed: 34358315
BMC Genomics. 2018 May 9;19(Suppl 2):110
pubmed: 29764364
Genetics. 2018 May;209(1):65-76
pubmed: 29487138
Heredity (Edinb). 2003 Jan;90(1):33-8
pubmed: 12522423
Genetics. 2021 Oct 2;219(2):
pubmed: 34849879
J Comput Biol. 2015 Jun;22(6):498-509
pubmed: 25658651
Brief Bioinform. 2009 Nov;10(6):595-608
pubmed: 19933208
PLoS One. 2012;7(2):e30906
pubmed: 22363513
Front Plant Sci. 2018 Apr 18;9:513
pubmed: 29720992
Bioinformatics. 2018 Oct 15;34(20):3496-3502
pubmed: 29722786
Genome Biol Evol. 2015 Dec 14;8(1):78-93
pubmed: 26668116
Front Genet. 2015 Jun 19;6:220
pubmed: 26150829
Genetics. 2018 Nov;210(3):789-807
pubmed: 30185430
Bioinformatics. 2017 Dec 01;33(23):3726-3732
pubmed: 29036272
G3 (Bethesda). 2019 Oct 7;9(10):3297-3314
pubmed: 31405891
BMC Genomics. 2015 Apr 03;16:260
pubmed: 25885901
Front Plant Sci. 2018 Aug 21;9:1166
pubmed: 30186293
Theor Appl Genet. 2005 Dec;112(1):187-94
pubmed: 16258753
Nat Commun. 2019 Nov 28;10(1):5436
pubmed: 31780650

Auteurs

Alejandro Thérèse Navarro (A)

Plant Breeding, Wageningen University & Research, Wageningen, Netherlands.

Peter M Bourke (PM)

Plant Breeding, Wageningen University & Research, Wageningen, Netherlands.

Eric van de Weg (E)

Plant Breeding, Wageningen University & Research, Wageningen, Netherlands.

Corentin R Clot (CR)

Plant Breeding, Wageningen University & Research, Wageningen, Netherlands.

Paul Arens (P)

Plant Breeding, Wageningen University & Research, Wageningen, Netherlands.

Richard Finkers (R)

Plant Breeding, Wageningen University & Research, Wageningen, Netherlands.

Chris Maliepaard (C)

Plant Breeding, Wageningen University & Research, Wageningen, Netherlands.

Classifications MeSH