Combining Random Forests and a Signal Detection Method Leads to the Robust Detection of Genotype-Phenotype Associations.
Random Forests
boruta
egg weight
eggshell strength
genome wide association studies
signal detection
Journal
Genes
ISSN: 2073-4425
Titre abrégé: Genes (Basel)
Pays: Switzerland
ID NLM: 101551097
Informations de publication
Date de publication:
05 08 2020
05 08 2020
Historique:
received:
09
07
2020
revised:
28
07
2020
accepted:
03
08
2020
entrez:
9
8
2020
pubmed:
9
8
2020
medline:
18
3
2021
Statut:
epublish
Résumé
Genome wide association studies (GWAS) are a well established methodology to identify genomic variants and genes that are responsible for traits of interest in all branches of the life sciences. Despite the long time this methodology has had to mature the reliable detection of genotype-phenotype associations is still a challenge for many quantitative traits mainly because of the large number of genomic loci with weak individual effects on the trait under investigation. Thus, it can be hypothesized that many genomic variants that have a small, however real, effect remain unnoticed in many GWAS approaches. Here, we propose a two-step procedure to address this problem. In a first step, cubic splines are fitted to the test statistic values and genomic regions with spline-peaks that are higher than expected by chance are considered as quantitative trait loci (QTL). Then the SNPs in these QTLs are prioritized with respect to the strength of their association with the phenotype using a Random Forests approach. As a case study, we apply our procedure to real data sets and find trustworthy numbers of, partially novel, genomic variants and genes involved in various egg quality traits.
Identifiants
pubmed: 32764260
pii: genes11080892
doi: 10.3390/genes11080892
pmc: PMC7465705
pii:
doi:
Types de publication
Journal Article
Research Support, Non-U.S. Gov't
Langues
eng
Sous-ensembles de citation
IM
Commentaires et corrections
Type : ErratumIn
Références
Front Plant Sci. 2018 Sep 05;9:1311
pubmed: 30233634
Proc Natl Acad Sci U S A. 2019 Apr 2;116(14):6848-6857
pubmed: 30872480
Front Endocrinol (Lausanne). 2012 Feb 28;3:29
pubmed: 22645519
Cell Stem Cell. 2007 Oct 11;1(4):470-8
pubmed: 18371382
PLoS One. 2010 Nov 22;5(11):e14079
pubmed: 21124933
Genet Sel Evol. 2020 Feb 12;52(1):9
pubmed: 32050893
Mol Ecol Resour. 2018 Jul;18(4):755-766
pubmed: 29504715
BMC Genomics. 2015;16 Suppl 2:S5
pubmed: 25708662
Calcif Tissue Int. 2014 Jan;94(1):46-54
pubmed: 23657489
Am J Hum Genet. 2017 Jul 6;101(1):5-22
pubmed: 28686856
Int J Epidemiol. 2012 Dec;41(6):1798-806
pubmed: 23148107
Genes (Basel). 2020 Apr 24;11(4):
pubmed: 32344666
J Anim Sci. 2014 Sep;92(9):3804-10
pubmed: 25023801
Front Genet. 2020 Apr 15;11:350
pubmed: 32351543
Am J Hum Genet. 2002 Dec;71(6):1386-94
pubmed: 12439824
Hum Genomics. 2003 Nov;1(1):20-9
pubmed: 15601530
BMC Genet. 2013 Sep 24;14:90
pubmed: 24059973
Genetics. 2005 Apr;169(4):2267-75
pubmed: 15716509
Genet Epidemiol. 2003 Dec;25(4):360-6
pubmed: 14639705
Genet Sel Evol. 2020 Mar 17;52(1):14
pubmed: 32183688
Genet Epidemiol. 2010 Jan;34(1):100-5
pubmed: 19434714
Crit Rev Clin Lab Sci. 2010 Aug;47(4):181-95
pubmed: 21182397
Sci Rep. 2019 Jul 17;9(1):10351
pubmed: 31316157
Nat Genet. 2012 Jun 17;44(7):821-4
pubmed: 22706312
Nat Methods. 2011 Sep 04;8(10):833-5
pubmed: 21892150
Genetics. 2016 Apr;202(4):1313-28
pubmed: 26868768
BMC Genomics. 2014 Mar 21;15:220
pubmed: 24649854
BMC Physiol. 2012 Sep 04;12:10
pubmed: 22943410
BMC Proc. 2007;1 Suppl 1:S59
pubmed: 18466559
Mar Biotechnol (NY). 2020 Apr;22(2):153-166
pubmed: 31927644
J Dairy Sci. 2019 Oct;102(10):9409-9421
pubmed: 31447154
PLoS Genet. 2019 Jun 24;15(6):e1008222
pubmed: 31233496
Nat Genet. 2010 Apr;42(4):348-54
pubmed: 20208533
Sci Rep. 2017 Sep 29;7(1):12478
pubmed: 28963534
Histochemistry. 1985;83(3):221-6
pubmed: 2931410
Nat Rev Genet. 2006 Oct;7(10):781-91
pubmed: 16983374
Genetics. 2008 Jun;179(2):1057-68
pubmed: 18558654
Genome Res. 1999 Aug;9(8):720-31
pubmed: 10447507
Arch Environ Contam Toxicol. 1982 Sep;11(5):627-33
pubmed: 6216861
Front Genet. 2018 Jul 04;9:237
pubmed: 30023001
Am J Hum Genet. 2018 May 3;102(5):717-730
pubmed: 29727686
Front Plant Sci. 2018 Apr 26;9:561
pubmed: 29755499
Eur J Hum Genet. 2001 Apr;9(4):291-300
pubmed: 11313774
Front Plant Sci. 2019 Feb 11;10:100
pubmed: 30804969
BMC Genomics. 2020 Jan 13;21(1):41
pubmed: 31931710
Genet Sel Evol. 2015 Apr 17;47:30
pubmed: 25928167
PLoS Comput Biol. 2012;8(12):e1002822
pubmed: 23300413
Dev Biol. 2015 May 1;401(1):2-16
pubmed: 25576029
J Cell Biol. 1998 Aug 24;142(4):1105-19
pubmed: 9722621
New Phytol. 2017 Apr;214(1):21-33
pubmed: 28211582
Brief Bioinform. 2018 Jul 20;19(4):700-712
pubmed: 28158525
Sci Rep. 2017 Dec 22;7(1):18083
pubmed: 29273734
Genet Sel Evol. 2009 Dec 31;41:55
pubmed: 20043827
Am J Hum Genet. 2007 Sep;81(3):559-75
pubmed: 17701901
Genet Epidemiol. 2006 Sep;30(6):531-45
pubmed: 16830340
Front Genet. 2019 Nov 22;10:1091
pubmed: 31824557
Plant Methods. 2013 Jul 22;9:29
pubmed: 23876160
Br Poult Sci. 2018 Jun;59(3):264-269
pubmed: 29667421
BMC Genomics. 2010 Dec 22;11:724
pubmed: 21176216
Genomics. 2020 Jan;112(1):243-251
pubmed: 30772430
Front Plant Sci. 2018 Aug 17;9:1083
pubmed: 30177935
PLoS Genet. 2014 Jul 17;10(7):e1004445
pubmed: 25033443
Front Plant Sci. 2018 Oct 04;9:1464
pubmed: 30337936
Mol Cell Biol. 2002 Mar;22(5):1474-87
pubmed: 11839813
Front Genet. 2018 Apr 26;9:128
pubmed: 29755503
Nat Genet. 2006 Feb;38(2):203-8
pubmed: 16380716
Database (Oxford). 2011 Jul 23;2011:bar030
pubmed: 21785142
Nat Rev Mol Cell Biol. 2011 Jan;12(1):60-70
pubmed: 21179061