Splitting on categorical predictors in random forests.

Categorical predictors Classification Random forest Survival analysis

Journal

PeerJ
ISSN: 2167-8359
Titre abrégé: PeerJ
Pays: United States
ID NLM: 101603425

Informations de publication

Date de publication:
2019
Historique:
received: 26 06 2018
accepted: 22 12 2018
entrez: 13 2 2019
pubmed: 13 2 2019
medline: 13 2 2019
Statut: epublish

Résumé

One reason for the widespread success of random forests (RFs) is their ability to analyze most datasets without preprocessing. For example, in contrast to many other statistical methods and machine learning approaches, no recoding such as dummy coding is required to handle ordinal and nominal predictors. The standard approach for nominal predictors is to consider all 2

Identifiants

pubmed: 30746306
doi: 10.7717/peerj.6339
pii: 6339
pmc: PMC6368971
doi:

Types de publication

Journal Article

Langues

eng

Pagination

e6339

Déclaration de conflit d'intérêts

The authors declare that they have no competing interests.

Références

Stat Med. 1999 Sep 15-30;18(17-18):2529-45
pubmed: 10474158
BMC Proc. 2009 Dec 15;3 Suppl 7:S2
pubmed: 20018009
Evol Comput. 2012 Summer;20(2):249-75
pubmed: 22339368
Stat Appl Genet Mol Biol. 2011;10(1):32
pubmed: 22889876
PLoS One. 2017 Feb 16;12(2):e0169748
pubmed: 28207752
Mach Learn. 2015 Apr;99(1):75-118
pubmed: 28919667

Auteurs

Marvin N Wright (MN)

Leibniz Institute for Prevention Research and Epidemiology-BIPS, Bremen, Germany.

Inke R König (IR)

Institut für Medizinische Biometrie und Statistik, Universität zu Lübeck, Universitätsklinikum Schleswig-Holstein, Campus Lübeck, Lübeck, Germany.

Classifications MeSH