Diversity Forests: Using Split Sampling to Enable Innovative Complex Split Procedures in Random Forests.
Classification
Decision trees
Ensemble learning
Random forests
Journal
SN computer science
ISSN: 2661-8907
Titre abrégé: SN Comput Sci
Pays: Singapore
ID NLM: 101772308
Informations de publication
Date de publication:
2022
2022
Historique:
received:
26
05
2021
accepted:
02
10
2021
entrez:
1
11
2021
pubmed:
2
11
2021
medline:
2
11
2021
Statut:
ppublish
Résumé
The diversity forest algorithm is an alternative candidate node split sampling scheme that makes innovative complex split procedures in random forests possible. While conventional univariable, binary splitting suffices for obtaining strong predictive performance, new complex split procedures can help tackling practically important issues. For example, interactions between features can be exploited effectively by bivariable splitting. With diversity forests, each split is selected from a candidate split set that is sampled in the following way: for The online version contains supplementary material available at 10.1007/s42979-021-00920-1.
Identifiants
pubmed: 34723205
doi: 10.1007/s42979-021-00920-1
pii: 920
pmc: PMC8533673
doi:
Types de publication
Journal Article
Langues
eng
Pagination
1Informations de copyright
© The Author(s) 2021.
Déclaration de conflit d'intérêts
Conflict of InterestThe funding body German Science Foundation had no role in the design of the study and collection, analysis, and interpretation of data and in writing the manuscript. The author declares that he has no conflict of interest.
Références
PLoS One. 2018 Aug 6;13(8):e0201904
pubmed: 30080866
BMJ. 2020 Apr 7;369:m1328
pubmed: 32265220
BMC Bioinformatics. 2016 Mar 31;17:145
pubmed: 27029549
Stat Med. 2018 Jul 30;37(17):2547-2560
pubmed: 29707855
BMC Bioinformatics. 2007 Jan 25;8:25
pubmed: 17254353
BMC Bioinformatics. 2018 Jul 17;19(1):270
pubmed: 30016950
Front Public Health. 2020 Jul 03;8:357
pubmed: 32719767
Public Health. 2020 Aug;185:27-29
pubmed: 32526559