Beyond central-tendency: If we agree discrete vegetation communities do not exist, should we investigate other methods of clustering?

CLUTO classification stability clustering community theory essentialism graph theory network theory reification vegetation classification vegetation databases

Journal

Ecology and evolution
ISSN: 2045-7758
Titre abrégé: Ecol Evol
Pays: England
ID NLM: 101566408

Informations de publication

Date de publication:
Nov 2023
Historique:
received: 31 08 2023
accepted: 08 10 2023
medline: 29 11 2023
pubmed: 29 11 2023
entrez: 29 11 2023
Statut: epublish

Résumé

Clustering is indispensable in the quest for robust vegetation classification schemes that aim to partition, summarise and communicate patterns. However, clustering solutions are sensitive to methods and data and are therefore unstable, a feature that is usually attributed to noise. Viewed through a central-tendency lens, noise is defined as the degree of departure from type, which is problematic since vegetation types are abstractions of continua, and so noise can only be quantified relative to the particular solution at hand. Graph theory models the structure of vegetation data based on the interconnectivity of samples. Through a graph-theoretic lens, the causes of instability can be quantified in absolute terms via the degree of connectivity among objects. We simulated incremental increases in sampling intensity in a dataset over five iterations and assessed classification stability across successive solutions derived using algorithms implementing, respectively, models of central-tendency and interconnectivity. We used logistic regression to model the likelihood of a sample changing groups between iterations as a function of distance to the centroid and degree of interconnectivity. Our results show that the degree to which samples are interconnected is a more powerful predictor of instability than the degree to which they deviate from their nearest centroid. The removal of weakly interconnected samples resulted in more stable classifications, although solutions with many clusters were apparently inherently less stable than those with few clusters, and improvements in stability flowing from the removal of outliers declined as the number of clusters increased. Our results reinforce the fact that clusters abstracted from continuous data are inherently unstable and that the quest for stable, fine-scale classifications from large regional datasets is illusory. Nevertheless, our results show that using models better suited to the analysis of continuous data may yield more stable classifications of the available data.

Identifiants

pubmed: 38020702
doi: 10.1002/ece3.10757
pii: ECE310757
pmc: PMC10659940
doi:

Types de publication

Journal Article

Langues

eng

Pagination

e10757

Informations de copyright

© 2023 The Authors. Ecology and Evolution published by John Wiley & Sons Ltd.

Déclaration de conflit d'intérêts

The authors declare no conflicts of interest exist in the presentation of this work.

Références

J Environ Manage. 2017 Nov 1;202(Pt 2):447-460
pubmed: 27839846
Nature. 2022 Oct;610(7932):513-518
pubmed: 36224387
Trends Ecol Evol. 1986 Dec;1(6):161-4
pubmed: 21227805
Ecol Evol. 2022 Nov 18;12(11):e9496
pubmed: 36415880
Philos Trans R Soc Lond B Biol Sci. 2015 Feb 19;370(1662):20140003
pubmed: 25561664
Stud Hist Philos Biol Biomed Sci. 2007 Mar;38(1):85-109
pubmed: 17324810

Auteurs

Mark G Tozer (MG)

NSW Department of Planning and Environment Parramatta New South Wales Australia.
School of Biological, Earth and Environmental Science, Centre for Ecosystem Science University of NSW Sydney New South Wales Australia.

David A Keith (DA)

School of Biological, Earth and Environmental Science, Centre for Ecosystem Science University of NSW Sydney New South Wales Australia.

Classifications MeSH