From Leiden to Tel-Aviv University (TAU): exploring clustering solutions via a genetic algorithm.
community detection
graph clustering
modularity optimization
Journal
PNAS nexus
ISSN: 2752-6542
Titre abrégé: PNAS Nexus
Pays: England
ID NLM: 9918367777906676
Informations de publication
Date de publication:
Jun 2023
Jun 2023
Historique:
received:
15
03
2023
accepted:
22
05
2023
medline:
8
6
2023
pubmed:
8
6
2023
entrez:
8
6
2023
Statut:
epublish
Résumé
Graph clustering is a fundamental problem in machine learning with numerous applications in data science. State-of-the-art approaches to the problem, Louvain and Leiden, aim at optimizing the modularity function. However, their greedy nature leads to fast convergence to sub-optimal solutions. Here, we design a new approach to graph clustering, Tel-Aviv University (TAU), that efficiently explores the solution space using a genetic algorithm. We benchmark TAU on synthetic and real data sets and show its superiority over previous methods both in terms of the modularity of the computed solution and its similarity to a ground-truth partition when such exists. TAU is available at https://github.com/GalGilad/TAU.
Identifiants
pubmed: 37287709
doi: 10.1093/pnasnexus/pgad180
pii: pgad180
pmc: PMC10244004
doi:
Types de publication
Journal Article
Langues
eng
Pagination
pgad180Informations de copyright
© The Author(s) 2023. Published by Oxford University Press on behalf of National Academy of Sciences.
Références
Phys Rev E Stat Nonlin Soft Matter Phys. 2006 Sep;74(3 Pt 2):036104
pubmed: 17025705
Sci Adv. 2017 May 03;3(5):e1602548
pubmed: 28508065
Phys Rev E Stat Nonlin Soft Matter Phys. 2004 Feb;69(2 Pt 2):026113
pubmed: 14995526
Phys Rev E Stat Nonlin Soft Matter Phys. 2010 Apr;81(4 Pt 2):046106
pubmed: 20481785
Sci Rep. 2018 Jul 18;8(1):10872
pubmed: 30022098
Nucleic Acids Res. 2019 Jan 8;47(D1):D607-D613
pubmed: 30476243
Sci Rep. 2019 Mar 26;9(1):5233
pubmed: 30914743
Phys Rev E Stat Nonlin Soft Matter Phys. 2004 Jun;69(6 Pt 2):066133
pubmed: 15244693
PLoS One. 2014 Jan 29;9(1):e85777
pubmed: 24489671
Phys Rev E Stat Nonlin Soft Matter Phys. 2008 Oct;78(4 Pt 2):046110
pubmed: 18999496