MIT: Mutual Information Topic Model for Diverse Topic Extraction.


Journal

IEEE transactions on neural networks and learning systems
ISSN: 2162-2388
Abbreviated title: IEEE Trans Neural Netw Learn Syst
Country: United States
NLM ID: 101616214

Publication information

Publication date:
07 Feb 2024
History:
medline: 7 Feb 2024
pubmed: 7 Feb 2024
entrez: 7 Feb 2024
Status: aheadofprint

Abstract

To automatically mine structured semantic topics from text, neural topic modeling has emerged and made notable progress. However, most existing work focuses on designing mechanisms that enhance topic coherence while sacrificing the diversity of the extracted topics. To address this limitation, we propose in this article the first neural topic modeling approach based purely on mutual information maximization, called the mutual information topic (MIT) model. MIT significantly improves topic diversity by maximizing the mutual information between the word distribution and the topic distribution. Meanwhile, MIT also uses a Dirichlet prior in the latent topic space to ensure the quality of the mined topics. Experimental results on three public benchmark text corpora show that MIT extracts topics with higher coherence (on four topic coherence metrics) than competitive approaches and yields a significant improvement on the topic diversity metric. Moreover, our experiments show that the proposed MIT converges faster and more stably than adversarial-neural topic models.
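The quantity the abstract says MIT maximizes — the mutual information between the topic distribution and the word distribution — can be illustrated with a minimal sketch for discrete distributions. The function name and setup below are ours for illustration, not the paper's implementation or training objective:

```python
import math

def topic_word_mutual_information(topic_word, topic_prior):
    """I(Z; W) = sum over z, w of p(z, w) * log( p(z, w) / (p(z) p(w)) ).

    topic_word[k][v] = p(w = v | z = k)  -- one row per topic.
    topic_prior[k]   = p(z = k)          -- prior over topics.
    Illustrative sketch only; MIT itself optimizes a neural estimate
    of this quantity, which is not reproduced here.
    """
    K, V = len(topic_word), len(topic_word[0])
    # Joint distribution p(z, w) = p(z) * p(w | z).
    joint = [[topic_prior[k] * topic_word[k][v] for v in range(V)]
             for k in range(K)]
    # Marginal word distribution p(w) = sum over z of p(z, w).
    p_w = [sum(joint[k][v] for k in range(K)) for v in range(V)]
    mi = 0.0
    for k in range(K):
        for v in range(V):
            if joint[k][v] > 0:
                mi += joint[k][v] * math.log(joint[k][v]
                                             / (topic_prior[k] * p_w[v]))
    return mi
```

Intuitively, this connects MI to diversity: if every topic has the same word distribution (zero diversity), I(Z; W) = 0; if K equiprobable topics have disjoint word supports (maximal diversity), I(Z; W) = log K, so pushing the objective up pushes topics apart.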

Identifiers

pubmed: 38324432
doi: 10.1109/TNNLS.2024.3357698

Publication types

Journal Article

Languages

eng

Citation subsets

IM
