ArabBert-LSTM: improving Arabic sentiment analysis based on transformer model and Long Short-Term Memory.

Arabic sentiment analysis Long Short-Term Memory deep learning machine learning sentiment analysis transformer models

Journal

Frontiers in artificial intelligence

ISSN: 2624-8212

Titre abrégé: Front Artif Intell

Pays: Switzerland

ID NLM: 101770551

Informations de publication

Date de publication:
2024

Historique:

received: 28 03 2024

accepted: 14 06 2024

medline: 17 7 2024

pubmed: 17 7 2024

entrez: 17 7 2024

Statut: epublish

Résumé

Sentiment analysis also referred to as opinion mining, plays a significant role in automating the identification of negative, positive, or neutral sentiments expressed in textual data. The proliferation of social networks, review sites, and blogs has rendered these platforms valuable resources for mining opinions. Sentiment analysis finds applications in various domains and languages, including English and Arabic. However, Arabic presents unique challenges due to its complex morphology characterized by inflectional and derivation patterns. To effectively analyze sentiment in Arabic text, sentiment analysis techniques must account for this intricacy. This paper proposes a model designed using the transformer model and deep learning (DL) techniques. The word embedding is represented by Transformer-based Model for Arabic Language Understanding (ArabBert), and then passed to the AraBERT model. The output of AraBERT is subsequently fed into a Long Short-Term Memory (LSTM) model, followed by feedforward neural networks and an output layer. AraBERT is used to capture rich contextual information and LSTM to enhance sequence modeling and retain long-term dependencies within the text data. We compared the proposed model with machine learning (ML) algorithms and DL algorithms, as well as different vectorization techniques: term frequency-inverse document frequency (TF-IDF), ArabBert, Continuous Bag-of-Words (CBOW), and skipGrams using four Arabic benchmark datasets. Through extensive experimentation and evaluation of Arabic sentiment analysis datasets, we showcase the effectiveness of our approach. The results underscore significant improvements in sentiment analysis accuracy, highlighting the potential of leveraging transformer models for Arabic Sentiment Analysis. The outcomes of this research contribute to advancing Arabic sentiment analysis, enabling more accurate and reliable sentiment analysis in Arabic text. The findings reveal that the proposed framework exhibits exceptional performance in sentiment classification, achieving an impressive accuracy rate of over 97%.

Identifiants

DOI: 10.3389/frai.2024.1408845 PMID: 39015364 PMC: PMC11250580

pubmed: 39015364

doi: 10.3389/frai.2024.1408845

pmc: PMC11250580

doi:

Types de publication

Journal Article

Langues

eng

Pagination

1408845

Informations de copyright

Déclaration de conflit d'intérêts

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

ArabBert-LSTM: improving Arabic sentiment analysis based on transformer model and Long Short-Term Memory.

Journal

Informations de publication

Résumé

Identifiants

Types de publication

Langues

Pagination

Informations de copyright

Déclaration de conflit d'intérêts

Auteurs

Wael Alosaimi (W)

Hager Saleh (H)

Ali A Hamzah (AA)

Nora El-Rashidy (N)

Abdullah Alharb (A)

Ahmed Elaraby (A)

Sherif Mostafa (S)

Classifications MeSH