Skeleton-Based Emotion Recognition Based on Two-Stream Self-Attention Enhanced Spatial-Temporal Graph Convolutional Network.

emotion recognition gesture graph convolutional networks self-attention skeleton

Journal

Sensors (Basel, Switzerland)

ISSN: 1424-8220

Titre abrégé: Sensors (Basel)

Pays: Switzerland

ID NLM: 101204366

Informations de publication

Date de publication:
30 Dec 2020

Historique:

received: 30 11 2020

revised: 24 12 2020

accepted: 27 12 2020

entrez: 5 1 2021

pubmed: 6 1 2021

medline: 6 1 2021

Statut: epublish

Résumé

Emotion recognition has drawn consistent attention from researchers recently. Although gesture modality plays an important role in expressing emotion, it is seldom considered in the field of emotion recognition. A key reason is the scarcity of labeled data containing 3D skeleton data. Some studies in action recognition have applied graph-based neural networks to explicitly model the spatial connection between joints. However, this method has not been considered in the field of gesture-based emotion recognition, so far. In this work, we applied a pose estimation based method to extract 3D skeleton coordinates for IEMOCAP database. We propose a self-attention enhanced spatial temporal graph convolutional network for skeleton-based emotion recognition, in which the spatial convolutional part models the skeletal structure of the body as a static graph, and the self-attention part dynamically constructs more connections between the joints and provides supplementary information. Our experiment demonstrates that the proposed model significantly outperforms other models and that the features of the extracted skeleton data improve the performance of multimodal emotion recognition.

Identifiants

DOI: 10.3390/s21010205 PMID: 33396917 PMC: PMC7795329

pubmed: 33396917

pii: s21010205

doi: 10.3390/s21010205

pmc: PMC7795329

pii:

doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

Subventions

Organisme : Grant-in-Aid for Scientific Research on Innovative Areas

ID : JP20H05576

Organisme : JST ERATO

ID : JPMJER1401

Références

Perception. 2013;42(6):642-57

pubmed: 24422246

IEEE Trans Neural Netw Learn Syst. 2021 Jan;32(1):4-24

pubmed: 32217482

IEEE Trans Image Process. 2020 Oct 09;PP:

pubmed: 33035162

Entropy (Basel). 2019 Jun 29;21(7):

pubmed: 33267360

Skeleton-Based Emotion Recognition Based on Two-Stream Self-Attention Enhanced Spatial-Temporal Graph Convolutional Network.

Journal

Informations de publication

Résumé

Identifiants

Types de publication

Langues

Sous-ensembles de citation

Subventions

Références

Auteurs

Jiaqi Shi (J)

Chaoran Liu (C)

Carlos Toshinori Ishi (CT)

Hiroshi Ishiguro (H)

Classifications MeSH