Affective Action and Interaction Recognition by Multi-View Representation Learning from Handcrafted Low-Level Skeleton Features.

Algorithms Cluster Analysis Facial Expression Human Activities Humans Skeleton

Affective action affective interaction bag-of-visual-words handcrafted low-level skeleton features multi-view representation learning

Journal

International journal of neural systems

ISSN: 1793-6462

Titre abrégé: Int J Neural Syst

Pays: Singapore

ID NLM: 9100527

Informations de publication

Date de publication:
Oct 2022

Historique:

pubmed: 27 7 2022

medline: 24 9 2022

entrez: 26 7 2022

Statut: ppublish

Résumé

Human feelings expressed through verbal (e.g. voice) and non-verbal communication channels (e.g. face or body) can influence either human actions or interactions. In the literature, most of the attention was given to facial expressions for the analysis of emotions conveyed through non-verbal behaviors. Despite this, psychology highlights that the body is an important indicator of the human affective state in performing daily life activities. Therefore, this paper presents a novel method for affective action and interaction recognition from videos, exploiting multi-view representation learning and only full-body handcrafted characteristics selected following psychological and proxemic studies. Specifically, 2D skeletal data are extracted from RGB video sequences to derive diverse low-level skeleton features, i.e. multi-views, modeled through the bag-of-visual-words clustering approach generating a condition-related codebook. In this way, each affective action and interaction within a video can be represented as a frequency histogram of codewords. During the learning phase, for each affective class, training samples are used to compute its global histogram of codewords stored in a database and later used for the recognition task. In the recognition phase, the video frequency histogram representation is matched against the database of class histograms and classified as the closest affective class in terms of Euclidean distance. The effectiveness of the proposed system is evaluated on a specifically collected dataset containing 6 emotion for both actions and interactions, on which the proposed system obtains 93.64% and 90.83% accuracy, respectively. In addition, the devised strategy also achieves in line performances with other literature works based on deep learning when tested on a public collection containing 6 emotions plus a neutral state, demonstrating the effectiveness of the presented approach and confirming the findings in psychological and proxemic studies.

Identifiants

DOI: 10.1142/S012906572250040X PMID: 35881015

pubmed: 35881015

doi: 10.1142/S012906572250040X

doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

Pagination

2250040

Affective Action and Interaction Recognition by Multi-View Representation Learning from Handcrafted Low-Level Skeleton Features.

Journal

Informations de publication

Résumé

Identifiants

Types de publication

Langues

Sous-ensembles de citation

Pagination

Auteurs

Danilo Avola (D)

Marco Cascio (M)

Luigi Cinque (L)

Alessio Fagioli (A)

Gian Luca Foresti (GL)

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Smoking Cessation and Incident Cardiovascular Disease.

Evaluation of Low-Value Services Across Major Medicare Advantage Insurers and Traditional Medicare.

Effectiveness of Virtual Yoga for Chronic Low Back Pain: A Randomized Clinical Trial.

Classifications MeSH