A weighted sparse coding model on product Grassmann manifold for video-based human gesture recognition.
Human gesture recognition
Product Grassmann manifold
Sparse coding
Video classification
Journal
PeerJ. Computer science
ISSN: 2376-5992
Titre abrégé: PeerJ Comput Sci
Pays: United States
ID NLM: 101660598
Informations de publication
Date de publication:
2022
2022
Historique:
received:
10
11
2021
accepted:
18
02
2022
entrez:
2
5
2022
pubmed:
3
5
2022
medline:
3
5
2022
Statut:
epublish
Résumé
It is a challenging problem to classify multi-dimensional data with complex intrinsic geometry inherent, such as human gesture recognition based on videos. In particular, manifold structure is a good way to characterize intrinsic geometry of multi-dimensional data. The recently proposed sparse coding on Grassmann manifold shows high discriminative power in many visual classification tasks. It represents videos on Grassmann manifold using Singular Value Decomposition (SVD) of the data matrix by vectorizing each image in videos, while vectorization destroys the spatial structure of videos. To keep the spatial structure of videos, they can be represented as the form of data tensor. In this paper, we firstly represent human gesture videos on product Grassmann manifold (PGM) by Higher Order Singular Value Decomposition (HOSVD) of data tensor. Each factor manifold characterizes features of human gesture video from different perspectives and can be understood as appearance, horizontal motion and vertical motion of human gesture video respectively. We then propose a weighted sparse coding model on PGM, where weights can be understood as modeling the importance of factor manifolds. Furthermore, we propose an optimization algorithm for learning coding coefficients by embedding each factor Grassmann manifold into symmetric matrices space. Finally, we give a classification algorithm, and experimental results on three public datasets show that our method is competitive to some relevant excellent methods.
Identifiants
pubmed: 35494827
doi: 10.7717/peerj-cs.923
pii: cs-923
pmc: PMC9044265
doi:
Types de publication
Journal Article
Langues
eng
Pagination
e923Informations de copyright
©2022 Wang and Zhang.
Déclaration de conflit d'intérêts
The authors declare there are no competing interests.
Références
IEEE Trans Pattern Anal Mach Intell. 2013 Nov;35(11):2765-81
pubmed: 24051734
IEEE Trans Pattern Anal Mach Intell. 2013 Jan;35(1):221-31
pubmed: 22392705
IEEE Trans Pattern Anal Mach Intell. 2009 Oct;31(10):1762-74
pubmed: 19696448
IEEE Trans Pattern Anal Mach Intell. 2009 Aug;31(8):1415-28
pubmed: 19542576
Sensors (Basel). 2019 Jan 10;19(2):
pubmed: 30634583
IEEE Trans Pattern Anal Mach Intell. 2009 Feb;31(2):210-27
pubmed: 19110489
J Opt Soc Am A Opt Image Sci Vis. 2018 Jul 1;35(7):1149-1159
pubmed: 30110307