FRAUG: A FRAME RATE BASED DATA AUGMENTATION METHOD FOR DEPRESSION DETECTION FROM SPEECH SIGNALS.
data augmentation
depression detection
frame rate
time-frequency resolution
x-vector
Journal
Proceedings of the ... IEEE International Conference on Acoustics, Speech, and Signal Processing. ICASSP (Conference)
ISSN: 1520-6149
Abbreviated title: Proc IEEE Int Conf Acoust Speech Signal Process
Country: United States
NLM ID: 101182171
Publication information
Publication date:
May 2022
History:
entrez: 9/5/2022
pubmed: 10/5/2022
medline: 10/5/2022
Status:
ppublish
Abstract
This paper proposes a data augmentation method for depression detection from speech signals. Augmented samples are created by changing the frame-width and frame-shift parameters during feature extraction. Unlike other data augmentation methods (such as vocal tract length perturbation (VTLP), pitch perturbation, or speed perturbation), the proposed method does not explicitly change acoustic parameters but rather the time-frequency resolution of frame-level features. The method was evaluated using two different datasets, models, and input acoustic features. For the DAIC-WOZ (English) dataset, using the DepAudioNet model with mel-spectrograms as input, the proposed method improved on the baseline by 5.97% (validation) and 25.13% (test). For the CONVERGE (Mandarin) dataset, using x-vector embeddings with a CNN backend and MFCCs as input features, the improvements were 9.32% (validation) and 12.99% (test). The baseline systems do not incorporate any data augmentation. Further, the proposed method outperformed commonly used data augmentation methods such as noise augmentation, VTLP, speed perturbation, and pitch perturbation. All improvements were statistically significant.
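The core idea in the abstract, extracting features from the same recording under several frame-width and frame-shift settings, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the three (width, shift) pairs, the FFT size, and the use of a plain log-power spectrogram in place of mel-spectrograms or MFCCs are all assumptions made for illustration.

```python
import numpy as np


def frame_signal(signal, sample_rate, width_ms, shift_ms):
    """Slice a 1-D signal into overlapping frames, one frame per row."""
    width = int(round(sample_rate * width_ms / 1000.0))
    shift = int(round(sample_rate * shift_ms / 1000.0))
    if len(signal) < width:
        raise ValueError("signal is shorter than one frame")
    n_frames = 1 + (len(signal) - width) // shift
    starts = shift * np.arange(n_frames)
    index = starts[:, None] + np.arange(width)[None, :]
    return signal[index]


def log_power_spectrogram(signal, sample_rate, width_ms, shift_ms, n_fft=1024):
    """Frame-level log-power spectrum (a stand-in for mel-spectrograms/MFCCs)."""
    frames = frame_signal(signal, sample_rate, width_ms, shift_ms)
    window = np.hamming(frames.shape[1])
    spectrum = np.fft.rfft(frames * window, n=n_fft, axis=1)
    return np.log(np.abs(spectrum) ** 2 + 1e-10)


def frame_rate_augment(signal, sample_rate, settings):
    """Return one feature matrix per (width_ms, shift_ms) setting."""
    return [log_power_spectrogram(signal, sample_rate, w, s) for (w, s) in settings]


# Hypothetical settings: a common 25 ms / 10 ms default plus two variations.
# The values used in the paper are not given in the abstract.
SETTINGS = [(25.0, 10.0), (20.0, 8.0), (30.0, 12.0)]

rng = np.random.default_rng(0)
sample_rate = 16000
signal = rng.standard_normal(sample_rate)  # 1 s of noise standing in for speech

features = frame_rate_augment(signal, sample_rate, SETTINGS)
shapes = [f.shape for f in features]
```

Each setting yields a different number of frames for the same audio, and the window length changes how finely each frame resolves frequency, so the model sees the same utterance at several time-frequency resolutions without the waveform itself being altered.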
Identifiers
pubmed: 35531125
doi: 10.1109/icassp43922.2022.9746307
pmc: PMC9070766
mid: NIHMS1798595
Publication types
Journal Article
Languages
eng
Pagination
6267-6271
Grants
Agency: NIMH NIH HHS
ID: R01 MH122569
Country: United States