Soundgen: An open-source tool for synthesizing nonverbal vocalizations.
Animal vocalizations
Emotion
Formant synthesis
Nonverbal vocalizations
Open source
Parametric synthesis
Voice synthesis
Journal
Behavior research methods
ISSN: 1554-3528
Titre abrégé: Behav Res Methods
Pays: United States
ID NLM: 101244316
Informations de publication
Date de publication:
04 2019
04 2019
Historique:
pubmed:
29
7
2018
medline:
13
7
2019
entrez:
29
7
2018
Statut:
ppublish
Résumé
Voice synthesis is a useful method for investigating the communicative role of different acoustic features. Although many text-to-speech systems are available, researchers of human nonverbal vocalizations and bioacousticians may profit from a dedicated simple tool for synthesizing and manipulating natural-sounding vocalizations. Soundgen ( https://CRAN.R-project.org/package=soundgen ) is an open-source R package that synthesizes nonverbal vocalizations based on meaningful acoustic parameters, which can be specified from the command line or in an interactive app. This tool was validated by comparing the perceived emotion, valence, arousal, and authenticity of 60 recorded human nonverbal vocalizations (screams, moans, laughs, and so on) and their approximate synthetic reproductions. Each synthetic sound was created by manually specifying only a small number of high-level control parameters, such as syllable length and a few anchors for the intonation contour. Nevertheless, the valence and arousal ratings of synthetic sounds were similar to those of the original recordings, and the authenticity ratings were comparable, maintaining parity with the originals for less complex vocalizations. Manipulating the precise acoustic characteristics of synthetic sounds may shed light on the salient predictors of emotion in the human voice. More generally, soundgen may prove useful for any studies that require precise control over the acoustic features of nonspeech sounds, including research on animal vocalizations and auditory perception.
Identifiants
pubmed: 30054898
doi: 10.3758/s13428-018-1095-7
pii: 10.3758/s13428-018-1095-7
pmc: PMC6478631
doi:
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Pagination
778-792Références
Proc Biol Sci. 2005 May 7;272(1566):941-7
pubmed: 16024350
J Neurophysiol. 2006 Feb;95(2):1244-62
pubmed: 16207780
J Voice. 2007 Sep;21(5):531-40
pubmed: 16647247
J Acoust Soc Am. 2007 Mar;121(3):1758-67
pubmed: 17407912
J Neurosci Methods. 2008 May 15;170(1):45-55
pubmed: 18289695
J Acoust Soc Am. 2008 May;123(5):2903-9
pubmed: 18529206
Q J Exp Psychol (Hove). 2010 Nov;63(11):2251-72
pubmed: 20437296
Behav Res Methods. 2010 Nov;42(4):1030-41
pubmed: 21139170
J Acoust Soc Am. 1990 Feb;87(2):820-57
pubmed: 2137837
Emotion. 2012 Oct;12(5):1161-79
pubmed: 22081890
Behav Res Methods. 2013 Dec;45(4):1234-45
pubmed: 23444120
Front Neurosci. 2014 Dec 22;8:422
pubmed: 25565951
J Acoust Soc Am. 2015 Jul;138(1):1-10
pubmed: 26233000
Proc Natl Acad Sci U S A. 2016 Jan 26;113(4):948-53
pubmed: 26755584
Behav Res Methods. 2017 Apr;49(2):758-771
pubmed: 27130172
Q J Exp Psychol (Hove). 2018 Mar;71(3):622-641
pubmed: 27937389
Behav Res Methods. 2018 Feb;50(1):323-343
pubmed: 28374144
J Exp Biol. 2017 Oct 1;220(Pt 19):3571-3578
pubmed: 28778999
PLoS One. 2017 Aug 29;12(8):e0183811
pubmed: 28850589
J Nonverbal Behav. 2018;42(1):53-80
pubmed: 29497221
J Acoust Soc Am. 1985 Apr;77(4):1560-75
pubmed: 3989111
J Acoust Soc Am. 1971 Feb;49(2):Suppl 2:583+
pubmed: 5541751
Phonetica. 1984;41(1):1-16
pubmed: 6204347
J Neurosci. 1983 May;3(5):1039-57
pubmed: 6842281
J Pers Soc Psychol. 1996 Mar;70(3):614-36
pubmed: 8851745