Cross-Correlation Based Automated Segmentation of Audio Samples.
Journal
Studies in health technology and informatics
ISSN: 1879-8365
Titre abrégé: Stud Health Technol Inform
Pays: Netherlands
ID NLM: 9214582
Informations de publication
Date de publication:
26 Jun 2020
26 Jun 2020
Historique:
entrez:
2
7
2020
pubmed:
2
7
2020
medline:
15
7
2020
Statut:
ppublish
Résumé
This paper presents an audio file segmentation method in an attempt to mitigate the issue of variable durations of the same utterance by different individuals, e.g.: Speech-Language Pathologist (SLP) and dyslalic subjects. The Method section describes the manner of determination of the maximum cross-correlation value between the 2 audio files and the subsequent automated segmentation thereof in order to extract 2 valid pronunciation samples of the target consonant. The method is aimed at pre-processing audio files and supplying homogeneously-trimmed audio samples to a computerized SSD Screening system. The results obtained on a batch of 30 pronunciations are presented and briefly discussed in the third section while the last section is reserved for conclusions and perspectives.
Identifiants
pubmed: 32604646
pii: SHTI200539
doi: 10.3233/SHTI200539
doi:
Types de publication
Journal Article
Langues
eng