SAMI: an M-Health application to telemonitor intelligibility and speech disorder severity in head and neck cancers.

deep learning head and neck cancer healthcare application speaker embeddings speech intelligibility

Journal

Frontiers in artificial intelligence
ISSN: 2624-8212
Titre abrégé: Front Artif Intell
Pays: Switzerland
ID NLM: 101770551

Informations de publication

Date de publication:
2024
Historique:
received: 20 12 2023
accepted: 19 04 2024
medline: 27 5 2024
pubmed: 27 5 2024
entrez: 27 5 2024
Statut: epublish

Résumé

Perceptual measures, such as intelligibility and speech disorder severity, are widely used in the clinical assessment of speech disorders in patients treated for oral or oropharyngeal cancer. Despite their widespread usage, these measures are known to be subjective and hard to reproduce. Therefore, an M-Health assessment based on an automatic prediction has been seen as a more robust and reliable alternative. Despite recent progress, these automatic approaches still remain somewhat theoretical, and a need to implement them in real clinical practice rises. Hence, in the present work we introduce SAMI, a clinical mobile application used to predict speech intelligibility and disorder severity as well as to monitor patient progress on these measures over time. The first part of this work illustrates the design and development of the systems supported by SAMI. Here, we show how deep neural speaker embeddings are used to automatically regress speech disorder measurements (intelligibility and severity), as well as the training and validation of the system on a French corpus of head and neck cancer. Furthermore, we also test our model on a secondary corpus recorded in real clinical conditions. The second part details the results obtained from the deployment of our system in a real clinical environment, over the course of several weeks. In this section, the results obtained with SAMI are compared to an

Identifiants

pubmed: 38800762
doi: 10.3389/frai.2024.1359094
pmc: PMC11119748
doi:

Types de publication

Journal Article

Langues

eng

Pagination

1359094

Informations de copyright

Copyright © 2024 Quintas, Vaysse, Balaguer, Roger, Mauclair, Farinas, Woisard and Pinquier.

Déclaration de conflit d'intérêts

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Auteurs

Sebastião Quintas (S)

IRIT, Université de Toulouse, CNRS, Toulouse INP, UT3, Toulouse, France.

Robin Vaysse (R)

IRIT, Université de Toulouse, CNRS, Toulouse INP, UT3, Toulouse, France.

Mathieu Balaguer (M)

IRIT, Université de Toulouse, CNRS, Toulouse INP, UT3, Toulouse, France.

Vincent Roger (V)

IRIT, Université de Toulouse, CNRS, Toulouse INP, UT3, Toulouse, France.

Julie Mauclair (J)

IRIT, Université de Toulouse, CNRS, Toulouse INP, UT3, Toulouse, France.

Jérôme Farinas (J)

IRIT, Université de Toulouse, CNRS, Toulouse INP, UT3, Toulouse, France.

Virginie Woisard (V)

IUC Toulouse, CHU Toulouse, Service ORL de l'Hôpital Larrey, Toulouse, France.
Laboratoire de NeuroPsychoLinguistique, UR 4156, Université de Toulouse, Toulouse, France.

Julien Pinquier (J)

IRIT, Université de Toulouse, CNRS, Toulouse INP, UT3, Toulouse, France.

Classifications MeSH