Autism Detection in Children: Integrating Machine Learning and Natural Language Processing in Narrative Analysis.

autism spectrum disorder early diagnosis expressive vocabulary machine learning narrative production natural language processing

Journal

Behavioral sciences (Basel, Switzerland)
ISSN: 2076-328X
Titre abrégé: Behav Sci (Basel)
Pays: Switzerland
ID NLM: 101576826

Informations de publication

Date de publication:
29 May 2024
Historique:
received: 31 03 2024
revised: 24 05 2024
accepted: 25 05 2024
medline: 26 6 2024
pubmed: 26 6 2024
entrez: 26 6 2024
Statut: epublish

Résumé

Despite the consensus that early identification leads to better outcomes for individuals with autism spectrum disorder (ASD), recent research reveals that the average age of diagnosis in the Greek population is approximately six years. However, this age of diagnosis is delayed by an additional two years for families from lower-income or minority backgrounds. These disparities result in adverse impacts on intervention outcomes, which are further burdened by the often time-consuming and labor-intensive language assessments for children with ASD. There is a crucial need for tools that increase access to early assessment and diagnosis that will be rigorous and objective. The current study leverages the capabilities of artificial intelligence to develop a reliable and practical model for distinguishing children with ASD from typically-developing peers based on their narrative and vocabulary skills. We applied natural language processing-based extraction techniques to automatically acquire language features (narrative and vocabulary skills) from storytelling in 68 children with ASD and 52 typically-developing children, and then trained machine learning models on the children's combined narrative and expressive vocabulary data to generate behavioral targets that effectively differentiate ASD from typically-developing children. According to the findings, the model could distinguish ASD from typically-developing children, achieving an accuracy of 96%. Specifically, out of the models used, hist gradient boosting and XGBoost showed slightly superior performance compared to the decision trees and gradient boosting models, particularly regarding accuracy and F1 score. These results bode well for the deployment of machine learning technology for children with ASD, especially those with limited access to early identification services.

Identifiants

pubmed: 38920791
pii: bs14060459
doi: 10.3390/bs14060459
pii:
doi:

Types de publication

Journal Article

Langues

eng

Subventions

Organisme : H.F.R.I call "Basic research Financing (Horizontal support of all Sciences)" under the National Recovery and Resilience Plan "Greece 2.0" funded by the European Union -NextGenerationEU
ID : Language Phenotyping in Autism Using Machine Learning" (H.F.R.I. Project Number: 14864), P.I.: Eleni Peristeri.

Auteurs

Charalambos K Themistocleous (CK)

Department of Special Needs Education, Faculty of Educational Sciences, University of Oslo, 0313 Oslo, Norway.

Maria Andreou (M)

Department of Speech and Language Therapy, University of Peloponnese, 24100 Kalamata, Greece.

Eleni Peristeri (E)

School of English, Aristotle University of Thessaloniki, 54124 Thessaloniki, Greece.

Classifications MeSH