Autism Detection in Children: Integrating Machine Learning and Natural Language Processing in Narrative Analysis.
autism spectrum disorder
early diagnosis
expressive vocabulary
machine learning
narrative production
natural language processing
Journal
Behavioral sciences (Basel, Switzerland)
ISSN: 2076-328X
Titre abrégé: Behav Sci (Basel)
Pays: Switzerland
ID NLM: 101576826
Informations de publication
Date de publication:
29 May 2024
29 May 2024
Historique:
received:
31
03
2024
revised:
24
05
2024
accepted:
25
05
2024
medline:
26
6
2024
pubmed:
26
6
2024
entrez:
26
6
2024
Statut:
epublish
Résumé
Despite the consensus that early identification leads to better outcomes for individuals with autism spectrum disorder (ASD), recent research reveals that the average age of diagnosis in the Greek population is approximately six years. However, this age of diagnosis is delayed by an additional two years for families from lower-income or minority backgrounds. These disparities result in adverse impacts on intervention outcomes, which are further burdened by the often time-consuming and labor-intensive language assessments for children with ASD. There is a crucial need for tools that increase access to early assessment and diagnosis that will be rigorous and objective. The current study leverages the capabilities of artificial intelligence to develop a reliable and practical model for distinguishing children with ASD from typically-developing peers based on their narrative and vocabulary skills. We applied natural language processing-based extraction techniques to automatically acquire language features (narrative and vocabulary skills) from storytelling in 68 children with ASD and 52 typically-developing children, and then trained machine learning models on the children's combined narrative and expressive vocabulary data to generate behavioral targets that effectively differentiate ASD from typically-developing children. According to the findings, the model could distinguish ASD from typically-developing children, achieving an accuracy of 96%. Specifically, out of the models used, hist gradient boosting and XGBoost showed slightly superior performance compared to the decision trees and gradient boosting models, particularly regarding accuracy and F1 score. These results bode well for the deployment of machine learning technology for children with ASD, especially those with limited access to early identification services.
Identifiants
pubmed: 38920791
pii: bs14060459
doi: 10.3390/bs14060459
pii:
doi:
Types de publication
Journal Article
Langues
eng
Subventions
Organisme : H.F.R.I call "Basic research Financing (Horizontal support of all Sciences)" under the National Recovery and Resilience Plan "Greece 2.0" funded by the European Union -NextGenerationEU
ID : Language Phenotyping in Autism Using Machine Learning" (H.F.R.I. Project Number: 14864), P.I.: Eleni Peristeri.