Automatic language analysis identifies and predicts schizophrenia in first-episode of psychosis.


Journal

Schizophrenia (Heidelberg, Germany)
ISSN: 2754-6993
Titre abrégé: Schizophrenia (Heidelb)
Pays: Germany
ID NLM: 9918367987006676

Informations de publication

Date de publication:
01 Jun 2022
Historique:
received: 18 07 2021
accepted: 18 04 2022
entrez: 19 7 2022
pubmed: 20 7 2022
medline: 20 7 2022
Statut: epublish

Résumé

Automated language analysis of speech has been shown to distinguish healthy control (HC) vs chronic schizophrenia (SZ) groups, yet the predictive power on first-episode psychosis patients (FEP) and the generalization to non-English speakers remain unclear. We performed a cross-sectional and longitudinal (18 months) automated language analysis in 133 Spanish-speaking subjects from three groups: healthy control or HC (n = 49), FEP (n = 40), and chronic SZ (n = 44). Interviews were manually transcribed, and the analysis included 30 language features (4 verbal fluency; 20 verbal productivity; 6 semantic coherence). Our cross-sectional analysis showed that using the top ten ranked and decorrelated language features, an automated HC vs SZ classification achieved 85.9% accuracy. In our longitudinal analysis, 28 FEP patients were diagnosed with SZ at the end of the study. Here, combining demographics, PANSS, and language information, the prediction accuracy reached 77.5% mainly driven by semantic coherence information. Overall, we showed that language features from Spanish-speaking clinical interviews can distinguish HC vs chronic SZ, and predict SZ diagnosis in FEP patients.

Identifiants

pubmed: 35853943
doi: 10.1038/s41537-022-00259-3
pii: 10.1038/s41537-022-00259-3
pmc: PMC9261086
doi:

Types de publication

Journal Article

Langues

eng

Pagination

53

Subventions

Organisme : Ministry of Education, Government of Chile | National Commission for Scientific and Technological Research | Fondo de Fomento al Desarrollo Científico y Tecnológico (FONDEF)
ID : ID20I10371

Informations de copyright

© 2022. The Author(s).

Références

Tandon, R., Nasrallah, H. A. & Keshavan, M. S. Schizophrenia, ‘Just the facts’ 5. Treatment and prevention. Past, present, and future. Schizophr. Res. 122, 1–23 (2010).
pubmed: 20655178 doi: 10.1016/j.schres.2010.05.025
Gaspar, P. A. et al. Early psychosis detection program in Chile: a first step for the South American challenge in psychosis research. Early Interv. Psychiatry 13, 328–334 (2019).
pubmed: 30548415 doi: 10.1111/eip.12766
Mckenna, P. & Oh, T. M. Schizophrenic Speech: Making Sense of Bathroots and Ponds that Fall in Doorways (Cambridge University Press, 2005).
Kuperberg, G. R. Language in schizophrenia Part 1: an introduction. Lang. Linguist. Compass 4, 576–589 (2010).
pubmed: 20936080 pmcid: 2950318 doi: 10.1111/j.1749-818X.2010.00216.x
Pawełczyk, A., Kotlicka-Antczak, M., Łojek, E., Ruszpel, A. & Pawełczyk, T. Schizophrenia patients have higher-order language and extralinguistic impairments. Schizophr. Res. 192, 274–280 (2018).
pubmed: 28438437 doi: 10.1016/j.schres.2017.04.030
Covington, M. A. et al. Schizophrenia and the structure of language: the linguist’s view. Schizophrenia Research 77, 85–98 (2005).
pubmed: 16005388 doi: 10.1016/j.schres.2005.01.016
Cecchi, G. & Corcoran, C. O2.3. Automated analysis of recent-onset and prodromal schizophrenia. Schizophr. Bull. 44, S76–S76 (2018).
pmcid: 5887598 doi: 10.1093/schbul/sby015.193
Hitczenko, K., Mittal, V. A. & Goldrick, M. Understanding language abnormalities and associated clinical markers in psychosis: the promise of computational methods. Schizophr. Bull 47, 344–362 (2021).
pubmed: 33205155 doi: 10.1093/schbul/sbaa141
Foltz, P. W., Rosenstein, M. & Elvevåg, B. Detecting clinically significant events through automated language analysis: Quo imus? npj Schizophr. 2, 15054 (2016).
pubmed: 27336051 pmcid: 4849434 doi: 10.1038/npjschz.2015.54
Crystal, D. & Davy, D. Advanced Conversational English (Longman Publishing Group, 1975).
Crockford, C. & Lesser, R. Assessing functional communication in aphasia: clinical utility and time demands of three methods. Eur. J. Disord. Commun. 29, 165–182 (1994).
pubmed: 7532476 doi: 10.3109/13682829409041490
Alpert, M., Kotsaftis, A. & Pouget, E. R. At issue: speech fluency and schizophrenic negative signs. Schizophr. Bull. 23, 171–177 (1997).
pubmed: 9165627 doi: 10.1093/schbul/23.2.171
Barra, A. I. F. & Herrera, C. J. M. Las pausas en personas con diagnóstico de esquizofrenia de primer episodio. Pragmalinguistica 26, 88–108 (2018).
León, M. Relación entre nivel plasmático de BDNF y las pausas en el discurso en Esquizofrenia (Universidad de Chile, 2020).
Cohen, A. S., Mitchell, K. R., Docherty, N. M. & Horan, W. P. Vocal expression in schizophrenia: less than meets the ear. J. Abnorm. Psychol. 125, 299–309 (2016).
pubmed: 26854511 pmcid: 4967358 doi: 10.1037/abn0000136
Stanislawski, E. R. et al. Negative symptoms and speech pauses in youths at clinical high risk for psychosis. npj Schizophr. 7, 3 (2021).
pubmed: 33483485 pmcid: 7822906 doi: 10.1038/s41537-020-00132-1
Clemmer, E. J. Psycholinguistic aspects of pauses and temporal patterns in schizophrenic speech. J. Psycholinguist. Res. 9, 161–185 (1980).
pubmed: 7391993 doi: 10.1007/BF01067469
Andreasen, N. C. Scale for the assessment of thought, language, and communication (TLC). Schizophr. Bull. 12, 473–482 (1986).
pubmed: 3764363 doi: 10.1093/schbul/12.3.473
Sabbe, B., Beheydt, L., De Picker, L., Goetschalckx, J. & Daelemans, W. Computational language analysis for assessment of schizophrenia. In 2017 Annual International Conference on Cognitive & Behavioral Psychology. https://doi.org/10.5176/2251-1865_CBP17.37 (GSTF, 2017).
de Boer, J. N., Voppel, A. E., Brederoo, S. G., Wijnen, F. N. K. & Sommer, I. E. C. Language disturbances in schizophrenia: the relation with antipsychotic medication. npj Schizophr. 6, 24 (2020).
pubmed: 32895389 pmcid: 7477551 doi: 10.1038/s41537-020-00114-3
Rezaii, N., Walker, E. & Wolff, P. A machine learning approach to predicting psychosis using semantic density and latent content analysis. npj Schizophr. 5, 9 (2019).
pubmed: 31197184 pmcid: 6565626 doi: 10.1038/s41537-019-0077-9
Corcoran, C. M. et al. Prediction of psychosis across protocols and risk cohorts using automated language analysis. World Psychiatry 17, 67–75 (2018).
pubmed: 29352548 pmcid: 5775133 doi: 10.1002/wps.20491
Gupta, T., Hespos, S. J., Horton, W. S. & Mittal, V. A. Automated analysis of written narratives reveals abnormalities in referential cohesion in youth at ultra high risk for psychosis. Schizophr. Res. 192, 82–88 (2018).
pubmed: 28454920 doi: 10.1016/j.schres.2017.04.025
Bedi, G. et al. Automated analysis of free speech predicts psychosis onset in high-risk youths. npj Schizophr. 1, 15030 (2015).
pubmed: 27336038 pmcid: 4849456 doi: 10.1038/npjschz.2015.30
Minor, K. S., Willits, J. A., Marggraf, M. P., Jones, M. N. & Lysaker, P. H. Measuring disorganized speech in schizophrenia: automated analysis explains variance in cognitive deficits beyond clinician-rated scales. Psychol. Med. 49, 440–448 (2019).
pubmed: 29692287 doi: 10.1017/S0033291718001046
Minor, K. S. et al. Conceptual disorganization weakens links in cognitive pathways: disentangling neurocognition, social cognition, and metacognition in schizophrenia. Schizophr. Res. 169, 153–158 (2015).
pubmed: 26441007 doi: 10.1016/j.schres.2015.09.026
Figueroa, A., Durán, E. & Oyarzún, S. La gestión temática como marcador de déficit lingüístico primario en personas con diagnóstico de primer episodio de Esquizofrenia: un estudio en una muestra chilena. RLA. Revista de lingüística teórica y aplicada 55, 117–147 (2017).
doi: 10.4067/S0718-48832017000100117
Docherty, N. M., Gordinier, S. W., Hall, M. J. & Cutting, L. P. Communication disturbances in relatives beyond the age of risk for schizophrenia and their associations with symptoms in patients. Schizophr. Bull. 25, 851–862 (1999).
pubmed: 10667753 doi: 10.1093/oxfordjournals.schbul.a033424
Gordinier, S. W. & Docherty, N. M. Factor analysis of the communication disturbances index. Psychiatry Res. 101, 55–62 (2001).
pubmed: 11223120 doi: 10.1016/S0165-1781(00)00239-0
Docherty, N. M., DeRosa, M. & Andreasen, N. C. Communication disturbances index. PsycTESTS Dataset https://doi.org/10.1037/t39394-000 (2015).
Elvevåg, B., Foltz, P. W., Weinberger, D. R. & Goldberg, T. E. Quantifying incoherence in speech: an automated methodology and novel application to schizophrenia. Schizophr. Res. 93, 304–316 (2007).
pubmed: 17433866 pmcid: 1995127 doi: 10.1016/j.schres.2007.03.001
Elvevåg, B., Foltz, P. W., Rosenstein, M. & Delisi, L. E. An automated method to analyze language use in patients with schizophrenia and their first-degree relatives. J. Neurolinguistics 23, 270–284 (2010).
pubmed: 20383310 pmcid: 2850213 doi: 10.1016/j.jneuroling.2009.05.002
Halliday, M. A. K. & Hasan, R. Language, Context, and Text: Aspects of Language in a Social-semiotic Perspective (Deakin University Press, 1985).
Figueroa, A. Análisis pragmalingüístico de los marcadores de coherencia en el discurso de sujetos con esquizofrenia crónica y de primer episodio (Universidad de Valladolid, 2015).
Frau, F. et al. Can language detect different clinical profiles in schizophrenia? A semi-automated analysis on Italian-speaking patients. In Architectures and Mechanisms for Language Processing. https://amlap2021.github.io/program/174.pdf (AMLaP, 2021).
Wouts, J. et al. belabBERT: a Dutch RoBERTa-based language model applied to psychiatric classification. Preprint at https://arxiv.org/abs/2106.01091 (2021).
Mota, N. B. et al. Speech graphs provide a quantitative measure of thought disorder in psychosis. PLoS ONE 7, e34928 (2012).
pubmed: 22506057 pmcid: 3322168 doi: 10.1371/journal.pone.0034928
Mota, N. B., Copelli, M. & Ribeiro, S. Thought disorder measured as random speech structure classifies negative symptoms and schizophrenia diagnosis 6 months in advance. npj Schizophr. 3, 18 (2017).
pubmed: 28560264 pmcid: 5441540 doi: 10.1038/s41537-017-0019-3
Kay, S. R. et al. SCID-PANSS: two-tier diagnostic system for psychotic disorders. Compr. Psychiatry 32, 355–361 (1991).
pubmed: 1935026 doi: 10.1016/0010-440X(91)90085-Q
Kay, S. R. Positive and negative syndromes in schizophrenia: assessment and research (No. 5) (Brunner/Mazel, 1991).
Hinzen, W. & Rosselló, J. The linguistics of schizophrenia: thought disturbance as language pathology across positive symptoms. Front. Psychol. 6, 971 (2015).
pubmed: 26236257 pmcid: 4503928
Docherty, N., Schnur, M. & Harvey, P. D. Reference performance and positive and negative thought disorder: a follow-up study of manics and schizophrenics. J. Abnorm. Psychol. 97, 437–442 (1988).
pubmed: 3204229 doi: 10.1037/0021-843X.97.4.437
Spironelli, C. & Angrilli, A. Language-related gamma EEG frontal reduction is associated with positive symptoms in schizophrenia patients. Schizophr. Res. 165, 22–29 (2015).
pubmed: 25913900 doi: 10.1016/j.schres.2015.04.003
Kemper, S., Marquis, J. & Thompson, M. Longitudinal change in language production: effects of aging and dementia on grammatical complexity and propositional content. Psychol. Aging 16, 600–614 (2001).
pubmed: 11766915 doi: 10.1037/0882-7974.16.4.600
Burke, D. M. & Shafto, M. A. Aging and language production. Curr. Dir. Psychol. Sci. 13, 21–24 (2004).
pubmed: 18414600 pmcid: 2293308 doi: 10.1111/j.0963-7214.2004.01301006.x
Chao, Y. R. & Zipf, G. K. Human Behavior and the Principle of Least Effort: An Introduction to Human Ecology, Vol. 26 (Addison-Wesley, 1949).
Chaika, E. & Lambe, R. The locus of dysfunction in schizophrenic speech. Schizophr. Bull. 11, 8–15 (1985).
doi: 10.1093/schbul/11.1.8
Chaika, E. Linguistics, Pragmatics and Psychotherapy: A Guide for Therapists (John Wiley & Sons, 2008).
Piro, S. El lenguaje esquizofrénico (Fondo de Cultura Economica USA, 1987).
Minor, K. S., Marggraf, M. P., Davis, B. J., Mehdiyoun, N. F. & Breier, A. Affective systems induce formal thought disorder in early-stage psychosis. J. Abnorm. Psychol. 125, 537–542 (2016).
pubmed: 26999283 doi: 10.1037/abn0000156
Kambeitz, J. et al. Detecting neuroimaging biomarkers for schizophrenia: a meta-analysis of multivariate pattern recognition studies. Neuropsychopharmacology 40, 1742–1751 (2015).
pubmed: 25601228 pmcid: 4915258 doi: 10.1038/npp.2015.22
Shim, M., Hwang, H.-J., Kim, D.-W., Lee, S.-H. & Im, C.-H. Machine-learning-based diagnosis of schizophrenia using combined sensor-level and source-level EEG features. Schizophr. Res. 176, 314–319 (2016).
pubmed: 27427557 doi: 10.1016/j.schres.2016.05.007
Zeng, L.-L. et al. Multi-site diagnostic classification of schizophrenia using discriminant deep learning with functional connectivity MRI. EBioMedicine 30, 74–85 (2018).
pubmed: 29622496 pmcid: 5952341 doi: 10.1016/j.ebiom.2018.03.017
San Martín Núñez, A. & Guerrero González, S. Estudio Sociolingüístico del Español de Chile (ESECH): recogida y estratificación del corpus de Santiago. Bol. filol. 50, 221–247 (2015).
doi: 10.4067/S0718-93032015000100009
Zhang, D., Xu, H., Su, Z. & Xu, Y. Chinese comments sentiment classification based on word2vec and SVMperf. Expert Syst. Appl. 42, 1857–1863 (2015).
doi: 10.1016/j.eswa.2014.09.011
Lilleberg, J., Zhu, Y. & Zhang, Y. Support vector machines and Word2vec for text classification with semantic features. In 2015 IEEE 14th International Conference on Cognitive Informatics & Cognitive Computing (ICCI*CC) (IEEE, 2015).
Rehurek, R. & Sojka, P. Software framework for topic modelling with large corpora. In Proceedings of the LREC 2010 Workshop on New Challenges for NLP Frameworks. http://www.lrec-conf.org/proceedings/lrec2010/workshops/W10.pdf (ELRA, 2010).
Genuer, R., Poggi, J.-M. & Tuleau-Malot, C. Variable selection using random forests. Pattern Recognit. Lett. 31, 2225–2236 (2010).
doi: 10.1016/j.patrec.2010.03.014

Auteurs

Alicia Figueroa-Barra (A)

Department of Psychiatry, Faculty of Medicine, Universidad de Chile, Santiago, Chile. aliciafigueroa@uchile.cl.
Biomedical Neuroscience Institute, Santiago, Chile. aliciafigueroa@uchile.cl.
Millennium Nucleus to Improve the Mental Health of Adolescents and Youths (IMHAY), Santiago, Chile. aliciafigueroa@uchile.cl.
Translational Psychiatry Laboratory Psiquislab, Faculty of Medicine, Universidad de Chile, Santiago, Chile. aliciafigueroa@uchile.cl.

Daniel Del Aguila (D)

Artificial Intelligence Development Department, BiosIntelligence, GrupoBios, Santiago, Chile.

Mauricio Cerda (M)

Biomedical Neuroscience Institute, Santiago, Chile.
Integrative Biology Program, Institute of Biomedical Sciences, Faculty of Medicine, Universidad de Chile, Santiago, Chile.
Center for Medical Informatics and Telemedicine, Faculty of Medicine, Universidad de Chile, Santiago, Chile.

Pablo A Gaspar (PA)

Department of Psychiatry, Faculty of Medicine, Universidad de Chile, Santiago, Chile.
Biomedical Neuroscience Institute, Santiago, Chile.
Millennium Nucleus to Improve the Mental Health of Adolescents and Youths (IMHAY), Santiago, Chile.
Translational Psychiatry Laboratory Psiquislab, Faculty of Medicine, Universidad de Chile, Santiago, Chile.
Department of Neuroscience, Faculty of Medicine, Universidad de Chile, Santiago, Chile.

Lucas D Terissi (LD)

Laboratory for System Dynamics & Signal Processing, Universidad Nacional de Rosario and CIFASIS, Santa Fe, Argentina.

Manuel Durán (M)

Integrative Biology Program, Institute of Biomedical Sciences, Faculty of Medicine, Universidad de Chile, Santiago, Chile.
Center for Medical Informatics and Telemedicine, Faculty of Medicine, Universidad de Chile, Santiago, Chile.

Camila Valderrama (C)

Integrative Biology Program, Institute of Biomedical Sciences, Faculty of Medicine, Universidad de Chile, Santiago, Chile.
Center for Medical Informatics and Telemedicine, Faculty of Medicine, Universidad de Chile, Santiago, Chile.

Classifications MeSH