Automatic Detection of Twitter Users Who Express Chronic Stress Experiences via Supervised Machine Learning and Natural Language Processing.


Journal

Computers, informatics, nursing : CIN
ISSN: 1538-9774
Titre abrégé: Comput Inform Nurs
Pays: United States
ID NLM: 101141667

Informations de publication

Date de publication:
01 Sep 2023
Historique:
medline: 11 9 2023
pubmed: 30 11 2022
entrez: 29 11 2022
Statut: epublish

Résumé

Americans bear a high chronic stress burden, particularly during the COVID-19 pandemic. Although social media have many strengths to complement the weaknesses of conventional stress measures, including surveys, they have been rarely utilized to detect individuals self-reporting chronic stress. Thus, this study aimed to develop and evaluate an automatic system on Twitter to identify users who have self-reported chronic stress experiences. Using the Twitter public streaming application programming interface, we collected tweets containing certain stress-related keywords (eg, "chronic," "constant," "stress") and then filtered the data using pre-defined text patterns. We manually annotated tweets with (without) self-report of chronic stress as positive (negative). We trained multiple classifiers and tested them via accuracy and F1 score. We annotated 4195 tweets (1560 positives, 2635 negatives), achieving an inter-annotator agreement of 0.83 (Cohen's kappa). The classifier based on Bidirectional Encoder Representation from Transformers performed the best (accuracy of 83.6% [81.0-86.1]), outperforming the second best-performing classifier (support vector machines: 76.4% [73.5-79.3]). The past tweets from the authors of positive tweets contained useful information, including sources and health impacts of chronic stress. Our study demonstrates that users' self-reported chronic stress experiences can be automatically identified on Twitter, which has a high potential for surveillance and large-scale intervention.

Identifiants

pubmed: 36445331
doi: 10.1097/CIN.0000000000000985
pii: 00024665-202309000-00012
pmc: PMC10510804
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

717-724

Subventions

Organisme : NINR NIH HHS
ID : K01 NR019651
Pays : United States

Informations de copyright

Copyright © 2022 The Authors. Published by Wolters Kluwer Health, Inc.

Références

J Med Internet Res. 2017 Aug 14;19(8):e289
pubmed: 28807891
Proc ACM Hum Comput Interact. 2019 Nov;3(CSCW):
pubmed: 32935081
Neurosci Biobehav Rev. 2010 Sep;35(1):2-16
pubmed: 19822172
Biomed Inform Insights. 2018 Aug 27;10:1178222618792860
pubmed: 30158822
Ann N Y Acad Sci. 2008 Oct;1141:105-30
pubmed: 18991954
Depress Anxiety. 2009;26(8):718-23
pubmed: 19496077
Chronic Stress (Thousand Oaks). 2017 Jan-Dec;1:
pubmed: 29862379
JAMIA Open. 2021 Jun 23;4(2):ooab042
pubmed: 34169232
Front Neuroendocrinol. 2018 Apr;49:146-169
pubmed: 29551356
AMIA Jt Summits Transl Sci Proc. 2018 May 18;2017:188-196
pubmed: 29888070
JMIR Public Health Surveill. 2017 Jun 13;3(2):e35
pubmed: 28611016
BMC Med Inform Decis Mak. 2021 Jan 26;21(1):27
pubmed: 33499852

Auteurs

Yuan-Chi Yang (YC)

Author Affiliations: Department of Biomedical Informatics, School of Medicine (Drs Yang, Al-Garadi, and Sarker, and Ms Xie), and Nell Hodgson Woodruff School of Nursing (Ms Hair), Emory University, Atlanta, GA.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH