Automatic Detection of Twitter Users Who Express Chronic Stress Experiences via Supervised Machine Learning and Natural Language Processing.
Journal
Computers, informatics, nursing : CIN
ISSN: 1538-9774
Titre abrégé: Comput Inform Nurs
Pays: United States
ID NLM: 101141667
Informations de publication
Date de publication:
01 Sep 2023
01 Sep 2023
Historique:
medline:
11
9
2023
pubmed:
30
11
2022
entrez:
29
11
2022
Statut:
epublish
Résumé
Americans bear a high chronic stress burden, particularly during the COVID-19 pandemic. Although social media have many strengths to complement the weaknesses of conventional stress measures, including surveys, they have been rarely utilized to detect individuals self-reporting chronic stress. Thus, this study aimed to develop and evaluate an automatic system on Twitter to identify users who have self-reported chronic stress experiences. Using the Twitter public streaming application programming interface, we collected tweets containing certain stress-related keywords (eg, "chronic," "constant," "stress") and then filtered the data using pre-defined text patterns. We manually annotated tweets with (without) self-report of chronic stress as positive (negative). We trained multiple classifiers and tested them via accuracy and F1 score. We annotated 4195 tweets (1560 positives, 2635 negatives), achieving an inter-annotator agreement of 0.83 (Cohen's kappa). The classifier based on Bidirectional Encoder Representation from Transformers performed the best (accuracy of 83.6% [81.0-86.1]), outperforming the second best-performing classifier (support vector machines: 76.4% [73.5-79.3]). The past tweets from the authors of positive tweets contained useful information, including sources and health impacts of chronic stress. Our study demonstrates that users' self-reported chronic stress experiences can be automatically identified on Twitter, which has a high potential for surveillance and large-scale intervention.
Identifiants
pubmed: 36445331
doi: 10.1097/CIN.0000000000000985
pii: 00024665-202309000-00012
pmc: PMC10510804
doi:
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Pagination
717-724Subventions
Organisme : NINR NIH HHS
ID : K01 NR019651
Pays : United States
Informations de copyright
Copyright © 2022 The Authors. Published by Wolters Kluwer Health, Inc.
Références
J Med Internet Res. 2017 Aug 14;19(8):e289
pubmed: 28807891
Proc ACM Hum Comput Interact. 2019 Nov;3(CSCW):
pubmed: 32935081
Neurosci Biobehav Rev. 2010 Sep;35(1):2-16
pubmed: 19822172
Biomed Inform Insights. 2018 Aug 27;10:1178222618792860
pubmed: 30158822
Ann N Y Acad Sci. 2008 Oct;1141:105-30
pubmed: 18991954
Depress Anxiety. 2009;26(8):718-23
pubmed: 19496077
Chronic Stress (Thousand Oaks). 2017 Jan-Dec;1:
pubmed: 29862379
JAMIA Open. 2021 Jun 23;4(2):ooab042
pubmed: 34169232
Front Neuroendocrinol. 2018 Apr;49:146-169
pubmed: 29551356
AMIA Jt Summits Transl Sci Proc. 2018 May 18;2017:188-196
pubmed: 29888070
JMIR Public Health Surveill. 2017 Jun 13;3(2):e35
pubmed: 28611016
BMC Med Inform Decis Mak. 2021 Jan 26;21(1):27
pubmed: 33499852