How to apply zero-shot learning to text data in substance use research: An overview and tutorial with media data.

Artificial intelligence machine-learning media social media tutorial zero-shot learning

Journal

Addiction (Abingdon, England)

ISSN: 1360-0443

Titre abrégé: Addiction

Pays: England

ID NLM: 9304118

Informations de publication

Date de publication:
11 Jan 2024

Historique:

received: 14 06 2023

accepted: 13 12 2023

medline: 12 1 2024

pubmed: 12 1 2024

entrez: 12 1 2024

Statut: aheadofprint

Résumé

A vast amount of media-related text data is generated daily in the form of social media posts, news stories or academic articles. These text data provide opportunities for researchers to analyse and understand how substance-related issues are being discussed. The main methods to analyse large text data (content analyses or specifically trained deep-learning models) require substantial manual annotation and resources. A machine-learning approach called 'zero-shot learning' may be quicker, more flexible and require fewer resources. Zero-shot learning uses models trained on large, unlabelled (or weakly labelled) data sets to classify previously unseen data into categories on which the model has not been specifically trained. This means that a pre-existing zero-shot learning model can be used to analyse media-related text data without the need for task-specific annotation or model training. This approach may be particularly important for analysing data that is time critical. This article describes the relatively new concept of zero-shot learning and how it can be applied to text data in substance use research, including a brief practical tutorial.

Identifiants

DOI: 10.1111/add.16427 PMID: 38212974

pubmed: 38212974

doi: 10.1111/add.16427

doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

Subventions

Organisme : La Trobe University SPPH internal grant

ID : NA

Organisme : Australian Research Council Discovery Early Career Researcher Award

ID : DE230100659

Informations de copyright

Références

Meyer R. How many stories do newspapers publish per day? The Atlantic 2016. Available at: https://www.theatlantic.com/technology/archive/2016/05/how-many-stories-do-newspapers-publish-per-day/483845/ Accessed 21 Mar 2023.

Benitez C. 20 Spotify statistics 2023: usage, revenue & more: Tone Island. Available at: https://toneisland.com/spotify-statistics/#:~:text=Spotify%20uploads%2060%2C000%20new%20tracks%20every%20day.,-Spotify%20confirms%20through&text=That%20amounts%20to%2022%20million,million%20tracks%20in%20its%20database Accessed 21 Mar 2023.

Twitter Blog. The 2014 #YearOnTwitter 2014. Available at: https://blog.twitter.com/official/en_us/a/2014/the-2014-yearontwitter.html Accessed 21 Mar 2023.

Christenson P, Roberts DF, Bjork N. Booze, drugs, and pop music: trends in substance portrayals in the Billboard top 100-1968-2008. Subst Use Misuse. 2012;47:121-129.

Alhabash S, VanDam C, Tan P-N, Smith SW, Viken G, Kanver D, et al. 140 characters of intoxication: exploring the prevalence of alcohol-related tweets and predicting their virality. SAGE Open. 2018;8:2158244018803137.

Cavazos-Rehg PA, Krauss MJ, Sowles SJ, Bierut LJ. ‘Hey everyone, I’m drunk’. An evaluation of drinking-related Twitter chatter. J Stud Alcohol Drugs. 2015;76:635-643.

Rutherford BN, Lim CC, Johnson B, Cheng B, Chung J, Huang S, et al. #TurntTrending: a systematic review of substance use portrayals on social media platforms. Addiction. 2023;118:206-217.

Wright LA, Golder S, Balkham A, McCambridge J. Understanding public opinion to the introduction of minimum unit pricing in Scotland: a qualitative study using twitter. BMJ Open. 2019;9:e029690.

Rychert M, Wilkins C. Referendum campaigns in hybrid media systems: insights from the New Zealand cannabis legalisation referendum. Media Commun. 2023;11:56-68.

Curtis BL, Lookatch SJ, Ramo DE, McKay JR, Feinn RS, Kranzler HR. Meta-analysis of the association of alcohol-related social media use with alcohol consumption and alcohol-related problems in adolescents and young adults. Alcohol Clin Exp Res. 2018;42:978-986.

Cheng B, Lim CC, Rutherford BN, Huang S, Ashley DP, Johnson B, et al. A systematic review and meta-analysis of the relationship between youth drinking, self-posting of alcohol use and other social media engagement (2012-21). Addiction. 2024;119:28-46.

Cristello JV, Litt DM, Sutherland MT, Trucco EM. Subjective norms as a mediator between exposure to online alcohol and marijuana content and offline use among adolescents. Drug Alcohol Rev. 2023. https://doi.org/10.1111/dar.13620

Holody KJ, Anderson C, Craig C, Flynn M. ‘Drunk in love’: the portrayal of risk behavior in music lyrics. J Health Commun. 2016;21:1098-1106.

Merrill J, Riordan B, Ward RM, Raubenheimer J. Using Twitter post data to ascertain the sentiment of alcohol-related blackouts in the United States. Proceedings of the Annual Hawaii International Conference on System Sciences 2023;124:107110.

Lim CC, Sun T, Gartner C, Connor J, Fahmi M, Hall W, et al. What is the hype on #MedicinalCannabis in the United States? A content analysis of medicinal cannabis tweets. Drug Alcohol Rev. 2023. https://doi.org/10.1111/dar.13618

Mathieson S, O’Keeffe M, Traeger AC, Ferreira GE, Abdel SC. Content and sentiment analysis of gabapentinoid-related tweets: an infodemiology study. Drug Alcohol Rev. 2022. https://doi.org/10.1111/dar.13590

Riordan BC, Winter DT, Haber PS, Day CA, Morley KC. What are people saying on social networking sites about the Australian alcohol consumption guidelines? Med J Aust. 214:105-107.

Arshonsky J, Krawczyk N, Bunting AM, Frank D, Friedman SR, Bragg MA. Informal coping strategies among people who use opioids during COVID-19: thematic analysis of Reddit forums. JMIR Format Res. 2022;6:e32871.

Chenworth M, Perrone J, Love JS, Greller HA, Sarker A, Chai PR. Buprenorphine initiation in the emergency department: a thematic content analysis of a #firesidetox tweetchat. J Med Toxicol. 2020;16:262-268.

Kuntsche E, Patsouras M, He Z, Riordan B. Artificial intelligence in substance use research. In: Franken I, Wiers R, Witkiewitz K, editors. Handbook of Addiction Psychology Thousand Oaks, CA: Sage; 2023.

Riordan BC, Merrill JE, Ward RM, Raubenheimer J. When are alcohol-related blackout Tweets written in the United States? Addict Behav. 2022;124:107110.

van Draanen J, Tao H, Gupta S, Liu S. Geographic differences in cannabis conversations on Twitter: infodemiology study. JMIR Public Health Surveill. 2020;6:e18540.

Devlin J, Chang M-W, Lee K, Toutanova K. Bert: pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:181004805. 2018.

Brown T, Mann B, Ryder N, Subbiah M, Kaplan JD, Dhariwal P, et al. Language models are few-shot learners. Adv Neural Inform Process Syst. 2020;33:1877-1901.

Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, et al. Attention is all you need. Adv Neural Inf Process Syst. 2017;30:5998-6008.

Riordan BC, Raubenheimer J, Ward RM, Merrill JE, Winter T, Scarf D. Monitoring the sentiment of cannabis-related tweets in the lead up to New Zealand’s cannabis referendum. Drug Alcohol Rev. 2020;40:835-841.

Lewis M, Liu Y, Goyal N, Ghazvininejad M, Mohamed A, Levy O, et al. Bart: denoising sequence-to-sequence pre-training for natural language generation, translation, and comprehension. arXiv preprint arXiv:191013461. 2019.

Touvron H, Martin L, Stone K, Albert P, Almahairi A, Babaei Y, et al. Llama 2: open foundation and fine-tuned chat models. arXiv preprint arXiv:230709288. 2023.

Ward RM, Riordan BC, Merrill JE, Raubenheimer J. Describing the impact of the COVID-19 pandemic on alcohol-induced blackout tweets. Drug Alcohol Rev. 2021;40:192-195.

Bonela AA, Nibali A, He Z, Riordan B, Anderson-Luxford D, Kuntsche E. The promise of zero-shot learning for alcohol image detection: comparison with a task-specific deep learning algorithm. Sci Rep. 2023;13:11891.

Radford A, Kim JW, Hallacy C, Ramesh A, Goh G, Agarwal S, et al. (2021, July). Learning transferable visual models from natural language supervision. In International conference on machine learning (pp. 8748-8763). PMLR.

Bonela AA, He Z, Norman T, Kuntsche E. Development and validation of the Alcoholic Beverage Identification Deep Learning Algorithm Version 2 (ABIDLA2) for quantifying alcohol exposure in electronic images. Alcohol Clin Exp Res. 2022;46:1837-1845.

Sivarajkumar S, Wang Y. HealthPrompt: a zero-shot learning paradigm for clinical natural language processing. arXiv preprint arXiv:220305061. 2022.

Pushp PK, Srivastava MM. Train once, test anywhere: zero-shot learning for text classification. arXiv preprint arXiv:171205972. 2017.

Pelicon A, Pranjić M, Miljković D, Škrlj B, Pollak S. Zero-shot learning for cross-lingual news sentiment classification. Appl Sci. 2020;10:5993.

Cooper ML. Motivations for alcohol use among adolescents: development and validation of a four-factor model. Psychol Assess. 1994;6:117-128.

Kundu D. Harness the power of LLMs: zero-shot and few-shot prompting. Analysis Vidhya 2023. Available at: https://www.analyticsvidhya.com/blog/2023/09/power-of-llms-zero-shot-and-few-shot-prompting/#h-few-shot-vs-zero-shot Accessed 21 Mar 2023.

Chow A, Perrigo B. The AI arms race is changing everything: time. 2023. Available at: https://time.com/6255952/ai-impact-chatgpt-microsoft-google/ Accessed 21 Mar 2023.

Gurtner M, Smith M, Gage R, Howey-Brown A, Wang X, Latavao T, et al. Objective assessment of the nature and extent of children’s internet-based world: protocol for the kids online Aotearoa study. JMIR Res Protocol. 2022;11:e39017.

LaBrie JW, Trager BM, Boyle SC, Davis JP, Earle AM, Morgan RM. An examination of the prospective associations between objectively assessed exposure to alcohol-related Instagram content, alcohol-specific cognitions, and first-year college drinking. Addict Behav. 2021;119:106948.

Ray PP. ChatGPT: a comprehensive review on background, applications, key challenges, bias, ethics, limitations and future scope. Int Things Cyber-Phys Syst. 2023;3:121-154.

Hartmann J, Schwenzow J, Witte M. The political ideology of conversational AI: converging evidence on ChatGPT’s pro-environmental, left-libertarian orientation. arXiv preprint arXiv:230101768. 2023.

How to apply zero-shot learning to text data in substance use research: An overview and tutorial with media data.

Journal

Informations de publication

Résumé

Identifiants

Types de publication

Langues

Sous-ensembles de citation

Subventions

Informations de copyright

Références

Auteurs

Benjamin Riordan (B)

Abraham Albert Bonela (AA)

Zhen He (Z)

Aiden Nibali (A)

Dan Anderson-Luxford (D)

Emmanuel Kuntsche (E)

Classifications MeSH