Using Natural Language Processing to Examine the Uptake, Content, and Readability of Media Coverage of a Pan-Canadian Drug Safety Research Project: Cross-Sectional Observational Study.

knowledge translation mass media natural language processing pharmacoepidemiology readability

Journal

JMIR formative research
ISSN: 2561-326X
Titre abrégé: JMIR Form Res
Pays: Canada
ID NLM: 101726394

Informations de publication

Date de publication:
14 Jan 2020
Historique:
received: 03 01 2019
accepted: 26 09 2019
revised: 11 07 2019
entrez: 15 1 2020
pubmed: 15 1 2020
medline: 15 1 2020
Statut: epublish

Résumé

Isotretinoin, for treating cystic acne, increases the risk of miscarriage and fetal abnormalities when taken during pregnancy. The Health Canada-approved product monograph for isotretinoin includes pregnancy prevention guidelines. A recent study by the Canadian Network for Observational Drug Effect Studies (CNODES) on the occurrence of pregnancy and pregnancy outcomes during isotretinoin therapy estimated poor adherence to these guidelines. Media uptake of this study was unknown; awareness of this uptake could help improve drug safety communication. The aim of this study was to understand how the media present pharmacoepidemiological research using the CNODES isotretinoin study as a case study. Google News was searched (April 25-May 6, 2016), using a predefined set of terms, for mention of the CNODES study. In total, 26 articles and 3 CNODES publications (original article, press release, and podcast) were identified. The article texts were cleaned (eg, advertisements and links removed), and the podcast was transcribed. A dictionary of 1295 unique words was created using natural language processing (NLP) techniques (term frequency-inverse document frequency, Porter stemming, and stop-word filtering) to identify common words and phrases. Similarity between the articles and reference publications was calculated using Euclidian distance; articles were grouped using hierarchical agglomerative clustering. Nine readability scales were applied to measure text readability based on factors such as number of words, difficult words, syllables, sentence counts, and other textual metrics. The top 5 dictionary words were pregnancy (250 appearances), isotretinoin (220), study (209), drug (201), and women (185). Three distinct clusters were identified: Clusters 2 (5 articles) and 3 (4 articles) were from health-related websites and media, respectively; Cluster 1 (18 articles) contained largely media sources; 2 articles fell outside these clusters. Use of the term isotretinoin versus Accutane (a brand name of isotretinoin), discussion of pregnancy complications, and assignment of responsibility for guideline adherence varied between clusters. For example, the term pregnanc appeared most often in Clusters 1 (14.6 average times per article) and 2 (11.4) and relatively infrequently in Cluster 3 (1.8). Average readability for all articles was high (eg, Flesch-Kincaid, 13; Gunning Fog, 15; SMOG Index, 10; Coleman Liau Index, 15; Linsear Write Index, 13; and Text Standard, 13). Readability increased from Cluster 2 (Gunning Fog of 16.9) to 3 (12.2). It varied between clusters (average 13th-15th grade) but exceeded the recommended health information reading level (grade 6th to 8th), overall. Media interpretation of the CNODES study varied, with differences in synonym usage and areas of focus. All articles were written above the recommended health information reading level. Analyzing media using NLP techniques can help determine drug safety communication effectiveness. This project is important for understanding how drug safety studies are taken up and redistributed in the media.

Sections du résumé

BACKGROUND BACKGROUND
Isotretinoin, for treating cystic acne, increases the risk of miscarriage and fetal abnormalities when taken during pregnancy. The Health Canada-approved product monograph for isotretinoin includes pregnancy prevention guidelines. A recent study by the Canadian Network for Observational Drug Effect Studies (CNODES) on the occurrence of pregnancy and pregnancy outcomes during isotretinoin therapy estimated poor adherence to these guidelines. Media uptake of this study was unknown; awareness of this uptake could help improve drug safety communication.
OBJECTIVE OBJECTIVE
The aim of this study was to understand how the media present pharmacoepidemiological research using the CNODES isotretinoin study as a case study.
METHODS METHODS
Google News was searched (April 25-May 6, 2016), using a predefined set of terms, for mention of the CNODES study. In total, 26 articles and 3 CNODES publications (original article, press release, and podcast) were identified. The article texts were cleaned (eg, advertisements and links removed), and the podcast was transcribed. A dictionary of 1295 unique words was created using natural language processing (NLP) techniques (term frequency-inverse document frequency, Porter stemming, and stop-word filtering) to identify common words and phrases. Similarity between the articles and reference publications was calculated using Euclidian distance; articles were grouped using hierarchical agglomerative clustering. Nine readability scales were applied to measure text readability based on factors such as number of words, difficult words, syllables, sentence counts, and other textual metrics.
RESULTS RESULTS
The top 5 dictionary words were pregnancy (250 appearances), isotretinoin (220), study (209), drug (201), and women (185). Three distinct clusters were identified: Clusters 2 (5 articles) and 3 (4 articles) were from health-related websites and media, respectively; Cluster 1 (18 articles) contained largely media sources; 2 articles fell outside these clusters. Use of the term isotretinoin versus Accutane (a brand name of isotretinoin), discussion of pregnancy complications, and assignment of responsibility for guideline adherence varied between clusters. For example, the term pregnanc appeared most often in Clusters 1 (14.6 average times per article) and 2 (11.4) and relatively infrequently in Cluster 3 (1.8). Average readability for all articles was high (eg, Flesch-Kincaid, 13; Gunning Fog, 15; SMOG Index, 10; Coleman Liau Index, 15; Linsear Write Index, 13; and Text Standard, 13). Readability increased from Cluster 2 (Gunning Fog of 16.9) to 3 (12.2). It varied between clusters (average 13th-15th grade) but exceeded the recommended health information reading level (grade 6th to 8th), overall.
CONCLUSIONS CONCLUSIONS
Media interpretation of the CNODES study varied, with differences in synonym usage and areas of focus. All articles were written above the recommended health information reading level. Analyzing media using NLP techniques can help determine drug safety communication effectiveness. This project is important for understanding how drug safety studies are taken up and redistributed in the media.

Identifiants

pubmed: 31934872
pii: v4i1e13296
doi: 10.2196/13296
pmc: PMC6996767
doi:

Types de publication

Journal Article

Langues

eng

Pagination

e13296

Commentaires et corrections

Type : ErratumIn

Informations de copyright

©Hossein Mohammadhassanzadeh, Ingrid Sketris, Robyn Traynor, Susan Alexander, Brandace Winquist, Samuel Alan Stewart. Originally published in JMIR Formative Research (http://formative.jmir.org), 14.01.2020.

Références

Proc Natl Acad Sci U S A. 2013 Aug 20;110 Suppl 3:14062-8
pubmed: 23942122
Phys Sportsmed. 2014 Nov;42(4):125-30
pubmed: 25419896
AJNR Am J Neuroradiol. 2014 Jul;35(7):1270-5
pubmed: 24763420
J Biomed Inform. 2009 Oct;42(5):760-72
pubmed: 19683066
Pharmacoepidemiol Drug Saf. 2013 Aug;22(8):834-41
pubmed: 23554109
Lancet. 1983 Aug 27;2(8348):513
pubmed: 6136666
Drug Saf. 2004;27(14):1069-80
pubmed: 15554743
CMAJ. 2004 Apr 27;170(9):1399-407
pubmed: 15111473
J Am Acad Dermatol. 2011 Dec;65(6):1117-25
pubmed: 21565419
N Engl J Med. 2000 Jun 1;342(22):1645-50
pubmed: 10833211
CMAJ. 2016 Jul 12;188(10):723-730
pubmed: 27114489
Crit Rev Food Sci Nutr. 2017 Sep 2;57(13):2811-2824
pubmed: 26558421
N Engl J Med. 2009 Jan 1;360(1):1-3
pubmed: 19118299
Health Expect. 2017 Apr;20(2):309-320
pubmed: 27145430
J Am Med Inform Assoc. 2009 May-Jun;16(3):328-37
pubmed: 19261932
Otolaryngol Head Neck Surg. 2012 Nov;147(5):848-54
pubmed: 22864405
Open Med. 2012 Oct 30;6(4):e134-40
pubmed: 23687528
BMC Public Health. 2017 Apr 20;17(1):343
pubmed: 28427426
BMJ. 2002 Apr 13;324(7342):886-91
pubmed: 11950740
JAMA Surg. 2016 Sep 1;151(9):831-7
pubmed: 27144966
J Biomed Inform. 2005 Aug;38(4):314-21
pubmed: 16084473
Br J Dermatol. 2011 Feb;164(2):238-44
pubmed: 20716214
BMC Public Health. 2013 Feb 18;13:150
pubmed: 23419015
J Gen Intern Med. 2003 Feb;18(2):153-4
pubmed: 12542591
Int J Epidemiol. 2018 Apr 1;47(2):617-624
pubmed: 29420741
BMJ Open. 2017 Nov 17;7(11):e017425
pubmed: 29151047
CMAJ. 2013 Mar 19;185(5):411-3
pubmed: 23296582
Public Health. 2008 Aug;122(8):747-53
pubmed: 18561966
Health Educ Behav. 2006 Jun;33(3):352-73
pubmed: 16699125
Infect Control Hosp Epidemiol. 1998 Feb;19(2):94-100
pubmed: 9510106
Health Commun. 2018 Feb;33(2):174-187
pubmed: 27983868
Bioinformatics. 2008 Mar 1;24(5):719-20
pubmed: 18024473
Eur J Clin Pharmacol. 2006 Aug;62(8):667-74
pubmed: 16791584
Pharmacoepidemiol Drug Saf. 2020 Jan;29 Suppl 1:93-102
pubmed: 29575351
N Engl J Med. 1985 Oct 3;313(14):837-41
pubmed: 3162101

Auteurs

Hossein Mohammadhassanzadeh (H)

Dalhousie University, Halifax, NS, Canada.

Ingrid Sketris (I)

Dalhousie University, Halifax, NS, Canada.

Robyn Traynor (R)

Dalhousie University, Halifax, NS, Canada.

Susan Alexander (S)

Nova Scotia Health Authority, Halifax, NS, Canada.

Brandace Winquist (B)

University of Saskatchewan, Saskatoon, SK, Canada.

Samuel Alan Stewart (SA)

Dalhousie University, Halifax, NS, Canada.

Classifications MeSH