Iterative guided machine learning-assisted systematic literature reviews: a diabetes case study.

Applied case study Machine learning Machine learning configurations Natural language processing Systematic review screening Transfer learning

Journal

Systematic reviews
ISSN: 2046-4053
Titre abrégé: Syst Rev
Pays: England
ID NLM: 101580575

Informations de publication

Date de publication:
02 04 2021
Historique:
received: 22 07 2020
accepted: 19 03 2021
entrez: 3 4 2021
pubmed: 4 4 2021
medline: 6 7 2021
Statut: epublish

Résumé

Systematic Reviews (SR), studies of studies, use a formal process to evaluate the quality of scientific literature and determine ensuing effectiveness from qualifying articles to establish consensus findings around a hypothesis. Their value is increasing as the conduct and publication of research and evaluation has expanded and the process of identifying key insights becomes more time consuming. Text analytics and machine learning (ML) techniques may help overcome this problem of scale while still maintaining the level of rigor expected of SRs. In this article, we discuss an approach that uses existing examples of SRs to build and test a method for assisting the SR title and abstract pre-screening by reducing the initial pool of potential articles down to articles that meet inclusion criteria. Our approach differs from previous approaches to using ML as a SR tool in that it incorporates ML configurations guided by previously conducted SRs, and human confirmation on ML predictions of relevant articles during multiple iterative reviews on smaller tranches of citations. We applied the tailored method to a new SR review effort to validate performance. The case study test of the approach proved a sensitivity (recall) in finding relevant articles during down selection that may rival many traditional processes and show ability to overcome most type II errors. The study achieved a sensitivity of 99.5% (213 out of 214) of total relevant articles while only conducting a human review of 31% of total articles available for review. We believe this iterative method can help overcome bias in initial ML model training by having humans reinforce ML models with new and relevant information, and is an applied step towards transfer learning for ML in SR.

Sections du résumé

BACKGROUND
Systematic Reviews (SR), studies of studies, use a formal process to evaluate the quality of scientific literature and determine ensuing effectiveness from qualifying articles to establish consensus findings around a hypothesis. Their value is increasing as the conduct and publication of research and evaluation has expanded and the process of identifying key insights becomes more time consuming. Text analytics and machine learning (ML) techniques may help overcome this problem of scale while still maintaining the level of rigor expected of SRs.
METHODS
In this article, we discuss an approach that uses existing examples of SRs to build and test a method for assisting the SR title and abstract pre-screening by reducing the initial pool of potential articles down to articles that meet inclusion criteria. Our approach differs from previous approaches to using ML as a SR tool in that it incorporates ML configurations guided by previously conducted SRs, and human confirmation on ML predictions of relevant articles during multiple iterative reviews on smaller tranches of citations. We applied the tailored method to a new SR review effort to validate performance.
RESULTS
The case study test of the approach proved a sensitivity (recall) in finding relevant articles during down selection that may rival many traditional processes and show ability to overcome most type II errors. The study achieved a sensitivity of 99.5% (213 out of 214) of total relevant articles while only conducting a human review of 31% of total articles available for review.
CONCLUSIONS
We believe this iterative method can help overcome bias in initial ML model training by having humans reinforce ML models with new and relevant information, and is an applied step towards transfer learning for ML in SR.

Identifiants

pubmed: 33810798
doi: 10.1186/s13643-021-01640-6
pii: 10.1186/s13643-021-01640-6
pmc: PMC8017891
doi:

Types de publication

Journal Article Systematic Review

Langues

eng

Sous-ensembles de citation

IM

Pagination

97

Références

BMC Med Res Methodol. 2018 Jan 10;18(1):5
pubmed: 29316881
BMC Bioinformatics. 2010 Jan 26;11:55
pubmed: 20102628
J Clin Epidemiol. 2017 Nov;91:31-37
pubmed: 28912003
Syst Rev. 2019 Jan 15;8(1):23
pubmed: 30646959
SAR QSAR Environ Res. 2006 Jun;17(3):337-52
pubmed: 16815772
Syst Rev. 2014 Jul 09;3:74
pubmed: 25005128
Scientometrics. 2010 Sep;84(3):575-603
pubmed: 20700371
Front Physiol. 2018 Jul 03;9:835
pubmed: 30018571
Mach Learn Knowl Discov Databases. 2014;8725:225-239
pubmed: 26023687

Auteurs

John Zimmerman (J)

Deloitte Consulting, LLP, 191 Peachtree Street, Atlanta, GA, 30303, USA. Jzimmerman@deloitte.com.

Robin E Soler (RE)

Centers for Disease Control and Prevention, National Center for Chronic Disease Prevention and Health Promotion, Division of Diabetes Translation, 1600 Clifton Rd, Atlanta, GA, USA.

James Lavinder (J)

Deloitte Consulting, LLP, 191 Peachtree Street, Atlanta, GA, 30303, USA.

Sarah Murphy (S)

Deloitte Consulting, LLP, 191 Peachtree Street, Atlanta, GA, 30303, USA.

Charisma Atkins (C)

Centers for Disease Control and Prevention, National Center for Chronic Disease Prevention and Health Promotion, Division of Diabetes Translation, 1600 Clifton Rd, Atlanta, GA, USA.

LaShonda Hulbert (L)

Centers for Disease Control and Prevention, National Center for Chronic Disease Prevention and Health Promotion, Division of Diabetes Translation, 1600 Clifton Rd, Atlanta, GA, USA.

Richard Lusk (R)

Deloitte Consulting, LLP, 191 Peachtree Street, Atlanta, GA, 30303, USA.

Boon Peng Ng (BP)

Centers for Disease Control and Prevention, National Center for Chronic Disease Prevention and Health Promotion, Division of Diabetes Translation, 1600 Clifton Rd, Atlanta, GA, USA.
College of Nursing & Disability, Aging and Technology Cluster, University of Central Florida, 12201 Research Pkwy Suite 300, Orlando, FL, USA.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH