Deep forecasting of translational impact in medical research.

deep learning natural language processing representation learning research impact translational research

Journal

Patterns (New York, N.Y.)
ISSN: 2666-3899
Titre abrégé: Patterns (N Y)
Pays: United States
ID NLM: 101767765

Informations de publication

Date de publication:
13 May 2022
Historique:
received: 07 12 2021
revised: 10 01 2022
accepted: 04 03 2022
entrez: 24 5 2022
pubmed: 25 5 2022
medline: 25 5 2022
Statut: epublish

Résumé

The value of biomedical research-a $1.7 trillion annual investment-is ultimately determined by its downstream, real-world impact, whose predictability from simple citation metrics remains unquantified. Here we sought to determine the comparative predictability of future real-world translation-as indexed by inclusion in patents, guidelines, or policy documents-from complex models of title/abstract-level content versus citations and metadata alone. We quantify predictive performance out of sample, ahead of time, across major domains, using the entire corpus of biomedical research captured by Microsoft Academic Graph from 1990-2019, encompassing 43.3 million papers. We show that citations are only moderately predictive of translational impact. In contrast, high-dimensional models of titles, abstracts, and metadata exhibit high fidelity (area under the receiver operating curve [AUROC] > 0.9), generalize across time and domain, and transfer to recognizing papers of Nobel laureates. We argue that content-based impact models are superior to conventional, citation-based measures and sustain a stronger evidence-based claim to the objective measurement of translational potential.

Identifiants

pubmed: 35607619
doi: 10.1016/j.patter.2022.100483
pii: S2666-3899(22)00068-X
pmc: PMC9122964
doi:

Types de publication

Journal Article

Langues

eng

Pagination

100483

Informations de copyright

© 2022 The Authors.

Déclaration de conflit d'intérêts

The authors declare no competing interests.

Références

Nature. 2011 Jan 20;469(7330):299
pubmed: 21248827
Scientometrics. 2017;110(3):1209-1216
pubmed: 28255186
Science. 2017 Aug 11;357(6351):583-587
pubmed: 28798128
Nat Biotechnol. 2021 Oct;39(10):1300-1307
pubmed: 34002098
Scientometrics. 2021;126(1):871-906
pubmed: 32981987
Mov Disord. 2013 Sep;28(10):1370-5
pubmed: 23818421
Sci Data. 2019 Apr 18;6(1):33
pubmed: 31000709
PLoS Biol. 2019 Oct 10;17(10):e3000416
pubmed: 31600189
Scientometrics. 2021;126(1):725-739
pubmed: 33230352
Stud Hist Philos Sci. 2019 Aug;76:13-23
pubmed: 31558205
BMJ. 2000 Apr 22;320(7242):1107-11
pubmed: 10775218
Nature. 2012 Sep 13;489(7415):201-2
pubmed: 22972278
Sci Adv. 2021 Apr 23;7(17):
pubmed: 33893092
NPJ Digit Med. 2019 Dec 6;2:119
pubmed: 31840090
J Appl Physiol (1985). 2020 Oct 1;129(4):967-979
pubmed: 32790596
Nature. 2012 Dec 6;492(7427):34-6
pubmed: 23222591
Nature. 2019 Jul;571(7763):95-98
pubmed: 31270483
J R Soc Med. 2011 Jun;104(6):251-61
pubmed: 21659400
Res Integr Peer Rev. 2020 Feb 03;5:3
pubmed: 32025338
J Med Internet Res. 2012 Oct 18;14(5):e144
pubmed: 23079075
Am J Med. 2003 Apr 15;114(6):477-84
pubmed: 12731504
Science. 2014 Dec 5;346(6214):1155
pubmed: 25477433
Science. 2021 Jan 8;371(6525):128-130
pubmed: 33414211
Bioinformatics. 2020 Feb 15;36(4):1234-1240
pubmed: 31501885
Science. 2017 Apr 7;356(6333):78-81
pubmed: 28360137
Nat Biotechnol. 2018 Jan 10;36(1):31-39
pubmed: 29319684
Br J Cancer. 2008 Jun 17;98(12):1944-50
pubmed: 18521087
Front Big Data. 2019 Dec 03;2:45
pubmed: 33693368
Health Res Policy Syst. 2018 Jun 28;16(1):55
pubmed: 29950167

Auteurs

Amy P K Nelson (APK)

High Dimensional Neurology Group, UCL Queen Square Institute of Neurology, University College London, Russell Square House, Bloomsbury, London WC1B 5EH, UK.

Robert J Gray (RJ)

High Dimensional Neurology Group, UCL Queen Square Institute of Neurology, University College London, Russell Square House, Bloomsbury, London WC1B 5EH, UK.

James K Ruffle (JK)

High Dimensional Neurology Group, UCL Queen Square Institute of Neurology, University College London, Russell Square House, Bloomsbury, London WC1B 5EH, UK.

Henry C Watkins (HC)

High Dimensional Neurology Group, UCL Queen Square Institute of Neurology, University College London, Russell Square House, Bloomsbury, London WC1B 5EH, UK.

Daniel Herron (D)

Research & Development, NIHR University College London Hospitals Biomedical Research Centre, London WC1E 6BT, UK.

Nick Sorros (N)

Wellcome Data Labs, Wellcome Trust, London NW1 2BE, UK.

Danil Mikhailov (D)

Wellcome Data Labs, Wellcome Trust, London NW1 2BE, UK.

M Jorge Cardoso (MJ)

School of Biomedical Engineering & Imaging Sciences, King's College London, London WC2R 2LS, UK.

Sebastien Ourselin (S)

School of Biomedical Engineering & Imaging Sciences, King's College London, London WC2R 2LS, UK.

Nick McNally (N)

Research & Development, NIHR University College London Hospitals Biomedical Research Centre, London WC1E 6BT, UK.

Bryan Williams (B)

Research & Development, NIHR University College London Hospitals Biomedical Research Centre, London WC1E 6BT, UK.
UCL Institute of Cardiovascular Sciences, University College London, London WC1E 6BT, UK.

Geraint E Rees (GE)

High Dimensional Neurology Group, UCL Queen Square Institute of Neurology, University College London, Russell Square House, Bloomsbury, London WC1B 5EH, UK.
Faculty of Life Sciences, University College London, Gower Street, London WC1E 6BT, UK.

Parashkev Nachev (P)

High Dimensional Neurology Group, UCL Queen Square Institute of Neurology, University College London, Russell Square House, Bloomsbury, London WC1B 5EH, UK.

Classifications MeSH