Why we need a small data paradigm.
Artificial intelligence
Data science
Personalized medicine
Precision health
Precision medicine
Small data
Journal
BMC medicine
ISSN: 1741-7015
Titre abrégé: BMC Med
Pays: England
ID NLM: 101190723
Informations de publication
Date de publication:
17 07 2019
17 07 2019
Historique:
received:
09
04
2019
accepted:
13
06
2019
entrez:
18
7
2019
pubmed:
18
7
2019
medline:
7
11
2019
Statut:
epublish
Résumé
There is great interest in and excitement about the concept of personalized or precision medicine and, in particular, advancing this vision via various 'big data' efforts. While these methods are necessary, they are insufficient to achieve the full personalized medicine promise. A rigorous, complementary 'small data' paradigm that can function both autonomously from and in collaboration with big data is also needed. By 'small data' we build on Estrin's formulation and refer to the rigorous use of data by and for a specific N-of-1 unit (i.e., a single person, clinic, hospital, healthcare system, community, city, etc.) to facilitate improved individual-level description, prediction and, ultimately, control for that specific unit. The purpose of this piece is to articulate why a small data paradigm is needed and is valuable in itself, and to provide initial directions for future work that can advance study designs and data analytic techniques for a small data approach to precision health. Scientifically, the central value of a small data approach is that it can uniquely manage complex, dynamic, multi-causal, idiosyncratically manifesting phenomena, such as chronic diseases, in comparison to big data. Beyond this, a small data approach better aligns the goals of science and practice, which can result in more rapid agile learning with less data. There is also, feasibly, a unique pathway towards transportable knowledge from a small data approach, which is complementary to a big data approach. Future work should (1) further refine appropriate methods for a small data approach; (2) advance strategies for better integrating a small data approach into real-world practices; and (3) advance ways of actively integrating the strengths and limitations from both small and big data approaches into a unified scientific knowledge base that is linked via a robust science of causality. Small data is valuable in its own right. That said, small and big data paradigms can and should be combined via a foundational science of causality. With these approaches combined, the vision of precision health can be achieved.
Sections du résumé
BACKGROUND
There is great interest in and excitement about the concept of personalized or precision medicine and, in particular, advancing this vision via various 'big data' efforts. While these methods are necessary, they are insufficient to achieve the full personalized medicine promise. A rigorous, complementary 'small data' paradigm that can function both autonomously from and in collaboration with big data is also needed. By 'small data' we build on Estrin's formulation and refer to the rigorous use of data by and for a specific N-of-1 unit (i.e., a single person, clinic, hospital, healthcare system, community, city, etc.) to facilitate improved individual-level description, prediction and, ultimately, control for that specific unit.
MAIN BODY
The purpose of this piece is to articulate why a small data paradigm is needed and is valuable in itself, and to provide initial directions for future work that can advance study designs and data analytic techniques for a small data approach to precision health. Scientifically, the central value of a small data approach is that it can uniquely manage complex, dynamic, multi-causal, idiosyncratically manifesting phenomena, such as chronic diseases, in comparison to big data. Beyond this, a small data approach better aligns the goals of science and practice, which can result in more rapid agile learning with less data. There is also, feasibly, a unique pathway towards transportable knowledge from a small data approach, which is complementary to a big data approach. Future work should (1) further refine appropriate methods for a small data approach; (2) advance strategies for better integrating a small data approach into real-world practices; and (3) advance ways of actively integrating the strengths and limitations from both small and big data approaches into a unified scientific knowledge base that is linked via a robust science of causality.
CONCLUSION
Small data is valuable in its own right. That said, small and big data paradigms can and should be combined via a foundational science of causality. With these approaches combined, the vision of precision health can be achieved.
Identifiants
pubmed: 31311528
doi: 10.1186/s12916-019-1366-x
pii: 10.1186/s12916-019-1366-x
pmc: PMC6636023
doi:
Types de publication
Journal Article
Research Support, Non-U.S. Gov't
Langues
eng
Sous-ensembles de citation
IM
Pagination
133Références
Transl Behav Med. 2014 Sep;4(3):290-303
pubmed: 25264468
J Biomed Inform. 2018 Mar;79:82-97
pubmed: 29409750
J Med Internet Res. 2013 Feb 08;15(2):e22
pubmed: 23399668
BMJ. 1996 Jan 13;312(7023):71-2
pubmed: 8555924
Per Med. 2011 Mar;8(2):161-173
pubmed: 21695041
Trials. 2015 Feb 27;16:67
pubmed: 25881274
Endocrinol Metab Clin North Am. 2016 Sep;45(3):565-80
pubmed: 27519131
Methods Inf Med. 2018 Feb;57(1):e10-e21
pubmed: 29621835
Clin Transl Med. 2013 May 10;2(1):10
pubmed: 23663660
Nature. 2015 Apr 30;520(7549):609-11
pubmed: 25925459
JAMA Intern Med. 2018 Oct 1;178(10):1368-1377
pubmed: 30193253
N Engl J Med. 2015 Feb 26;372(9):793-5
pubmed: 25635347
Transl Behav Med. 2016 Jun;6(2):317-28
pubmed: 27357001
Obes Rev. 2010 Dec;11(12):899-906
pubmed: 20345430
J Med Internet Res. 2019 Apr 26;21(4):e12910
pubmed: 31025942
Proc SIGCHI Conf Hum Factor Comput Syst. 2017 May 2;2017:6850-6863
pubmed: 28516175
MMWR Suppl. 2014 Oct 31;63(4):3-27
pubmed: 25356673
JAMA. 2016 May 10;315(18):1941-2
pubmed: 27163980
Health Psychol. 2016 Apr;35(4):407-11
pubmed: 27018733
J Diabetes Sci Technol. 2019 Jul;13(4):790-793
pubmed: 30348013
J Med Internet Res. 2018 Jun 28;20(6):e214
pubmed: 29954725
Am Psychol. 2005 Jan;60(1):16-26
pubmed: 15641918