Evidence from big data in obesity research: international case studies.
Journal
International journal of obesity (2005)
ISSN: 1476-5497
Titre abrégé: Int J Obes (Lond)
Pays: England
ID NLM: 101256108
Informations de publication
Date de publication:
05 2020
05 2020
Historique:
received:
23
05
2019
accepted:
07
01
2020
revised:
20
12
2019
pubmed:
29
1
2020
medline:
7
9
2021
entrez:
29
1
2020
Statut:
ppublish
Résumé
Obesity is thought to be the product of over 100 different factors, interacting as a complex system over multiple levels. Understanding the drivers of obesity requires considerable data, which are challenging, costly and time-consuming to collect through traditional means. Use of 'big data' presents a potential solution to this challenge. Big data is defined by Delphi consensus as: always digital, has a large sample size, and a large volume or variety or velocity of variables that require additional computing power (Vogel et al. Int J Obes. 2019). 'Additional computing power' introduces the concept of big data analytics. The aim of this paper is to showcase international research case studies presented during a seminar series held by the Economic and Social Research Council (ESRC) Strategic Network for Obesity in the UK. These are intended to provide an in-depth view of how big data can be used in obesity research, and the specific benefits, limitations and challenges encountered. Three case studies are presented. The first investigated the influence of the built environment on physical activity. It used spatial data on green spaces and exercise facilities alongside individual-level data on physical activity and swipe card entry to leisure centres, collected as part of a local authority exercise class initiative. The second used a variety of linked electronic health datasets to investigate associations between obesity surgery and the risk of developing cancer. The third used data on tax parcel values alongside data from the Seattle Obesity Study to investigate sociodemographic determinants of obesity in Seattle. The case studies demonstrated how big data could be used to augment traditional data to capture a broader range of variables in the obesity system. They also showed that big data can present improvements over traditional data in relation to size, coverage, temporality, and objectivity of measures. However, the case studies also encountered challenges or limitations; particularly in relation to hidden/unforeseen biases and lack of contextual information. Overall, despite challenges, big data presents a relatively untapped resource that shows promise in helping to understand drivers of obesity.
Sections du résumé
BACKGROUND/OBJECTIVE
Obesity is thought to be the product of over 100 different factors, interacting as a complex system over multiple levels. Understanding the drivers of obesity requires considerable data, which are challenging, costly and time-consuming to collect through traditional means. Use of 'big data' presents a potential solution to this challenge. Big data is defined by Delphi consensus as: always digital, has a large sample size, and a large volume or variety or velocity of variables that require additional computing power (Vogel et al. Int J Obes. 2019). 'Additional computing power' introduces the concept of big data analytics. The aim of this paper is to showcase international research case studies presented during a seminar series held by the Economic and Social Research Council (ESRC) Strategic Network for Obesity in the UK. These are intended to provide an in-depth view of how big data can be used in obesity research, and the specific benefits, limitations and challenges encountered.
METHODS AND RESULTS
Three case studies are presented. The first investigated the influence of the built environment on physical activity. It used spatial data on green spaces and exercise facilities alongside individual-level data on physical activity and swipe card entry to leisure centres, collected as part of a local authority exercise class initiative. The second used a variety of linked electronic health datasets to investigate associations between obesity surgery and the risk of developing cancer. The third used data on tax parcel values alongside data from the Seattle Obesity Study to investigate sociodemographic determinants of obesity in Seattle.
CONCLUSIONS
The case studies demonstrated how big data could be used to augment traditional data to capture a broader range of variables in the obesity system. They also showed that big data can present improvements over traditional data in relation to size, coverage, temporality, and objectivity of measures. However, the case studies also encountered challenges or limitations; particularly in relation to hidden/unforeseen biases and lack of contextual information. Overall, despite challenges, big data presents a relatively untapped resource that shows promise in helping to understand drivers of obesity.
Identifiants
pubmed: 31988482
doi: 10.1038/s41366-020-0532-8
pii: 10.1038/s41366-020-0532-8
doi:
Types de publication
Journal Article
Research Support, Non-U.S. Gov't
Langues
eng
Sous-ensembles de citation
IM
Pagination
1028-1040Références
Davison KK, Birch LL. Childhood overweight: a contextual model and recommendations for future research. Obes Rev. 2001;2:159–71.
pubmed: 12120101
pmcid: 2530932
doi: 10.1046/j.1467-789x.2001.00036.x
Egger G, Swinburn B. An “ecological” approach to the obesity pandemic. BMJ. 1997;315:477–80.
pubmed: 9284671
pmcid: 2127317
doi: 10.1136/bmj.315.7106.477
Harrison K, Bost KK, McBride BA, Donovan SM, Grigsby-Toussaint DS, Kim J, et al. Toward a developmental conceptualization of contributors to overweight and obesity in childhood: the six-Cs model. Child Dev Perspect. 2011;5:50–8.
doi: 10.1111/j.1750-8606.2010.00150.x
Butland B, Jebb S, Kopelman P, McPherson K, Thomas S, Mardell J et al. Foresight. Tackling obesities: future choices—project report. Government Office for Science; 2007.
Rutter HR, Bes-Rastrollo M, de Henauw S, Lahti-Koski M, Lehtinen-Jacks S, Mullerova D, et al. Balancing upstream and downstream measures to tackle the obesity epidemic: a position statement from the European association for the study of obesity. Obes Facts. 2017;10:61–3.
pubmed: 28245444
pmcid: 5644948
doi: 10.1159/000455960
Mittelstadt BD, Floridi L. The ethics of big data: current and foreseeable issues in biomedical contexts. Sci Eng Ethics. 2016;22:303–41.
pubmed: 26002496
doi: 10.1007/s11948-015-9652-2
Kaisler S, Armour F, Espinosa JA, Money W. Big data: issues and challenges moving forward. In: Proceedings of the 46th Hawaii International Conference on System Sciences. Association for Computing Machinery Digital Library; 2013. p. 995–1004.
Herland M, Khoshgoftaar TM, Wald R. A review of data mining using big data in health informatics. J Big Data. 2014;1: https://doi.org/10.1186/2196-1115-1-2 .
Vogel C, Zwolinsky S, Griffiths C, Hobbs M, Henderson E, Wilkins E. A Delphi study to build consensus on the definition and use of big data in obesity research. Int J Obes. 2019. https://doi.org/10.1038/s41366-018-0313-9 .
Morris M, Birkin M. The ESRC strategic network for obesity: tackling obesity with big data. Int J Obes. 2018;42:1948–50.
doi: 10.1038/s41366-018-0196-9
Timmins K, Green M, Radley D, Morris M, Pearce J. How has big data contributed to obesity research? A review of the literature. Int J Obes. 2018;42:1951–62.
doi: 10.1038/s41366-018-0153-7
Monsivais P, Francis O, Lovelace R, Chang M, Strachan E, Burgoine T. Data visualisation to support obesity policy: case studies of data tools for planning and transport policy in the UK. Int J Obes. 2018;42:1977–86.
doi: 10.1038/s41366-018-0243-6
Morris M, Wilkins E, Timmins K, Bryant M, Birkin M, Griffiths C. Can big data solve a big problem? Reporting the obesity data landscape in line with the Foresight obesity system map. Int J Obes. 2018;42:1963–76.
doi: 10.1038/s41366-018-0184-0
Vayena E, Salathé M, Madoff LC, Brownstein JS. Ethical challenges of big data in public health. PLOS Comput Biol. 2015;11:e1003904.
pubmed: 25664461
pmcid: 4321985
doi: 10.1371/journal.pcbi.1003904
Silver LD, Ng SW, Ryan-Ibarra S, Taillie LS, Induni M, Miles DR, et al. Changes in prices, sales, consumer spending, and beverage consumption one year after a tax on sugar-sweetened beverages in Berkeley, California, US: a before-and-after study. PLoS Med. 2017;14:e1002283.
pubmed: 28419108
pmcid: 5395172
doi: 10.1371/journal.pmed.1002283
Gore RJ, Diallo S, Padilla J. You are what you tweet: connecting the geographic variation in america’s obesity rate to Twitter content. PLoS ONE. 2015;10:e0133505.
pubmed: 26332588
pmcid: 4557976
doi: 10.1371/journal.pone.0133505
Nguyen QC, Li D, Meng H-W, Kath S, Nsoesie E, Li F, et al. Building a national neighborhood dataset from geotagged Twitter data for indicators of happiness, diet, and physical activity. JMIR Public Health Surveill. 2016;2:e158.
pubmed: 27751984
pmcid: 5088343
doi: 10.2196/publichealth.5869
Hirsch JA, James P, Robinson JR, Eastman KM, Conley KD, Evenson KR, et al. Using MapMyFitness to place physical activity into neighborhood context. Front Public Health. 2014;2:1–9.
doi: 10.3389/fpubh.2014.00019
Althoff T, Hicks JL, King AC, Delp SL, Leskovec J. Large-scale physical activity data reveal worldwide activity inequality. Nature. 2017;547:336–9.
pubmed: 28693034
pmcid: 5774986
doi: 10.1038/nature23018
Kerr NL. HARKing: hypothesizing after the results are known. Pers Soc Psychol Rev. 1998;2:196–217.
pubmed: 15647155
doi: 10.1207/s15327957pspr0203_4
Lee IM, Shiroma EJ, Lobelo F, Puska P, Blair SN, Katzmarzyk PT, et al. Effect of physical inactivity on major non-communicable diseases worldwide: an analysis of burden of disease and life expectancy. Lancet. 2012;380:219–29.
pubmed: 3645500
pmcid: 3645500
doi: 10.1016/S0140-6736(12)61031-9
Bennett JE, Li G, Foreman K, Best N, Kontis V, Pearson C, et al. The future of life expectancy and life expectancy inequalities in England and Wales: Bayesian spatiotemporal forecasting. Lancet. 2015;386:163–70.
pubmed: 25935825
pmcid: 4502253
doi: 10.1016/S0140-6736(15)60296-3
World Health Organisation. Report of the Commission on ending childhood obesity. Geneva, Switzerland: World Health Organisation; 2016.
Centers for Disease Control and Prevention. Recommended community strategies and measurements to prevent obesity in the United States. Atlanta, GA, U.S.: Centers for Disease Control and Prevention; 2009.
Local Government Association. Building the foundations: tackling obesity through planning and development. London, UK: Local Government Association; 2016.
Burgoine T, Alvanides S, Lake AA. Creating ‘obesogenic realities’; Do our methodological choices make a difference when measuring the food environment? Int J Health Geogr. 2013;12. https://doi.org/10.1186/1476-072X-12-33 .
Wilkins E, Morris M, Radley D, Griffiths C. Methods of measuring associations between the Retail Food Environment and weight status: Importance of classifications and metrics. SSM Popul Health. 2019. https://doi.org/10.1016/j.ssmph.2019.100404 .
Bardou M, Barkun AN, Martel M. Obesity and colorectal cancer. Gut. 2013;62:933–47.
pubmed: 23481261
doi: 10.1136/gutjnl-2013-304701
Siegel R, Desantis C, Jemal A. Colorectal cancer statistics, 2014. CA Cancer J Clin. 2014;64:104–17.
pubmed: 24639052
doi: 10.3322/caac.21220
Derogar M, Hull MA, Kant P, Östlund M, Lu Y, Lagergren J. Increased risk of colorectal cancer after obesity surgery. Ann Surg. 2013;258:983–8.
pubmed: 23470581
doi: 10.1097/SLA.0b013e318288463a
Kant P, Hull MA. Excess body weight and obesity—the link with gastrointestinal and hepatobiliary cancer. Nat Rev Gastroenterol Hepatol. 2011;8:224–38.
pubmed: 21386810
doi: 10.1038/nrgastro.2011.23
Östlund MP, Lu Y, Lagergren J. Risk of obesity-related cancer after obesity surgery in a population-based cohort study. Ann Surg. 2010;252:972–6.
pubmed: 20571362
doi: 10.1097/SLA.0b013e3181e33778
Sainsbury A, Goodlad RA, Perry SL, Pollard SG, Robins GG, Hull MA. Increased colorectal epithelial cell proliferation and crypt fission associated with obesity and roux-en-Y gastric bypass. Cancer Epidemiol Biomark Prev. 2008;17:1401–10.
doi: 10.1158/1055-9965.EPI-07-2874
Aravani A, Downing A, Thomas JD, Lagergren J, Morris EJA, Hull MA. Obesity surgery and risk of colorectal and other obesity-related cancers: an English population-based cohort study. Cancer Epidemiol. 2018;53:99–104.
pubmed: 29414638
pmcid: 5865073
doi: 10.1016/j.canep.2018.01.002
Openshaw S. The modifiable areal unit problem. In: Concepts and techniques in modern geography. Norwich: Geo Books; 1984. p. 1–41.
Kwan M-P. The uncertain geographic context problem. Ann Assoc Am Geogr. 2012;102:958–68.
doi: 10.1080/00045608.2012.687349
Di Zhu X, Yang Y, Liu X. The importance of housing to the accumulation of household net wealth. Harvard, USA: Joint Center for Housing Studies, Harvard University; 2003.
Rehm CD, Moudon AV, Hurvitz PM, Drewnowski A. Residential property values are associated with obesity among women in King County, WA, USA. Soc Sci Med. 2012;75:491–5.
pubmed: 22591823
pmcid: 3889661
doi: 10.1016/j.socscimed.2012.03.041
Drewnowski A, Buszkiewicz J, Aggarwal A. Soda, salad, and socioeconomic status: findings from the Seattle Obesity Study (SOS). SSM Popul Health. 2019;7:e100339.
doi: 10.1016/j.ssmph.2018.100339
Birkin M, Morris MA, Birkin TM, Lovelace R. Using census data in microsimulation modelling. In: Stillwell J, Duke-Williams O, editors. The Routledge handbook of census resources, methods and applications. 1st ed. Routledge: IJO publication; 2018.
Jiao J, Drewnowski A, Moudon AV, Aggarwal A, Oppert J-M, Charreire H, et al. The impact of area residential property values on self-rated health: a cross-sectional comparative study of Seattle and Paris. Prev Med Rep. 2016;4:68–74.
pubmed: 27413663
pmcid: 4929065
doi: 10.1016/j.pmedr.2016.05.008
Nguyen DM, El-Serag HB. The epidemiology of obesity. Gastroenterol Clinics. 2010;39:1–7.
doi: 10.1016/j.gtc.2009.12.014
Pickett KE, Pearl M. Multilevel analyses of neighbourhood socioeconomic context and health outcomes: a critical review. J Epidemiol Commun Health. 2001;55:111–22.
doi: 10.1136/jech.55.2.111
Timperio A, Salmon J, Telford A, Crawford D. Perceptions of local neighbourhood environments and their relationship to childhood overweight and obesity. Int J Obes. 2005;29:170–5.
doi: 10.1038/sj.ijo.0802865
Roda C, Charreire H, Feuillet T, Mackenbach JD, Compernolle S, Glonti K, et al. Mismatch between perceived and objectively measured environmental obesogenic features in European neighbourhoods. Obes Rev. 2016;17 S1:31–41.
pubmed: 26879111
doi: 10.1111/obr.12376
Drewnowski A, Arterburn D, Zane J, Aggarwal A, Gupta S, Hurvitz PM, et al. The Moving to Health (M2H) approach to natural experiment research: a paradigm shift for studies on built environment and health. SSM Popul Health. 2019;7:100345.
pubmed: 30656207
doi: 10.1016/j.ssmph.2018.100345
Bourassa SC, Cantoni E, Hoesli M. Predicting house prices with spatial dependence a comparison of alternative methods. J Real Estate Res. 2010;32:139–60.
Wilkins EL, Radley D, Morris MA, Griffiths C. Examining the validity and utility of two secondary sources of food environment data against street audits in England. Nutr J. 2017;16:1–13.
doi: 10.1186/s12937-017-0302-1
Nevalainen J, Erkkola M, Saarijarvi H, Nappila T, Fogelholm M. Large-scale loyalty card data in health research. Digit Health. 2018;4:2055207618816898.
pubmed: 30546912
pmcid: 6287323
Aiello L, Schifanello R, Quercia D, Del Prete L. Large-scale and high-resolution analysis of food purchases and health outcomes. EPJ Data Sci. 2019;8:14.
Craig CL, Marshall AL, Sjostrom M, Bauman AE, Booth ML, Ainsworth BE, et al. International physical activity questionnaire: 12-country reliability and validity. Med Sci Sports Exerc. 2003;35:1381–95.
pubmed: 12900694
doi: 10.1249/01.MSS.0000078924.61453.FB
Zwolinsky S, McKenna J, Pringle A, Widdop P, Griffiths C, Mellis M, et al. Physical activity and sedentary behavior clustering: segmentation to optimize active lifestyles. J Phys Act Health. 2016;13:921–8.
pubmed: 27171277
doi: 10.1123/jpah.2015-0307
Bauman A, Ainsworth BE, Sallis JF, Hagströmer M, Craig CL, Bull FC, et al. The descriptive epidemiology of sitting: a 20-country comparison using the International Physical Activity Questionnaire (IPAQ). Am J Prev Med. 2011;41:228–35.
doi: 10.1016/j.amepre.2011.05.003
Guerin PB, Diiriye RO, Corrigan C, Guerin B. Physical activity programs for refugee somali women: working out in a new country. Women & Health. 2003;38:83–99.
doi: 10.1300/J013v38n01_06
Pope L, Harvey J. The efficacy of incentives to motivate continued fitness-center attendance in college first-year students: a randomized controlled trial. J Am Coll Health. 2014;62:81–90.
pubmed: 24456510
doi: 10.1080/07448481.2013.847840
Cetateanu A, Jones A. Understanding the relationship between food environments, deprivation and childhood overweight and obesity: evidence from a cross sectional England-wide study. Health Place. 2014;27:68–76.
pubmed: 24561918
pmcid: 4018665
doi: 10.1016/j.healthplace.2014.01.007
Harrison F, Burgoine T, Corder K, van Sluijs EM, Jones A. How well do modelled routes to school record the environments children are exposed to? A cross-sectional comparison of GIS-modelled and GPS-measured routes to school. Int J Health Geogr. 2014;13:5.
pubmed: 24529075
pmcid: 3942764
doi: 10.1186/1476-072X-13-5
Ells LJ, Macknight N, Wilkinson JR. Obesity surgery in England: an examination of the health episode statistics 1996–2005. Obes Surg. 2007;17:400–5.
pubmed: 17546850
doi: 10.1007/s11695-007-9070-x
Nielsen JDJ, Laverty AA, Millett C, Mainous AG, Majeed A, Saxena S. Rising obesity-related hospital admissions among children and young people in England: National time trends study. PLoS ONE. 2013;8:e65764.
doi: 10.1371/journal.pone.0065764
Smittenaar C, Petersen K, Stewart K, Moitt N. Cancer incidence and mortality projections in the UK until 2035. Br J Cancer. 2016;115:1147–55.
pubmed: 27727232
pmcid: 5117795
doi: 10.1038/bjc.2016.304
Wallington M, Saxon EB, Bomb M, Smittenaar R, Wickenden M, McPhail S, et al. 30-day mortality after systemic anticancer treatment for breast and lung cancer in England: a population-based, observational study. The Lancet Oncol. 2016;17:1203–16.
pubmed: 27599138
doi: 10.1016/S1470-2045(16)30383-7
Smolina K, Wright FL, Rayner M, Goldacre MJ. Determinants of the decline in mortality from acute myocardial infarction in England between 2002 and 2010: Linked national database study. BMJ. 2012;344:d8059.
pubmed: 22279113
pmcid: 3266430
doi: 10.1136/bmj.d8059
Hanratty B, Lowson E, Grande G, Payne S, Addington-Hall J, Valtorta N, et al. Transitions at the end of life for older adults–patient, carer and professional perspectives: A mixed-methods study. Health Serv Deliv Res. 2014. https://doi.org/10.3310/hsdr02170 .
Aggarwal A, Monsivais P, Cook AJ, Drewnowski A. Does diet cost mediate the relation between socioeconomic position and diet quality? Eur J Clin Nutr. 2011;65:1059–66.
pubmed: 21559042
pmcid: 3157585
doi: 10.1038/ejcn.2011.72
Drewnowski A, Aggarwal A, Tang W, Moudon AV. Residential property values predict prevalent obesity but do not predict 1-year weight change. Obesity. 2015;23:671–6.
pubmed: 25684713
doi: 10.1002/oby.20989