A global dataset of 7 billion individuals with socio-economic characteristics.
Journal
Scientific data
ISSN: 2052-4463
Titre abrégé: Sci Data
Pays: England
ID NLM: 101640192
Informations de publication
Date de publication:
07 Oct 2024
07 Oct 2024
Historique:
received:
21
12
2023
accepted:
11
09
2024
medline:
8
10
2024
pubmed:
8
10
2024
entrez:
7
10
2024
Statut:
epublish
Résumé
In global impact modeling, there is a need to address the heterogeneous characteristics of households and individuals that drive different behavioral responses to, for example, environmental risk, socio-economic policy changes and spread of diseases. In this research, we present GLOPOP-S, the first global synthetic population dataset with 1,999,227,130 households and 7,335,881,094 individuals for the year 2015, consistent with population statistics at an administrative unit 1 level. GLOPOS-S contains the following attributes: age, education, gender, income/wealth, settlement type (urban/rural), household size, household type, and for selected countries in the Global South, ownership of agricultural land and dwelling characteristics. To generate GLOPOP-S, we use microdata from the Luxembourg Income Study (LIS) and Demographic and Health Surveys (DHS) and apply synthetic reconstruction techniques to fit national survey data to regional statistics, thereby accounting for spatial differences within and across countries. Additionally, we have developed methods to generate data for countries without available microdata. The dataset can be downloaded per region or country. GLOPOP-S is open source and can be extended with other attributes.
Identifiants
pubmed: 39375378
doi: 10.1038/s41597-024-03864-2
pii: 10.1038/s41597-024-03864-2
doi:
Types de publication
Dataset
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Pagination
1096Subventions
Organisme : EC | Horizon 2020 Framework Programme (EU Framework Programme for Research and Innovation H2020)
ID : 884442
Organisme : EC | Horizon 2020 Framework Programme (EU Framework Programme for Research and Innovation H2020)
ID : 884442
Organisme : EC | Horizon 2020 Framework Programme (EU Framework Programme for Research and Innovation H2020)
ID : 884442
Organisme : EC | Horizon 2020 Framework Programme (EU Framework Programme for Research and Innovation H2020)
ID : 88442
Organisme : EC | Horizon 2020 Framework Programme (EU Framework Programme for Research and Innovation H2020)
ID : 884442
Organisme : EC | Horizon 2020 Framework Programme (EU Framework Programme for Research and Innovation H2020)
ID : 884442
Organisme : EC | Horizon 2020 Framework Programme (EU Framework Programme for Research and Innovation H2020)
ID : 884442
Informations de copyright
© 2024. The Author(s).
Références
Balcan, D. et al. Modeling the spatial spread of infectious diseases: the GLobal Epidemic and Mobility computational model. J. Comput. Sci. 1(3), 132–145, https://doi.org/10.1016/j.jocs.2010.07.002 (2010).
doi: 10.1016/j.jocs.2010.07.002
Yeh, S. et al. Detailed assessment of global transport-energy models’ structures and projections. Transport. Res. D-Tr. E. 55, 294–309, https://doi.org/10.1016/j.trd.2016.11.001 (2017).
doi: 10.1016/j.trd.2016.11.001
Van Meijl, H. et al. Modelling alternative futures of global food security: Insights from FOODSECURE. Glob. Food Secur. 25, https://doi.org/10.1016/j.gfs.2020.100358 (2020).
Ahmad, T. & Zhang, D. A critical review of comparative global historical energy consumption and future demand: The story told so far. Energy Reports 6, 1973–1991, https://doi.org/10.1016/j.egyr.2020.07.020 (2020).
doi: 10.1016/j.egyr.2020.07.020
Jones, E. R. et al. DynQual v1.0: a high-resolution global surface water quality model. Geosci. Model Dev. 16(15), 4481–4500, https://doi.org/10.5194/gmd-16-4481-2023 (2023).
doi: 10.5194/gmd-16-4481-2023
Haer, T., Husby, T. G., Botzen, W. J. W. & Aerts, J. C. J. H. The Safe Development Paradox: An Agent-Based Model for Flood Risk under Climate Change in the European Union. Glob. Environ. Change 60, 1–12, https://doi.org/10.1016/j.gloenvcha.2019.102009 (2020).
doi: 10.1016/j.gloenvcha.2019.102009
Nordhaus, W. D. Revisiting the Social Cost of Carbon. PNAS 114(7), 1518–1523, https://doi.org/10.1073/pnas.1609244114 (2017).
doi: 10.1073/pnas.1609244114
Arneth, A., Brown, C. & Rounsevell, M. Global models of human decision-making for land-based mitigation and adaptation assessment. Nature Clim. Change 4, 550–557, https://doi.org/10.1038/nclimate2250 (2014).
doi: 10.1038/nclimate2250
Weyant, J. Some Contributions of Integrated Assessment Models of Global Climate Change. REEP 11(1), 115–137 (2017).
Lippe, M. et al. Using agent-based modelling to simulate social-ecological systems across scales. Geoinformatica 23, 269–298, https://doi.org/10.1007/s10707-018-00337-8 (2019).
doi: 10.1007/s10707-018-00337-8
Giarola, S., Sachs, J., d’Avezac, M., Kell, A. & Hawkes, A. MUSE: An open-source agent-based integrated assessment modelling framework. Energy Strat. Rev. 44, https://doi.org/10.1016/j.esr.2022.100964 (2022).
De Mooij, J. et al A Framework for Modeling Human Behavior in Large-Scale Agent-Based Epidemic Simulations. SIMULATION 99(12), https://doi.org/10.1177/00375497231184898 (2023).
Rai, V. & Henry, A. Agent-based modelling of consumer energy choices. Nat. Clim. Change 6, 556–562, https://doi-org.vu-nl.idm.oclc.org/10.1038/nclimate2967 (2016).
doi: 10.1038/nclimate2967
Prédhumeau, M. & Manley, E. A synthetic population for agent-based modelling in Canada. Sci. Data 10, 148, https://doi-org.vu-nl.idm.oclc.org/10.1038/s41597-023-02030-4 (2023).
doi: 10.1038/s41597-023-02030-4
Münnich, R., Schürle, J. On the simulation of complex universes in the case of applying the German Microcensus. DACSEIS research paper series No. 4, University of Tübingen (2023).
Sun, L., Erath, A. & Cai, M. A hierarchical mixture modeling framework for population synthesis. Transport. Res. B-M 114, 199–212, https://doi.org/10.1016/j.trb.2018.06.002 (2018).
doi: 10.1016/j.trb.2018.06.002
Alfons, A., Kraft, S., Templ, M. & Filzmoser, P. Simulation of close-to-reality population data for household surveys with application to EU-SILC. SMA 20, https://doi.org/10.1007/s10260-011-0163-2 (2011).
Wu, G., Heppenstall, A., Meier, P., Purshouse, R. & Lomax, N. A Synthetic Population Dataset for Estimating Small Area Health and Socio-Economic Outcomes in Great Britain. Sci. Data 9, https://doi.org/10.1038/s41597-022-01124-9 (2022).
Beckman, R. J., Baggerly, K. A. & McKay, M. D. Creating synthetic baseline populations. Transport. Res. A-PP 30(6), 415–429, https://doi.org/10.1016/0965-8564(96)00004-3 (1996).
doi: 10.1016/0965-8564(96)00004-3
Lovelace, R., Birkin, M., Ballas, D. & van Leeuwen, E. Evaluating the performance of iterative proportional fitting for spatial microsimulation: new tests for an established technique. JASSS 18(21) (2015).
Ton, M.J. et al. GLOPOP-S. Harvard Dataverse, V4. https://doi.org/10.7910/DVN/KJC3RH (2023).
Hino, M., Fiel, C. B. & Mach, K. J. Managed retreat as a response to natural hazard risk. Nat. Clim. Change 7(5), 364–370, https://doi.org/10.1038/nclimate3252 (2017).
doi: 10.1038/nclimate3252
Berhe, M. et al The effects of adaptation to climate change on income of households in rural Ethiopia. Pastoralism 7, https://doi.org/10.1186/s13570-017-0084-2 (2017).
Ehsan, S., Begum, R. A., Nizam, K., Maulud, A. & Mia, S. Assessing household perception, autonomous adaptation and economic value of adaptation benefits: Evidence from West Coast of Peninsular Malaysia. Advances in Climate Change Research 13(5), 738–758, https://doi.org/10.1016/j.accre.2022.06.002 (2022).
doi: 10.1016/j.accre.2022.06.002
Chai, L., Han, Y., Han, Z., Wei, J. & Zhao, Y. Differences in disaster preparedness between urban and rural communities in China, IJDRR 53, https://doi.org/10.1016/j.ijdrr.2020.102020 (2022).
Lindersson, S. et al. The wider the gap between rich and poor the higher the flood mortality. Nat. Sustain. 6, 995–1005, https://doi.org/10.1038/s41893-023-01107-7 (2023).
doi: 10.1038/s41893-023-01107-7
Nixon, R. et al The relationship between household structures and everyday adaptation and livelihood strategies in northwestern Pakistan. Ecol. Soc. 28(2), https://doi.org/10.5751/ES-14026-280231 (2023).
The Demographic and Health Surveys (DHS) Program. https://dhsprogram.com/ (2022).
Luxembourg Income Study Database (LIS). https://www.lisdatacenter.org/ (multiple countries: November 2022 – September 2023). Luxembourg: LIS. (2022).
Rufat, S., Tate, E., Burton, C. G. & Maroof, A. S. Social vulnerability to floods: Review of case studies and implications for measurement. IJDRR 14(4), 470–486, https://doi.org/10.1016/j.ijdrr.2015.09.013 (2015).
doi: 10.1016/j.ijdrr.2015.09.013
Yaméogo, B. F., Gastineau, P., Hankach, P. & Vandanjon, P. O. Comparing Methods for Generating a Two-Layered Synthetic Population. Transport. Res. Rec. 2675(1), 136–147, https://doi.org/10.1177/0361198120964734 (2020).
doi: 10.1177/0361198120964734
Mueller, K. & Axhausen, K. Population synthesis for microsimulation: State of the art. In: Proceeding of Transportation Research Board 90
Choupani, A. A. & Mamdoohi, A. R. Population Synthesis Using Iterative Proportional Fitting (IPF): A Review and Future Research. Transport. Res. Proc. 17, 223–233, https://doi.org/10.1016/j.trpro.2016.11.078 (2016).
doi: 10.1016/j.trpro.2016.11.078
Guo, J. & Bhat, C. Population Synthesis for Microsimulating Travel Behavior. Transport. Res. Rec. 1, 92–101, https://doi.org/10.3141/2014-12 (2007).
doi: 10.3141/2014-12
Auld, J., Mohammadian, A. K. & Wies, K. Population synthesis with control category optimization, paper presented at the 10th International Conference on Application of Advanced Technologies in Transportation, Athens, Greece (2008).
Khachman, M., Morency, C. & Ciari, F. Impact of the Geographic Resolution on Population Synthesis Quality. ISPRS Int. J. Geo-Inf 10(790), https://doi.org/10.3390/ijgi10110790 (2021).
Agresti, A. Categorical Data Analysis. New York: John Wiley (1990).
Chi, G., Fang, H., Chatterjee, S. & Blumenstock, J.E. Microestimates of wealth for all low- and middle-income countries. PNAS 119(3), https://doi.org/10.1073/pnas.2113658119 (2022).
Vyas, S. & Kumaranayake, L. Constructing socio-economic status indices: how to use principal components analysis. Health Policy Plan. 21(6), 459–468, https://doi.org/10.1093/heapol/czl029 (2006).
doi: 10.1093/heapol/czl029
OECD. Economic well-being. OECD Framework for Statistics on the Distribution of Household Income, Consumption and Wealth, OECD Publishing, Paris. https://doi.org/10.1787/9789264194830-en (2013).
Smits, J. & Permanyer, I. The Subnational Human Development Database. Sci. Data 6, 190038, https://doi.org/10.1038/sdata.2019.38 (2019).
doi: 10.1038/sdata.2019.38
Pesaresi, M., Florczyk, A., Schiavina, M., Melchiorri, M. & Maffenini, L. GHS settlement grid, updated and refined REGIO model 2014 in application to GHS-BUILT R2018A and GHS-POP R2019A, multitemporal (1975-1990-2000-2015), R2019A. European Commission, Joint Research Centre (JRC) https://doi.org/10.2905/42E8BE89-54FF-464E-BE7B-BF9E64DA5218 (2019).
Freire, S., Halkia, M. & Pesaresi, M. GHS population grid, derived from EUROSTAT census data (2011) and ESM R2016 - OBSOLETE RELEASE. European Commission, Joint Research Centre (JRC) [Dataset] PID: http://data.europa.eu/89h/jrc-ghsl-ghs_pop_eurostat_europe_r2016a (2016).
Zhou, M., Li, J., Basu, R. & Ferreira, J. Creating spatially-detailed heterogeneous synthetic populations for agent based microsimulation. Comput. Environ. Urban. Syst. 91, 101717, https://doi.org/10.1016/j.compenvurbsys.2021.101717 (2022).
doi: 10.1016/j.compenvurbsys.2021.101717
Ye, X., Konduri, K., Pendyala, R. M., Sana, B. & Waddell, P. A methodology to match distributions of both household and person attributes in the generation of synthetic populations. In: 88th Annual Meeting of the Transportation Research Board (2009).
Lovelace, R. & Ballas, D. “Truncate, Replicate, Sample”: A Method for Creating Integer Weights for Spatial Microsimulation. Comput. Environ. Urb. Syst. 41, 1–11, https://doi.org/10.1016/j.compenvurbsys.2013.03.004 (2013).
doi: 10.1016/j.compenvurbsys.2013.03.004
Sun, L. & Erath, A. A Bayesian network approach for population synthesis. Transport. Res.C-ET 61, 49–62, https://doi.org/10.1016/j.trc.2015.10.010 (2015).
doi: 10.1016/j.trc.2015.10.010