A flexible Bayesian nonconfounding spatial model for analysis of dispersed count data.


Journal

Biometrical journal. Biometrische Zeitschrift
ISSN: 1521-4036
Titre abrégé: Biom J
Pays: Germany
ID NLM: 7708048

Informations de publication

Date de publication:
04 2022
Historique:
revised: 14 11 2021
received: 20 05 2021
accepted: 16 11 2021
pubmed: 6 1 2022
medline: 20 4 2022
entrez: 5 1 2022
Statut: ppublish

Résumé

In employing spatial regression models for counts, we usually meet two issues. First, the possible inherent collinearity between covariates and the spatial effect could lead to misleading inferences. Second, real count data usually reveal over- or under-dispersion where the classical Poisson model is not appropriate to use. We propose a flexible Bayesian hierarchical modeling approach by joining nonconfounding spatial methodology and a newly reconsidered dispersed count modeling from the renewal theory to control the issues. Specifically, we extend the methodology for analyzing spatial count data based on the gamma distribution assumption for waiting times. The model can be formulated as a latent Gaussian model, and consequently, we can carry out the fast computation by using the integrated nested Laplace approximation method. We examine different popular approaches for handling spatial confounding and compare their performances in the presence of dispersion. Two real applications from a crime study against women in India as well as stomach cancer incidences in Slovenia motivate the suggested methods. We also perform a simulation study to understand the proposed approach's merits better. Supplementary Materials for this article are available.

Identifiants

pubmed: 34985802
doi: 10.1002/bimj.202100157
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

758-770

Informations de copyright

© 2022 Wiley-VCH GmbH.

Références

Adin, A., Goicoa, T., Hodges, J. S., Schnell, P. M., & Ugarte, M. D. (2021). Alleviating confounding in spatio-temporal areal models with an application on crimes against women in India. Statistical Modelling, 1471082X2110154.
Adrion, C., & Mansmann, U. (2012). Bayesian model selection techniques as decision support for shaping a statistical analysis plan of a clinical trial: An example from a vertigo phase III study with longitudinal count data as primary endpoint. BMC Medical Research Methodology, 12, 137.
Azevedo, D. R. M., Bandyopadhyay, D., Prates, M. O., Abdel-Salam, A.-S. G., & Garcia, D. (2020). Assessing spatial confounding in cancer disease mapping using R. Cancer Reports, 3(4), e1263.
Baetschmann, G., & Winkelmann, R., A Dynamic Hurdle Model for Zero-Inflated Count Data: With an Application to Health Care Utilization. SSRN Electronic Journal, https://doi.org/10.2139/ssrn.2422442
Besag, J., York, J., & Mollié, A. (1991). Bayesian image restoration, with two applications in spatial statistics. Annals of the Institute of Statistical Mathematics, 43(1), 1-20. https://doi.org/10.1007/bf00116466
Bonat, W., Jørgensen, B., Kokonendji, C., Hinde, J., & Demétrio, C. (2016). Extended Poisson-Tweedie: Properties and regression models for count data. Statistical Modelling: An International Journal, 18(1), 24-49. https://doi.org/10.1177/1471082X17715718
Breslow, N. E., & Clayton, D. G. (1993). Approximate inference in generalized linear mixed models. Journal of the American Statistical Association, 88(421), 9-25. https://doi.org/10.2307/2290687
Cameron, A., & Trivedi, P. (2013). Regression Analysis of Count Data. 2nd edition, Cambridge University Press.
Clayton, D. G., Bernardinelli, L., Montomoli, C. (1993). Spatial correlation in ecological analysis. International Journal of Epidemiology, 22(6), 1193-1202. https://doi.org/10.1093/ije/22.6.1193
Cox, D. R. (1962). Renewal theory. 1st edition, London: Methuen ltd.
Dang, G., Kulkarni, V. S., & Gaiha, R. (2018). Why dowry deaths have risen in India? Technical report, ASARC Working Paper, 2018/03. https://crawford.anu.edu.au/acde/asarc/pdf/papers/2018/WP2018-03.pdf (last accessed April 27, 2021).
Dupont, E., Wood, S. M., & Augustin, N. (2020). Spatial+: a novel approach to spatial confounding. Preprint: https://arxiv.org/abs/2009.09420.
Gelman, A., Hwang, J., Vehtari, A. (2014). Understanding predictive information criteria for Bayesian models. Statistics and Computing, 24(6), 997-1016. https://doi.org/10.1007/s11222-013-9416-2
Gómez-Rubio, V. (2020). Bayesian Inference with INLA. Chapman and Hall/CRC. https://doi.org/10.1201/9781315175584.
Gonzales-Barron, U., & Butler, F. (2011). Characterisation of within-batch and between-batch variability in microbial counts in foods using Poisson-gamma and Poisson-lognormal regression models. Food Control, 22, 1268-1278.
Hodges, J. S., & Reich, B. J. (2010). Adding spatially-correlated errors can mess up the fixed effect you love. The American Statistician, 64, 325-334.
Hughes, J., & Haran, M. (2013). Dimension reduction and alleviation of confounding for spatial generalized linear mixed models. Journal of the Royal Statistical Society: Series B (Methodological), 75(1), 139-159. http://www.jstor.org/stable/23361018
Kharrat, T., Boshnakov, G. N., McHale, I., & Baker, R. (2019). Flexible regression models for count data based on renewal processes: The countr package. Journal of Statistical Software, 90(13), 1-35. https://www.jstatsoft.org/index.php/jss/article/view/v090i13
Lord, D., Geedipally, S., & Guikema, S. (2010). Extension of the application of conway-maxwell-poisson models: Analyzing traffic crash data exhibiting underdispersion. Risk Analysis : An Official Publication of the Society for Risk Analysis, 30, 1268-76.
McShane, B., Adrian, M., Bradlow, E. T., & Fader, P. S. (2008). Count models based on Weibull interarrival times. Journal of Business & Economic Statistics, 26(3), 369-378.
Mukherjee, C., R.-P., & Krishnaji, N. (2001). Crimes against women in India: Analysis of official statistics. Economic and Political Weekly, 36(43), 4070-4080. https://www.jstor.org/stable/4411293
Nadifar, M., Baghishani, H., Fallah, A., & Rue, H. (2019). Statistical modeling of groundwater quality assessment in Iran using a flexible Poisson likelihood. Preprint: https://arxiv.org/abs/1908.02344
Nadifar, M., Baghishani, H., Kneib, T., & Fallah, A. (2021). Flexible Bayesian modeling of counts: constructing penalized complexity priors. Preprint: https://arxiv.org/abs/2105.08686
Pettit, L. I. (1990). The conditional predictive ordinate for the normal distribution. Journal of the Royal Statistical Society: Series B (Methodological), 52(1), 175-184.
Prates, M. O., Assuncão, R. M., & Rodrigues, E. C. (2019). Alleviating spatial confounding for areal data problems by displacing the geographical centroids. Bayesian Analysis, 14(2), 623-647. https://doi.org/10.1214/18-BA1123
Reich, B. J., Hodges, J. S., & Zadnik, V. (2006). Effects of residual smoothing on the posterior of the fixed effects in disease-mapping models. Biometrics, 62, 1197-1206.
Ridout, M. S., & Besbeas, P. (2004). An empirical model for underdispersed count data. Statistical Modelling, 4(1), 77-89.
Rue, H., & Held, L. (2005). Gaussian Markov Random Fields: Theory and Applications. Chapman & Hall/CRC Press. https://doi.org/10.1201/9780203492024
Rue, H., Martino, S., & Chopin, N. (2009). Approximate Bayesian inference for latent Gaussian models by using integrated nested Laplace approximations. Journal of the Royal Statistical Society: Series B (Methodological), 71(2), 319-392. https://doi.org/10.1111/j.1467-9868.2008.00700.x
Simpson, D., Rue, H., Riebler, A., Martins, T. G., & Sørbye, S. H. (2017). Penalising model component complexity: A principled, practical approach to constructing priors. Statistical Science, 32(1), 1-28. https://doi.org/10.1214/16-STS576
Spiegelhalter, D. J., Best, N., Carlin, B. P., & Linde, A. (2002). Bayesian measures of model complexity and fit (with discussion). Journal of the Royal Statistical Society: Series B (Methodological), 64(4), 583-639. https://doi.org/10.1111/1467-9868.00353
Subhani, M. I., & Afza, S. (2009). To estimate an equation explaining the determinants of Dowry. MPRA Paper 21365. https://ideas.repec.org/p/pra/mprapa/21365.html
Thaden, H., & Kneib, T. (2018). Structural equation models for dealing with spatial confounding. The American Statistician, 72, 239-252.
Tierney, L., & Kadane, J. B. (1986). Accurate approximations for posterior moments and marginal densities. Journal of the American Statistical Association, 81(393), 82-86.
Wang, X., Yue, Y., & Faraway, J. (2018). Bayesian Regression Modeling with INLA. Chapman and Hall/CRC.
Watanabe, S. (2013). A widely applicable Bayesian information criterion. Journal of Machine Learning Research, 14, 867-897. https://www.jmlr.org/papers/volume14/watanabe13a/watanabe13a.pdf
Winkelmann, R. (1995). Duration dependence and dispersion in count-data models. Journal of Business & Economic Statistics, 13(4), 467-474.
Yue, Y., & Speckman, P. (2010). Nonstationary spatial Gaussian Markov random fields. Journal of Computational and Graphical Statistics, 19, 96-116.
Zamani, H., & Noriszura, I. (2012). Functional form for the generalized Poisson regression model. Communication in Statistics-Theory and Methods, 41(20), 3666-3675. https://doi.org/10.1080/03610926.2011.564742
Zhu, R., & Joe, H. (2009). Modelling heavy-tailed count data using a generalised Poisson-inverse Gaussian family. Statistics & Probability Letters, 79, 1695-1703.

Auteurs

Mahsa Nadifar (M)

Department of Statistics, Faculty of Mathematical Sciences, Shahrood University of Technology, Shahrood, Iran.

Hossein Baghishani (H)

Department of Statistics, Faculty of Mathematical Sciences, Shahrood University of Technology, Shahrood, Iran.

Afshin Fallah (A)

Department of Statistics, Faculty of Sciences, Imam Khomeini International University, Iran.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH