Multivariate Tail Probabilities: Predicting Regional Pertussis Cases in Washington State.

data fusion density ratio model disease outbreak goodness-of-fit model selection variable tilt

Journal

Entropy (Basel, Switzerland)
ISSN: 1099-4300
Titre abrégé: Entropy (Basel)
Pays: Switzerland
ID NLM: 101243874

Informations de publication

Date de publication:
27 May 2021
Historique:
received: 07 05 2021
revised: 23 05 2021
accepted: 23 05 2021
entrez: 2 6 2021
pubmed: 3 6 2021
medline: 3 6 2021
Statut: epublish

Résumé

In disease modeling, a key statistical problem is the estimation of lower and upper tail probabilities of health events from given data sets of small size and limited range. Assuming such constraints, we describe a computational framework for the systematic fusion of observations from multiple sources to compute tail probabilities that could not be obtained otherwise due to a lack of lower or upper tail data. The estimation of multivariate lower and upper tail probabilities from a given small reference data set that lacks complete information about such tail data is addressed in terms of pertussis case count data. Fusion of data from multiple sources in conjunction with the density ratio model is used to give probability estimates that are non-obtainable from the empirical distribution. Based on a density ratio model with variable tilts, we first present a univariate fit and, subsequently, improve it with a multivariate extension. In the multivariate analysis, we selected the best model in terms of the Akaike Information Criterion (AIC). Regional prediction, in Washington state, of the number of pertussis cases is approached by providing joint probabilities using fused data from several relatively small samples following the selected density ratio model. The model is validated by a graphical goodness-of-fit plot comparing the estimated reference distribution obtained from the fused data with that of the empirical distribution obtained from the reference sample only.

Identifiants

pubmed: 34072055
pii: e23060675
doi: 10.3390/e23060675
pmc: PMC8226468
pii:
doi:

Types de publication

Journal Article

Langues

eng

Références

JAMA. 2006 Oct 11;296(14):1757-63
pubmed: 17032989
Emerg Infect Dis. 2015 Sep;21(9):1568-73
pubmed: 26291475
Am J Public Health. 2015 Sep;105(9):e42-7
pubmed: 26180973
JAMA. 2016 Mar 15;315(11):1149-58
pubmed: 26978210
Am J Epidemiol. 2005 Jun 15;161(12):1159-67
pubmed: 15937025
Clin Infect Dis. 2014 Jun;58(11):1523-9
pubmed: 24633685
J Stat Theory Pract. 2021;15(2):25
pubmed: 33495693
Sankhya B (2008). 2021 Mar 16;:1-18
pubmed: 33746458
Pediatrics. 2016 Jul;138(1):
pubmed: 27358475
Am J Epidemiol. 2008 Dec 15;168(12):1389-96
pubmed: 18922998
MMWR Morb Mortal Wkly Rep. 2012 Jul 20;61(28):517-22
pubmed: 22810264
Lancet Infect Dis. 2017 Sep;17(9):974-980
pubmed: 28623146
Epidemics. 2011 Sep;3(3-4):183-8
pubmed: 22094341
PLoS One. 2018 Dec 26;13(12):e0208404
pubmed: 30586416
Med Hypotheses. 2020 Dec;145:110305
pubmed: 33032174

Auteurs

Xuze Zhang (X)

Department of Mathematics and Institute for Systems Research, University of Maryland, College Park, MD 20742, USA.

Saumyadipta Pyne (S)

Public Health Dynamics Laboratory, Department of Biostatistics, Graduate School of Public Health, University of Pittsburgh, Pittsburgh, PA 15261, USA.
Health Analytics Network, Pittsburgh, PA 15237, USA.

Benjamin Kedem (B)

Department of Mathematics and Institute for Systems Research, University of Maryland, College Park, MD 20742, USA.

Classifications MeSH