Estimating the area under a receiver operating characteristic curve using partially ordered sets.

imperfect ranking isotonic estimation maximum likelihood nonparametric estimation relative efficiency tie information

Journal

The international journal of biostatistics
ISSN: 1557-4679
Titre abrégé: Int J Biostat
Pays: Germany
ID NLM: 101313850

Informations de publication

Date de publication:
07 08 2020
Historique:
received: 29 10 2019
accepted: 08 06 2020
pubmed: 9 8 2020
medline: 15 12 2021
entrez: 9 8 2020
Statut: epublish

Résumé

Ranked set sampling (RSS), known as a cost-effective sampling technique, requires that the ranker gives a complete ranking of the units in each set. Frey (2012) proposed a modification of RSS based on partially ordered sets, referred to as RSS-t in this paper, to allow the ranker to declare ties as much as he/she wishes. We consider the problem of estimating the area under a receiver operating characteristics (ROC) curve using RSS-t samples. The area under the ROC curve (AUC) is commonly used as a measure for the effectiveness of diagnostic markers. We develop six nonparametric estimators of the AUC with/without utilizing tie information based on different approaches. We then compare the estimators using a Monte Carlo simulation and an empirical study with real data from the National Health and Nutrition Examination Survey. The results show that utilizing tie information increases the efficiency of estimating the AUC. Suggestions about when to choose which estimator are also made available to practitioners.

Identifiants

pubmed: 32764163
doi: 10.1515/ijb-2019-0127
pii: /j/ijb.ahead-of-print/ijb-2019-0127/ijb-2019-0127.xml
doi:

Types de publication

Journal Article Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

139-152

Subventions

Organisme : NIGMS NIH HHS
ID : R15 GM131390
Pays : United States

Informations de copyright

© 2020 Walter de Gruyter GmbH, Berlin/Boston.

Références

McIntyre, GA. A method for unbiased selective sampling using ranked set sampling. Aust J Agric Res 1952;3:385–90. https://doi.org/10.1071/ar9520385.
Stokes, SL, Sager, TW. Characterization of a ranked-set sample with application to estimating distribution functions. J Am Stat Assoc 1988;83:374–81. https://doi.org/10.1080/01621459.1988.10478607.
Kvam, PH, Samaniego, FJ. Nonparametric maximum likelihood estimation based on ranked set samples. J Am Stat Assoc 1994;89:526–37. https://doi.org/10.1080/01621459.1994.10476777.
Huang, J. Asymptotic properties of Npmle of a distribution function based on ranked set samples. Ann Stat 1997;25:1036–49. https://doi.org/10.1214/aos/1069362737.
Duembgen, L, Zamanzade, E. Inference on a distribution function from ranked set samples. Ann Inst Stat Math 2020;72:157–85. https://doi.org/10.1007/s10463-018-0680-y.
Chen, H, Stasny, EA, Wolfe, DA. Ranked set sampling for efficient estimation of a population proportion. Stat Med 2005;24:3319–29. https://doi.org/10.1002/sim.2158.
Zamanzade, E, Mahdizadeh, M. A more efficient proportion estimator in ranked set sampling. Stat Prob Lett 2017;129:28–33. https://doi.org/10.1016/j.spl.2017.05.001.
Takahasi, K, Wakimoto, K. On unbiased estimates of the population mean based on the sample stratified by means of ordering. Ann Inst Stat Math 1968;20:1–31. https://doi.org/10.1007/bf02911622.
Frey, J, Feeman, TG. Efficiency comparisons for partially rank-ordered set sampling. Stat Papers 2016;58:1149–66. https://doi.org/10.1007/s00362-016-0742-2.
Frey, J, Zhang, Y. Testing perfect rankings in ranked-set sampling with binary data. Can J Stat 2017;45:326–39. https://doi.org/10.1002/cjs.11326.
Wang, X, Lim, J, Stokes, L. Using ranked set sampling with cluster randomized designs for improved inference on treatment effects. J Am Stat Assoc 2016;111:1576–90. https://doi.org/10.1080/01621459.2015.1093946.
Halls, LK, Dell, TR. Trail of ranked-set sampling for forage yields. Forest Sci 1966;12:22–6. https://doi.org/10.1093/forestscience/12.1.22.
Howard, RW, Jones, SC, Mauldin, JK, Beal, RH. Abundance, distribution, and colony size estimates for Reticulitermes spp. (Isopter: Rhinotermitidae) in Southern Mississippi. Environ Entomol 1982;11:1290–3. https://doi.org/10.1093/ee/11.6.1290.
Haq, A, Brown, J, Moltchanova, E, Al-Omari, AI. Partial ranked set sampling design. Environmetrics 2013;24:201–7. https://doi.org/10.1002/env.2203.
Hatefi, A, Jafari Jozani, M, Ozturk, O. Mixture model analysis of partially rank-ordered set samples: age groups of fish from length-frequency data. Scand J Stat 2015;42:848–71. https://doi.org/10.1111/sjos.12140.
Zamanzade, E, Mahdizadeh, M. Using ranked set sampling with extreme ranks in estimating the population proportion. Stat Methods Med Res 2020;29:165–77. https://doi.org/10.1177/0962280218823793.
Wang, X, Ahn, S, Lim, J. Unbalanced ranked set sampling in cluster randomized studies. J Stat Plann Inference 2017;187:1–16. https://doi.org/10.1016/j.jspi.2017.02.005.
Frey, J. Nonparametric mean estimation using partially ordered sets. Environ Ecol Stat 2012;19:309–26. https://doi.org/10.1007/s10651-012-0188-1.
Zamanzade, E, Wang, X. Proportion estimation in ranked set sampling in the presence of tie information. Comput Stat 2018;33:1349–66. https://doi.org/10.1007/s00180-018-0807-x.
Zamanzade, E, Wang, X. Improved nonparametric estimation using partially ordered sets. In: Chandra, G, Nautiyal, R, Chandra, H, editors. Statistical methods and applications in forestry and environmental sciences. Forum for interdisciplinary mathematics. Singapore: Springer; 2020.
Bamber, DC. The area above the ordinal dominance graph and the area below the receiver operating characteristic graph. J Math Psychol 1975;12:387–415. https://doi.org/10.1016/0022-2496(75)90001-2.
Kotz, S, Lumelskii, Y, Pensky, M. The stress-strength model and its generalizations. Theory and applications. Singapore: World Scientific; 2003.
Faraggi, D, Reiser, B. Estimation of the area under the ROC curve. Stat Med 2002;21:3093–106. https://doi.org/10.1002/sim.1228.
Sengupta, S, Mukhuti, S. Unbiased estimation of P(X>Y) using ranked set sample data. Statistics 2008;42:223–30. https://doi.org/10.1080/02331880701823271.
Yin, J, Hao, Y, Samawi, H, Rochani, H. Rank-based kernel estimation of the area under the ROC curve. Stat Methodol 2016;32:91–106. https://doi.org/10.1016/j.stamet.2016.04.001.
Mahdizadeh, M, Zamanzade, E. Kernel based estimation of P(X>Y) in ranked set sampling. SORT 2016;40:243–66. https://www.raco.cat/index.php/SORT/article/view/316145.
Zou, KH, Hall, WJ, Shapiro, DE. Smooth non-parametric receiver-operating characteristic (ROC) curves for continuous diagnostic tests. Stat Med 1997;16:2143–56. https://doi.org/10.1002/(sici)1097-0258(19971015)16:19<2143::aid-sim655>3.0.co;2-3.
MacEachern, SN, Stasny, EA, Wolfe, DA. Judgement post-stratification with imprecise rankings. Biometrics 2004;60:207–15. https://doi.org/10.1111/j.0006-341x.2004.00144.x.
Wang, X, Wang, K, Lim, J. Isotonized CDF estimation from judgment poststratification data with empty strata. Biometrics 2012;68:194–202. https://doi.org/10.1111/j.1541-0420.2011.01655.x.
Robertson, T, Wright, FT, Dykstra, RL. Order-restricted inferences. New York: Wiley; 1988.
Dell, TR, Clutter, JL. Ranked set sampling theory with order statistics background. Biometrics 1972;28:545–55. https://doi.org/10.2307/2556166.
Fligner, MA, MacEachern, SN. Nonparametric two-sample methods for ranked-set sample data. J Am Stat Assoc 2006;101:1107–18. https://doi.org/10.1198/016214506000000410.
Wang, X, Lim, J, Stokes, L. A nonparametric mean estimator for judgment post-stratified data. Biometrics 2008;64:355–63. https://doi.org/10.1111/j.1541-0420.2007.00900.x.

Auteurs

Ehsan Zamanzade (E)

Department of Statistics, Faculty of Mathematics and Statistics, University of Isfahan, Isfahan, 81746-73441, Iran.

Xinlei Wang (X)

Department of Statistical Science, Southern Methodist University, 3225 Daniel Avenue, Dallas, 75275-0332, TX, USA.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH