Maximal point-polyserial correlation for non-normal random distributions.
attainable correlations
biserial correlation
discretization
latent variable
non‐normal distribution
Journal
The British journal of mathematical and statistical psychology
ISSN: 2044-8317
Titre abrégé: Br J Math Stat Psychol
Pays: England
ID NLM: 0004047
Informations de publication
Date de publication:
22 Oct 2024
22 Oct 2024
Historique:
revised:
02
09
2024
received:
19
02
2024
accepted:
23
09
2024
medline:
22
10
2024
pubmed:
22
10
2024
entrez:
22
10
2024
Statut:
aheadofprint
Résumé
We consider the problem of determining the maximum value of the point-polyserial correlation between a random variable with an assigned continuous distribution and an ordinal random variable with
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Subventions
Organisme : Ministero dell'Università e della Ricerca
ID : 20225PC98R
Informations de copyright
© 2024 The Author(s). British Journal of Mathematical and Statistical Psychology published by John Wiley & Sons Ltd on behalf of British Psychological Society.
Références
Barbiero, A. , & Hitaj, A. (2021). A new method for building a discrete analogue to a continuous random variable based on minimization of a distance between distribution functions. In 2021 International Conference on Data Analytics for Business and Industry (ICDABI) (pp. 338–341).
Barbiero, A. , & Hitaj, A. (2023). Discrete approximations of continuous probability distributions obtained by minimizing Cramér‐von Mises‐type distances. Statistical Papers, 64(5), 1669–1697.
Bartholomew, D. J. (1980). Factor analysis for categorical data. Journal of the Royal Statistical Society, Series B: Statistical Methodology, 42(3), 293–312.
Bedrick, E. J. (1995). A note on the attenuation of correlation. British Journal of Mathematical and Statistical Psychology, 48(2), 271–280.
Chakraborty, S. , Roychowdhury, M. K. , & Sifuentes, J. (2021). High precision numerical computation of principal points for univariate distributions. Sankhya B, 83, 558–584.
Cheng, Y. , & Liu, H. (2016). A short note on the maximal point‐biserial correlation under non‐normality. British Journal of Mathematical and Statistical Psychology, 69(3), 344–351.
Cox, N. (1974). Estimation of the correlation between a continuous and a discrete variable. Biometrics, 30, 171–178.
Demirtas, H. , & Hedeker, D. (2011). A practical way for computing approximate lower and upper correlation bounds. The American Statistician, 65(2), 104–109.
Demirtas, H. , & Hedeker, D. (2016). Computing the point‐biserial correlation under any underlying continuous distribution. Communications in Statistics: Simulation and Computation, 45(8), 2744–2751.
Demirtas, H. , & Vardar‐Acar, C. (2017). Anatomy of correlational magnitude transformations in latency and discretization contexts in Monte‐Carlo studies. In D.‐G. Chen & J. D. Chen (Eds.), Monte‐Carlo Simulation‐Based Statistical Modeling (pp. 59–84). Springer.
Drezner, Z. , & Zerom, D. (2016). A simple and effective discretization of a continuous random variable. Communications in Statistics: Simulation and Computation, 45(10), 3798–3810.
Embrechts, P. , McNeil, A. J. , & Straumann, D. (2002). Correlation and dependence in risk management: Properties and pitfalls. In M. A. H. Dempster (Ed.), Risk management: Value at risk and beyond (pp. 176–223). Cambridge University Press.
Fang, K.‐T. , & Pan, J. (2023). A review of representative points of statistical distributions and their applications. Mathematics, 11(13), 2930.
Fernández, D. , Liu, I. , Costilla, R. , & Gu, P. Y. (2020). Assigning scores for ordered categorical responses. Journal of Applied Statistics, 47(7), 1261–1281.
Fleishman, A. I. (1978). A method for simulating non‐normal distributions. Psychometrika, 43(4), 521–532.
Flury, B. A. (1990). Principal points. Biometrika, 77(1), 33–41.
Foldnes, N. , & Grønneberg, S. (2015). How general is the Vale–Maurelli simulation approach? Psychometrika, 80, 1066–1083.
Fréchet, M. (1951). Sur les tableaux de corrélation dont les marges sont données. Annales de l'Universite de Lyon, Sciences, Section A, 14, 53–77.
Ghalanos, A. , & Theussl, S. (2015). Rsolnp: General non‐linear optimization using augmented lagrange multiplier method [Computer software manual]. R package version 1.16.
Golub, G. H. , & Welsch, J. H. (1969). Calculation of Gauss quadrature rules. Mathematics of Computation, 23(106), 221–230.
Hoeffding, W. (1940). Masstabinvariante korrelationstheorie. Schriften des Mathematischen Instituts und Instituts fur Angewandte Mathematik der Universitat Berlin, 5, 181–233.
Hofert, M. , Kojadinovic, I. , Maechler, M. , & Jun, Y. (2023). copula: Multivariate dependence with copulas [Computer software manual]. R package version 1.1‐2. https://CRAN.R‐project.org/package=copula/
Ivanova, A. , & Berger, V. W. (2001). Drawbacks to integer scoring for ordered categorical data. Biometrics, 57(2), 567–570.
Kadhem, S. H. , & Nikoloulopoulos, A. K. (2021). Factor copula models for mixed data. British Journal of Mathematical and Statistical Psychology, 74(3), 365–403.
Kennan, J. (2006). A note on discrete approximations of continuous distributions . University of Wisconsin‐Madison.
Lee, S.‐Y. , & Poon, W.‐Y. (1986). Maximum likelihood estimation of polyserial correlations. Psychometrika, 51(1), 113–121.
Lilliefors, H. W. (1967). On the Kolmogorov‐Smirnov test for normality with mean and variance unknown. Journal of the American Statistical Association, 62(318), 399–402.
Lloyd, S. (1982). Least squares quantization in PCM. IEEE Transactions on Information Theory, 28(2), 129–137.
MacCallum, R. C. , Zhang, S. , Preacher, K. J. , & Rucker, D. D. (2002). On the practice of dichotomization of quantitative variables. Psychological Methods, 7(1), 19.
Martin, A. D. , Quinn, K. M. , & Park, J. H. (2011). MCMCpack: Markov Chain Monte Carlo in R. Journal of Statistical Software, 42(9), 22. https://doi.org/10.18637/jss.v042.i09
McNeil, A. J. , Frey, R. , & Embrechts, P. (2015). Quantitative risk management: concepts, techniques and tools. Princeton University Press.
Narasimhan, B. , Johnson, S. G. , Hahn, T. , Bouvier, A. , & Kiêu, K. (2023). cubature: Adaptive multivariate integration over hypercubes [Computer software manual]. R package version 2.1.0. https://bnaras.github.io/cubature/
Norman, G. (2010). Likert scales, levels of measurement and the “laws” of statistics. Advances in Health Sciences Education, 15, 625–632.
Novack‐Gottshall, P. , & Wang, S. C. (2019). KScorrect: Lilliefors‐corrected Kolmogorov‐Smirnov goodness‐of‐fit tests [Computer software manual]. R package version 1.4.0. https://CRAN.R‐project.org/package=KScorrect
Olsson, U. , Drasgow, F. , & Dorans, N. J. (1982). The polyserial correlation coefficient. Psychometrika, 47, 337–347.
Pearson, K. (1909). On a new method of determining correlation between a measured character A, and a character B, of which only the percentage of cases wherein B exceeds (or falls short of) a given intensity is recorded for each grade of A. Biometrika, 7(1/2), 96–105.
Quinn, K. M. (2004). Bayesian factor analysis for mixed ordinal and continuous responses. Political Analysis, 12(4), 338–353.
Toda, A. A. (2021). Data‐based automatic discretization of nonparametric distributions. Computational Economics, 57(4), 1217–1235.
Ye, Y. (1987). Interior algorithms for linear, quadratic, and linearly constrained non‐linear programming (Unpublished doctoral dissertation). Department of Engineering‐Economic Systems, Stanford University.
Zhang, P. , Liu, B. , & Pan, J. (2024). Iteratively reweighted least squares method for estimating polyserial and polychoric correlation coefficients. Journal of Computational and Graphical Statistics, 33(1), 316–328.
Zoppè, A. (1995). Principal points of univariate continuous distributions. Statistics and Computing, 5, 127–132.