Maximal point-polyserial correlation for non-normal random distributions.

attainable correlations biserial correlation discretization latent variable non‐normal distribution

Journal

The British journal of mathematical and statistical psychology
ISSN: 2044-8317
Titre abrégé: Br J Math Stat Psychol
Pays: England
ID NLM: 0004047

Informations de publication

Date de publication:
22 Oct 2024
Historique:
revised: 02 09 2024
received: 19 02 2024
accepted: 23 09 2024
medline: 22 10 2024
pubmed: 22 10 2024
entrez: 22 10 2024
Statut: aheadofprint

Résumé

We consider the problem of determining the maximum value of the point-polyserial correlation between a random variable with an assigned continuous distribution and an ordinal random variable with

Identifiants

pubmed: 39435733
doi: 10.1111/bmsp.12362
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Subventions

Organisme : Ministero dell'Università e della Ricerca
ID : 20225PC98R

Informations de copyright

© 2024 The Author(s). British Journal of Mathematical and Statistical Psychology published by John Wiley & Sons Ltd on behalf of British Psychological Society.

Références

Barbiero, A. , & Hitaj, A. (2021). A new method for building a discrete analogue to a continuous random variable based on minimization of a distance between distribution functions. In 2021 International Conference on Data Analytics for Business and Industry (ICDABI) (pp. 338–341).
Barbiero, A. , & Hitaj, A. (2023). Discrete approximations of continuous probability distributions obtained by minimizing Cramér‐von Mises‐type distances. Statistical Papers, 64(5), 1669–1697.
Bartholomew, D. J. (1980). Factor analysis for categorical data. Journal of the Royal Statistical Society, Series B: Statistical Methodology, 42(3), 293–312.
Bedrick, E. J. (1995). A note on the attenuation of correlation. British Journal of Mathematical and Statistical Psychology, 48(2), 271–280.
Chakraborty, S. , Roychowdhury, M. K. , & Sifuentes, J. (2021). High precision numerical computation of principal points for univariate distributions. Sankhya B, 83, 558–584.
Cheng, Y. , & Liu, H. (2016). A short note on the maximal point‐biserial correlation under non‐normality. British Journal of Mathematical and Statistical Psychology, 69(3), 344–351.
Cox, N. (1974). Estimation of the correlation between a continuous and a discrete variable. Biometrics, 30, 171–178.
Demirtas, H. , & Hedeker, D. (2011). A practical way for computing approximate lower and upper correlation bounds. The American Statistician, 65(2), 104–109.
Demirtas, H. , & Hedeker, D. (2016). Computing the point‐biserial correlation under any underlying continuous distribution. Communications in Statistics: Simulation and Computation, 45(8), 2744–2751.
Demirtas, H. , & Vardar‐Acar, C. (2017). Anatomy of correlational magnitude transformations in latency and discretization contexts in Monte‐Carlo studies. In D.‐G. Chen & J. D. Chen (Eds.), Monte‐Carlo Simulation‐Based Statistical Modeling (pp. 59–84). Springer.
Drezner, Z. , & Zerom, D. (2016). A simple and effective discretization of a continuous random variable. Communications in Statistics: Simulation and Computation, 45(10), 3798–3810.
Embrechts, P. , McNeil, A. J. , & Straumann, D. (2002). Correlation and dependence in risk management: Properties and pitfalls. In M. A. H. Dempster (Ed.), Risk management: Value at risk and beyond (pp. 176–223). Cambridge University Press.
Fang, K.‐T. , & Pan, J. (2023). A review of representative points of statistical distributions and their applications. Mathematics, 11(13), 2930.
Fernández, D. , Liu, I. , Costilla, R. , & Gu, P. Y. (2020). Assigning scores for ordered categorical responses. Journal of Applied Statistics, 47(7), 1261–1281.
Fleishman, A. I. (1978). A method for simulating non‐normal distributions. Psychometrika, 43(4), 521–532.
Flury, B. A. (1990). Principal points. Biometrika, 77(1), 33–41.
Foldnes, N. , & Grønneberg, S. (2015). How general is the Vale–Maurelli simulation approach? Psychometrika, 80, 1066–1083.
Fréchet, M. (1951). Sur les tableaux de corrélation dont les marges sont données. Annales de l'Universite de Lyon, Sciences, Section A, 14, 53–77.
Ghalanos, A. , & Theussl, S. (2015). Rsolnp: General non‐linear optimization using augmented lagrange multiplier method [Computer software manual]. R package version 1.16.
Golub, G. H. , & Welsch, J. H. (1969). Calculation of Gauss quadrature rules. Mathematics of Computation, 23(106), 221–230.
Hoeffding, W. (1940). Masstabinvariante korrelationstheorie. Schriften des Mathematischen Instituts und Instituts fur Angewandte Mathematik der Universitat Berlin, 5, 181–233.
Hofert, M. , Kojadinovic, I. , Maechler, M. , & Jun, Y. (2023). copula: Multivariate dependence with copulas [Computer software manual]. R package version 1.1‐2. https://CRAN.R‐project.org/package=copula/
Ivanova, A. , & Berger, V. W. (2001). Drawbacks to integer scoring for ordered categorical data. Biometrics, 57(2), 567–570.
Kadhem, S. H. , & Nikoloulopoulos, A. K. (2021). Factor copula models for mixed data. British Journal of Mathematical and Statistical Psychology, 74(3), 365–403.
Kennan, J. (2006). A note on discrete approximations of continuous distributions . University of Wisconsin‐Madison.
Lee, S.‐Y. , & Poon, W.‐Y. (1986). Maximum likelihood estimation of polyserial correlations. Psychometrika, 51(1), 113–121.
Lilliefors, H. W. (1967). On the Kolmogorov‐Smirnov test for normality with mean and variance unknown. Journal of the American Statistical Association, 62(318), 399–402.
Lloyd, S. (1982). Least squares quantization in PCM. IEEE Transactions on Information Theory, 28(2), 129–137.
MacCallum, R. C. , Zhang, S. , Preacher, K. J. , & Rucker, D. D. (2002). On the practice of dichotomization of quantitative variables. Psychological Methods, 7(1), 19.
Martin, A. D. , Quinn, K. M. , & Park, J. H. (2011). MCMCpack: Markov Chain Monte Carlo in R. Journal of Statistical Software, 42(9), 22. https://doi.org/10.18637/jss.v042.i09
McNeil, A. J. , Frey, R. , & Embrechts, P. (2015). Quantitative risk management: concepts, techniques and tools. Princeton University Press.
Narasimhan, B. , Johnson, S. G. , Hahn, T. , Bouvier, A. , & Kiêu, K. (2023). cubature: Adaptive multivariate integration over hypercubes [Computer software manual]. R package version 2.1.0. https://bnaras.github.io/cubature/
Norman, G. (2010). Likert scales, levels of measurement and the “laws” of statistics. Advances in Health Sciences Education, 15, 625–632.
Novack‐Gottshall, P. , & Wang, S. C. (2019). KScorrect: Lilliefors‐corrected Kolmogorov‐Smirnov goodness‐of‐fit tests [Computer software manual]. R package version 1.4.0. https://CRAN.R‐project.org/package=KScorrect
Olsson, U. , Drasgow, F. , & Dorans, N. J. (1982). The polyserial correlation coefficient. Psychometrika, 47, 337–347.
Pearson, K. (1909). On a new method of determining correlation between a measured character A, and a character B, of which only the percentage of cases wherein B exceeds (or falls short of) a given intensity is recorded for each grade of A. Biometrika, 7(1/2), 96–105.
Quinn, K. M. (2004). Bayesian factor analysis for mixed ordinal and continuous responses. Political Analysis, 12(4), 338–353.
Toda, A. A. (2021). Data‐based automatic discretization of nonparametric distributions. Computational Economics, 57(4), 1217–1235.
Ye, Y. (1987). Interior algorithms for linear, quadratic, and linearly constrained non‐linear programming (Unpublished doctoral dissertation). Department of Engineering‐Economic Systems, Stanford University.
Zhang, P. , Liu, B. , & Pan, J. (2024). Iteratively reweighted least squares method for estimating polyserial and polychoric correlation coefficients. Journal of Computational and Graphical Statistics, 33(1), 316–328.
Zoppè, A. (1995). Principal points of univariate continuous distributions. Statistics and Computing, 5, 127–132.

Auteurs

Alessandro Barbiero (A)

Department of Economics, Management and Quantitative Methods, Università degli Studi di Milano, Milan, Italy.

Classifications MeSH