Data Distribution: Normal or Abnormal?
Biostatistics
Data Analysis
Epidemiologic Methods
Normal Distribution
Statistical Distributions
Journal
Journal of Korean medical science
ISSN: 1598-6357
Titre abrégé: J Korean Med Sci
Pays: Korea (South)
ID NLM: 8703518
Informations de publication
Date de publication:
22 Jan 2024
22 Jan 2024
Historique:
received:
26
11
2023
accepted:
21
12
2023
medline:
23
1
2024
pubmed:
23
1
2024
entrez:
23
1
2024
Statut:
epublish
Résumé
Determining if the frequency distribution of a given data set follows a normal distribution or not is among the first steps of data analysis. Visual examination of the data, commonly by Q-Q plot, although is acceptable by many scientists, is considered subjective and not acceptable by other researchers. One-sample Kolmogorov-Smirnov test with Lilliefors correction (for a sample size ≥ 50) and Shapiro-Wilk test (for a sample size < 50) are common statistical tests for checking the normality of a data set quantitatively. As parametric tests, which assume that the data distribution is normal (Gaussian, bell-shaped), are more robust compared to their non-parametric counterparts, we commonly use transformations (e.g., log-transformation, Box-Cox transformation, etc.) to make the frequency distribution of non-normally distributed data close to a normal distribution. Herein, I wish to reflect on presenting how to practically work with these statistical methods through examining of real data sets.
Identifiants
pubmed: 38258367
pii: 39.e35
doi: 10.3346/jkms.2024.39.e35
doi:
Types de publication
Journal Article
Review
Langues
eng
Sous-ensembles de citation
IM
Pagination
e35Informations de copyright
© 2024 The Korean Academy of Medical Sciences.
Déclaration de conflit d'intérêts
The author has no potential conflicts of interest to disclose.