Misinterpretations of Significance Testing Results Near the P-Value Threshold in the Urologic Literature.
data
p-value
statistical errors
statistics
urology
Journal
Cureus
ISSN: 2168-8184
Titre abrégé: Cureus
Pays: United States
ID NLM: 101596737
Informations de publication
Date de publication:
Jul 2023
Jul 2023
Historique:
received:
08
04
2023
accepted:
07
07
2023
medline:
10
8
2023
pubmed:
10
8
2023
entrez:
10
8
2023
Statut:
epublish
Résumé
Background The outcome of a statistical test is to accept or reject a null hypothesis. Reporting a metric as "trending toward significance" is a misinterpretation of the p-value. Studies highlighting the prevalence of statistical errors in the urologic literature remain scarce. We evaluated abstracts from 15 urology journals published within the years 2000-2021 and provided a quantitative measure of a common statistical mistake-misconstruing the function of null hypothesis testing by reporting "a trend toward significance." Materials and methods We performed an audit of 15 urology journals, looking at articles published from January 1, 2000, to January 1, 2022. A word recognition function in Microsoft Excel was utilized to identify the use of the word "trend" in the abstracts. Each use of the word "trend" was manually investigated by two authors to determine whether it was improperly used in describing non-statistically significant data as trending toward significance. Statistics and data analysis were performed using Python libraries: pandas, scipy.stats, and seaborn. Results This study included 101,134 abstracts from 15 urology journals. Within those abstracts, the word "trend" was used 2,509 times, 572 uses of which were describing non-statistically significant data as trending toward significance. There was a statistically significant difference in the rate of errors between the 15 journals (p < 0.01). The highest rate of improper use of the word "trend" was found in Bladder Cancer with a rate of 1.6% (p < 0.01) of articles. The lowest rate of improper use was found in European Urology, with a rate of 0.3% (p < 0.01). Our analysis found a moderate correlation between the number of articles published and the number of misuses of the word "trend" within each journal and across all journals every year (r = 0.61 and 0.70, respectively). Conclusion The overall rate of p-value misinterpretation never exceeded 2% of articles in each journal. There is significance in the difference in misinterpretation rates between the different journals. Authors' utilization of the word "trend" describing non-significant p-values as being near significant should be used with caution.
Identifiants
pubmed: 37559843
doi: 10.7759/cureus.41556
pmc: PMC10407971
doi:
Types de publication
Journal Article
Langues
eng
Pagination
e41556Informations de copyright
Copyright © 2023, Manda et al.
Déclaration de conflit d'intérêts
The authors have declared that no competing interests exist.
Références
J Urol. 2006 Jul;176(1):263-6
pubmed: 16753417
J Urol. 2005 Oct;174(4 Pt 1):1374-9
pubmed: 16145441
J Infect Dis. 1984 Mar;149(3):349-54
pubmed: 6715895
Arthroscopy. 2021 Apr;37(4):1057-1063
pubmed: 33812509
BMJ. 2014 Mar 31;348:g2215
pubmed: 24687314
J Urol. 2009 Oct;182(4 Suppl):1906-10
pubmed: 19695587
Am J Obstet Gynecol. 1996 Nov;175(5):1138-41
pubmed: 8942478
Br J Anaesth. 2015 Sep;115(3):337-9
pubmed: 26025239
World Neurosurg. 2022 Mar;159:e192-e198
pubmed: 34915206
JAMA Oncol. 2018 Dec 1;4(12):1778-1779
pubmed: 30347042
Diagn Interv Radiol. 2019 Mar;25(2):102-108
pubmed: 30582574
Eur Urol. 2019 Mar;75(3):358-367
pubmed: 30580902