Beware of Validation by Eye: Visual Validation of Linear Trends in Scatterplots.
Journal
IEEE transactions on visualization and computer graphics
ISSN: 1941-0506
Titre abrégé: IEEE Trans Vis Comput Graph
Pays: United States
ID NLM: 9891704
Informations de publication
Date de publication:
10 Sep 2024
10 Sep 2024
Historique:
medline:
10
9
2024
pubmed:
10
9
2024
entrez:
10
9
2024
Statut:
aheadofprint
Résumé
Visual validation of regression models in scatterplots is a common practice for assessing model quality, yet its efficacy remains unquantified. We conducted two empirical experiments to investigate individuals' ability to visually validate linear regression models (linear trends) and to examine the impact of common visualization designs on validation quality. The first experiment showed that the level of accuracy for visual estimation of slope (i.e., fitting a line to data) is higher than for visual validation of slope (i.e., accepting a shown line). Notably, we found bias toward slopes that are "too steep" in both cases. This lead to novel insights that participants naturally assessed regression with orthogonal distances between the points and the line (i.e., ODR regression) rather than the common vertical distances (OLS regression). In the second experiment, we investigated whether incorporating common designs for regression visualization (error lines, bounding boxes, and confidence intervals) would improve visual validation. Even though error lines reduced validation bias, results failed to show the desired improvements in accuracy for any design. Overall, our findings suggest caution in using visual model validation for linear trends in scatterplots.
Identifiants
pubmed: 39255144
doi: 10.1109/TVCG.2024.3456305
doi:
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM