Interactive Reweighting for Mitigating Label Quality Issues.
Journal
IEEE transactions on visualization and computer graphics
ISSN: 1941-0506
Titre abrégé: IEEE Trans Vis Comput Graph
Pays: United States
ID NLM: 9891704
Informations de publication
Date de publication:
21 Dec 2023
21 Dec 2023
Historique:
pubmed:
21
12
2023
medline:
21
12
2023
entrez:
21
12
2023
Statut:
aheadofprint
Résumé
Label quality issues, such as noisy labels and imbalanced class distributions, have negative effects on model performance. Automatic reweighting methods identify problematic samples with label quality issues by recognizing their negative effects on validation samples and assigning lower weights to them. However, these methods fail to achieve satisfactory performance when the validation samples are of low quality. To tackle this, we develop Reweighter, a visual analysis tool for sample reweighting. The reweighting relationships between validation samples and training samples are modeled as a bipartite graph. Based on this graph, a validation sample improvement method is developed to improve the quality of validation samples. Since the automatic improvement may not always be perfect, a co-cluster-based bipartite graph visualization is developed to illustrate the reweighting relationships and support the interactive adjustments to validation samples and reweighting results. The adjustments are converted into the constraints of the validation sample improvement method to further improve validation samples. We demonstrate the effectiveness of Reweighter in improving reweighting results through quantitative evaluation and two case studies.
Identifiants
pubmed: 38127601
doi: 10.1109/TVCG.2023.3345340
doi:
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM