An Information-Theoretic Method to Automatic Shortcut Avoidance and Domain Generalization for Dense Prediction Tasks.

Journal

IEEE transactions on pattern analysis and machine intelligence

ISSN: 1939-3539

Abbreviated title: IEEE Trans Pattern Anal Mach Intell

Country: United States

NLM ID: 9885960

Publication information

Publication date:
Sep 2023

History:

medline: 20 Apr 2023

pubmed: 20 Apr 2023

entrez: 20 Apr 2023

Status: ppublish

Abstract

Deep convolutional neural networks for dense prediction tasks are commonly optimized using synthetic data, as generating pixel-wise annotations for real-world data is laborious. However, synthetically trained models do not generalize well to real-world environments. We address this poor "synthetic to real" (S2R) generalization through the lens of shortcut learning. We demonstrate that the learning of feature representations in deep convolutional networks is heavily influenced by synthetic data artifacts (shortcut attributes). To mitigate this issue, we propose an Information-Theoretic Shortcut Avoidance (ITSA) approach that automatically restricts shortcut-related information from being encoded into the feature representations. Specifically, our proposed method minimizes the sensitivity of latent features to input variations in order to regularize the learning of robust and shortcut-invariant features in synthetically trained models. To avoid the prohibitive computational cost of direct input sensitivity optimization, we propose a practical and feasible algorithm to achieve robustness. Our results show that the proposed method can effectively improve S2R generalization in multiple distinct dense prediction tasks, such as stereo matching, optical flow, and semantic segmentation. Importantly, the proposed method enhances the robustness of synthetically trained networks, which outperform their counterparts fine-tuned on real data in challenging out-of-domain applications.
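
The idea of penalizing the sensitivity of latent features to input variations can be illustrated with a small sketch. This is not the authors' ITSA algorithm, which the abstract does not specify. It is a generic illustration under stated assumptions: a toy linear encoder stands in for a convolutional network, the input variations are small random Gaussian perturbations, and the names `encode`, `sensitivity_penalty`, `regularized_loss`, `epsilon`, and `lam` are all hypothetical.

```python
import random


def encode(weights, x):
    """Toy linear encoder (hypothetical stand-in for a CNN feature extractor).

    Returns one latent feature per row of `weights`.
    """
    return [sum(w * xi for w, xi in zip(row, x)) for row in weights]


def sensitivity_penalty(weights, x, epsilon=1e-2, n_samples=8, seed=0):
    """Mean squared change of the latent features under small input perturbations.

    Assumption: perturbations are i.i.d. Gaussian noise scaled by `epsilon`.
    The abstract only says "input variations"; it does not say how they are chosen.
    """
    rng = random.Random(seed)
    base = encode(weights, x)
    total = 0.0
    for _ in range(n_samples):
        perturbed = [xi + epsilon * rng.gauss(0.0, 1.0) for xi in x]
        shifted = encode(weights, perturbed)
        total += sum((a - b) ** 2 for a, b in zip(shifted, base))
    return total / n_samples


def regularized_loss(task_loss, weights, x, lam=0.1):
    """Task loss plus a weighted sensitivity penalty (`lam` is an assumed trade-off weight)."""
    return task_loss + lam * sensitivity_penalty(weights, x)


if __name__ == "__main__":
    x = [1.0, 2.0]
    small = [[1.0, 0.5]]
    large = [[2.0, 1.0]]
    # An encoder with larger weights reacts more strongly to the same perturbations.
    print(sensitivity_penalty(small, x) < sensitivity_penalty(large, x))
```

In this toy setting, an encoder whose features change less under the same perturbations receives a smaller penalty, so minimizing the combined loss favors features that are less sensitive to small input changes.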

Identifiers

DOI: 10.1109/TPAMI.2023.3268640 PMID: 37079402

Publication types

Journal Article

Languages

eng

Pagination

10615-10631

Authors

WeiQin Chuah (W)

Ruwan Tennakoon (R)

Reza Hoseinnezhad (R)

David Suter (D)

Alireza Bab-Hadiashar (A)

MeSH classifications