Effect-Invariant Mechanisms for Policy Generalization.

causality distribution generalization domain adaptation invariance policy learning

Journal

Journal of machine learning research : JMLR
ISSN: 1532-4435
Titre abrégé: J Mach Learn Res
Pays: United States
ID NLM: 101262635

Informations de publication

Date de publication:
2024
Historique:
pmc-release: 01 01 2025
medline: 31 7 2024
pubmed: 31 7 2024
entrez: 31 7 2024
Statut: ppublish

Résumé

Policy learning is an important component of many real-world learning systems. A major challenge in policy learning is how to adapt efficiently to unseen environments or tasks. Recently, it has been suggested to exploit invariant conditional distributions to learn models that generalize better to unseen environments. However, assuming invariance of entire conditional distributions (which we call full invariance) may be too strong of an assumption in practice. In this paper, we introduce a relaxation of full invariance called effect-invariance (e-invariance for short) and prove that it is sufficient, under suitable assumptions, for zero-shot policy generalization. We also discuss an extension that exploits e-invariance when we have a small sample from the test environment, enabling few-shot policy generalization. Our work does not assume an underlying causal graph or that the data are generated by a structural causal model; instead, we develop testing procedures to test e-invariance directly from data. We present empirical results using simulated data and a mobile health intervention dataset to demonstrate the effectiveness of our approach.

Identifiants

pubmed: 39082006
pmc: PMC11286230
pii:

Types de publication

Journal Article

Langues

eng

Auteurs

Sorawit Saengkyongam (S)

Seminar for Statistics, ETH Zürich, Zürich, Switzerland.

Niklas Pfister (N)

Department of Mathematical Sciences, University of Copenhagena, Copenhagen, Denmark.

Predrag Klasnja (P)

School of Information, University of Michigan, Ann Arbor, MI, USA.

Susan Murphy (S)

Department of Statistics, Department of Computer Science, Harvard University, Cambridge, MA, USA.

Jonas Peters (J)

Seminar for Statistics, ETH Zürich, Zürich, Switzerland.

Classifications MeSH