Invariant Policy Learning: A Causal Perspective

Contextual bandit and reinforcement learning algorithms have been successfully used in various interactive learning systems such as online advertising, recommender systems, and dynamic pricing. However, they have yet to be widely adopted in high-stakes application domains, such as healthcare. One re...

Bibliographic Details
Published in:	IEEE transactions on pattern analysis and machine intelligence. - 1979. - 45(2023), 7, dated 3 July, pages 8606-8620
Main Author:	Saengkyongam, Sorawit (Author)
Other Authors:	Thams, Nikolaj; Peters, Jonas; Pfister, Niklas
Format:	Online article
Language:	English
Published:	2023
Access to the parent work:	IEEE transactions on pattern analysis and machine intelligence
Subjects:	Journal Article

Available online	Full text