State-Temporal Compression in Reinforcement Learning With the Reward-Restricted Geodesic Metric

It is difficult to solve complex tasks that involve large state spaces and long-term decision processes by reinforcement learning (RL) algorithms. A common and promising method to address this challenge is to compress a large RL problem into a small one. Towards this goal, the compression should be...

Ausführliche Beschreibung

Bibliographische Detailangaben
Veröffentlicht in:	IEEE transactions on pattern analysis and machine intelligence. - 1979. - 44(2022), 9 vom: 20. Sept., Seite 5572-5589
1. Verfasser:	Guo, Shangqi (VerfasserIn)
Weitere Verfasser:	Yan, Qi, Su, Xin, Hu, Xiaolin, Chen, Feng
Format:	Online-Aufsatz
Sprache:	English
Veröffentlicht:	2022
Zugriff auf das übergeordnete Werk:	IEEE transactions on pattern analysis and machine intelligence
Schlagworte:	Journal Article

Online verfügbar	Volltext