State-Temporal Compression in Reinforcement Learning With the Reward-Restricted Geodesic Metric

It is difficult to solve complex tasks that involve large state spaces and long-term decision processes by reinforcement learning (RL) algorithms. A common and promising method to address this challenge is to compress a large RL problem into a small one. Towards this goal, the compression should be...

Ausführliche Beschreibung

Bibliographische Detailangaben
Veröffentlicht in:IEEE transactions on pattern analysis and machine intelligence. - 1979. - 44(2022), 9 vom: 20. Sept., Seite 5572-5589
1. Verfasser: Guo, Shangqi (VerfasserIn)
Weitere Verfasser: Yan, Qi, Su, Xin, Hu, Xiaolin, Chen, Feng
Format: Online-Aufsatz
Sprache:English
Veröffentlicht: 2022
Zugriff auf das übergeordnete Werk:IEEE transactions on pattern analysis and machine intelligence
Schlagworte:Journal Article