Incremental state aggregation for value function estimation in reinforcement learning

In reinforcement learning, large state and action spaces make the estimation of value functions impractical, so a value function is often represented as a linear combination of basis functions whose linear coefficients constitute parameters to be estimated. However, preparing basis functions require...
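As a rough illustration of the representation the abstract describes, the sketch below writes a value function as a linear combination of basis functions, V(s) = w · phi(s), and estimates the coefficients w with the standard TD(0) update. It is not the authors' incremental method: the basis here is a fixed state aggregation (one indicator function per group of states), and the random-walk task, the function names `aggregate_features` and `td0_linear`, and all parameter values are assumptions made for the example.

```python
import numpy as np


def aggregate_features(state, n_states, n_groups):
    """Indicator basis: phi[g] = 1 for the group containing `state`, else 0."""
    phi = np.zeros(n_groups)
    phi[state * n_groups // n_states] = 1.0
    return phi


def td0_linear(n_states=20, n_groups=5, episodes=500,
               alpha=0.1, gamma=1.0, seed=0):
    """Estimate w in V(s) = w @ phi(s) with TD(0) on a hypothetical random walk.

    The walk starts in the middle state, moves left or right with equal
    probability, and ends when it leaves the chain. Leaving on the right
    gives reward 1; every other transition gives reward 0.
    """
    rng = np.random.default_rng(seed)
    w = np.zeros(n_groups)  # linear coefficients to be estimated
    for _ in range(episodes):
        s = n_states // 2
        while True:
            step = 2 * int(rng.integers(0, 2)) - 1  # -1 or +1
            s_next = s + step
            done = s_next < 0 or s_next >= n_states
            reward = 1.0 if s_next >= n_states else 0.0
            phi = aggregate_features(s, n_states, n_groups)
            v_next = 0.0 if done else float(
                w @ aggregate_features(s_next, n_states, n_groups))
            # TD(0): move w along phi by the temporal-difference error.
            td_error = reward + gamma * v_next - float(w @ phi)
            w += alpha * td_error * phi
            if done:
                break
            s = s_next
    return w


if __name__ == "__main__":
    print(td0_linear())
```

With 20 states aggregated into 5 groups, only 5 coefficients are estimated instead of 20 separate state values. The grouping is chosen by hand in this sketch, which is the kind of advance preparation of basis functions that the abstract points to as a difficulty.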


Bibliographic details
Published in:	IEEE transactions on systems, man, and cybernetics. Part B, Cybernetics : a publication of the IEEE Systems, Man, and Cybernetics Society. - 1996. - 41(2011), 5, 1 Oct., pages 1407-16
Main author:	Mori, Takeshi (Author)
Other authors:	Ishii, Shin
Format:	Online article
Language:	English
Published:	2011
Access to the parent work:	IEEE transactions on systems, man, and cybernetics. Part B, Cybernetics : a publication of the IEEE Systems, Man, and Cybernetics Society
Subjects:	Journal Article; Research Support, Non-U.S. Gov't

Available online	Full text