Accretive Operators and Markov Decision Processes
The dynamic programming functional equation for an abstract, continuous parameter, Markov decision process is shown to involve an operator which is m-accretive, thus giving rise to a nonlinear semigroup, called the Bellman semigroup. A class of controls is specified for which the maximum expected re...
Veröffentlicht in: | Mathematics of Operations Research. - Institute for Operations Research and the Management Sciences. - 5(1980), 3, Seite 444-459 |
---|---|
1. Verfasser: | |
Format: | Online-Aufsatz |
Sprache: | English |
Veröffentlicht: |
1980
|
Zugriff auf das übergeordnete Werk: | Mathematics of Operations Research |
Schlagworte: | Dynamic programming Markov decision theory Bellman equation Accretive operators Stochastic control Nonlinear semigroups Philosophy Mathematics Applied sciences Behavioral sciences |
Zusammenfassung: | The dynamic programming functional equation for an abstract, continuous parameter, Markov decision process is shown to involve an operator which is m-accretive, thus giving rise to a nonlinear semigroup, called the Bellman semigroup. A class of controls is specified for which the maximum expected reward over a finite planning horizon is given by this semigroup. This theory is applied to controlled jump, diffusion, and storage processes. |
---|---|
ISSN: | 15265471 |