Restless Bandit Marginal Productivity Indices, Diminishing Returns, and Optimal Control of Make-to-Order/Make-to-Stock M / G / 1 Queues

This paper presents a framework grounded on convex optimization and economics ideas to solve by index policies problems of optimal dynamic allocation of effort to a discrete-state (finite or countable) binary-action (work/rest) semi-Markov restless bandit project, elucidating issues raised by previo...

Ausführliche Beschreibung

Bibliographische Detailangaben
Veröffentlicht in:Mathematics of Operations Research. - Institute for Operations Research and the Management Sciences. - 31(2006), 1, Seite 50-84
1. Verfasser: Niño-Mora, José (VerfasserIn)
Format: Online-Aufsatz
Sprache:English
Veröffentlicht: 2006
Zugriff auf das übergeordnete Werk:Mathematics of Operations Research
Schlagworte:Restless bandits Stochastic scheduling Index policies Indexability Control by price Semi-Markov decision Processes Dynamic resource allocation Diminishing returns Marginal productivity mehr... Efficient frontier Convex optimization Bias Mixed criteria Make to order Make to stock Control of queues Production-inventory control Partial conservation laws Achievable performance Primary 90B36 Primary 90C40 Secondary 90B05 Secondary 90B22 Secondary 90B30 Secondary 90C25 Primary: dynamic programming/optimal control: semi Markov Secondary: queues: optimization Secondary: inventory/production: policies Business Mathematics Applied sciences Economics
Beschreibung
Zusammenfassung:This paper presents a framework grounded on convex optimization and economics ideas to solve by index policies problems of optimal dynamic allocation of effort to a discrete-state (finite or countable) binary-action (work/rest) semi-Markov restless bandit project, elucidating issues raised by previous work. Its contributions include: (i) the concept of a restless bandit's marginal productivity index (MPI), characterizing optimal policies relative to general cost and work measures; (ii) the characterization of indexable restless bandits as those satisfying diminishing marginal returns to work, consistently with a nested family of threshold policies; (iii) sufficient indexability conditions via partial conservation laws (PCLs); (iv) the characterization of the MPI as an optimal marginal productivity rate relative to feasible active-state sets; (v) application to semi-Markov bandits under several criteria, including a new mixed average-bias criterion; and (vi) PCL-indexability analyses and MPIs for optimal service control of make-to-order/make-to-stock queues with convex holding costs, under discounted and average-bias criteria.
ISSN:15265471