Optimal assignment of sellers in a store with a random number of clients via the Armed Bandit model

Víctor Hugo Vázquez-Guevara; Hugo Cruz−Suárez; Fernando Velasco-Luna

doi:10.1051/ro/2017015

All issues

Volume 51 / No 4 (October-December 2017)

RAIRO-Oper. Res., 51 4 (2017) 1119-1132

Abstract

Issue		RAIRO-Oper. Res. Volume 51, Number 4, October-December 2017


Page(s)		1119 - 1132
DOI		https://doi.org/10.1051/ro/2017015
Published online		24 November 2017

RAIRO-Oper. Res. 51 (2017) 1119-1132

Optimal assignment of sellers in a store with a random number of clients via the Armed Bandit model^∗

Víctor Hugo Vázquez-Guevara, Hugo Cruz−Suárez and Fernando Velasco-Luna

Facultad de Ciencias Físico Matemáticas, Benemérita Universidad Autónoma de Puebla, San Claudio y 18 sur. San Manuel, 72570, Puebla, Mexico.
vvazquez@fcfm.buap.mx

Received: 12 November 2015
Accepted: 2 March 2017

Abstract

The technique of Dynamic Programming for Armed Bandits is employed for solving the problem of maximizing the randomly depreciated gains of a store with unknown (finite random) number of clients with fixed (finite) number of sellers which skills are also random and will be represented as probability distributions which are themselves random. Hence, Armed Bandits’s framework will be considered with horizon being a random variable with a finite support, that far as the authors know, it has not yet been discussed. In addition, numerical examples are detailed in order to illustrate the versatility and practical implementation of the approach presented in this paper in two general contexts, given by the number of available products: one product only, such situation coincides with that in which the number of sales needs to be maximized. And, more than one product, in this case, the amount of sales is not necessarily ruled by a Bernoulli distribution.

Mathematics Subject Classification: 49L20 / 90C40 / 93E20

Key words: Armed bandit model / dynamic programming / assignment of personal / random horizon / markov decision processes

^∗

This work was partially supported by VIEP-BUAP, via the project: “Estimación de momentos de orden par del ruido en procesos ARX con ruido correlacionado”.

© EDP Sciences, ROADEF, SMAI 2017

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.

Purchase access: €35

Unlimited access to the full article
Instant PDF download

Homepage

Table of Contents

Previous article Next article

Article contents

Metrics

Show article metrics

Services

Articles citing this article
CrossRef (1)
Same authors
- Google Scholar
- EDP Sciences database

Recommend this article
Download citation

Optimal assignment of sellers in a store with a random number of clients via the Armed Bandit model∗

Optimal assignment of sellers in a store with a random number of clients via the Armed Bandit model^∗