Observatorio de I+D+i UPM

| Otras actividades
HOME

Proyectos Internacionales Art�culos Patentes UPM Software UPM Empresas UPM Otras actividades Memorias de investigaci�n

Memorias de investigación

Ponencias en congresos:

A Possibilistic Reward Method for the Multi-Armed Bandit Problem

A�o:2017

�reas de investigaci�n

Sistemas estoc�sticos y control,
Comunicaci�n, informaci�n,
Procesos estoc�sticos

Datos

Descripci�n
Different allocation strategies can be found in the literature to deal with the multi-armed bandit problem under a frequentist view or from a Bayesian perspective. In this paper, we propose a novel allocation strategy, the possibilistic reward method. First, possibilistic reward distributions are used to model the uncertainty about the arm expected rewards, which are then converted into probability distributions using a pignistic probability transformation. Finally, a simulation experiment is carried out to find out the one with the highest expected reward, which is then pulled. A parametric probability transformation of the proposed is then introduced together with a dynamic optimization, which implies that neither previous knowledge nor a simulation of the arm distributions is required. A numerical study proves that the proposed method outperforms other policies in the literature in five scenarios: a Bernoulli distribution with very low success probabilities, with success probabilities close to 0.5 and with success probabilities close to 0.5 and Gaussian rewards; and truncated in [0,10] Poisson and exponential distributions.
Internacional	Si
Nombre congreso	6th International Conference on Operations Research and Enterprise Systems
Tipo de participaci�n	960
Lugar del congreso	Oporto, Portugal
Revisores	Si
ISBN o ISSN	978-989-758-218-9
DOI
Fecha inicio congreso	23/02/2017
Fecha fin congreso	25/02/2017
Desde la p�gina	75
Hasta la p�gina	84
T�tulo de las actas	Proceedings of the 6th International Conference on Operations Research and Enterprise Systems

Esta actividad pertenece a memorias de investigaci�n

Participantes

Autor: Miguel Mart�n Blanco
Autor: Antonio Jimenez Martin UPM
Autor: Alfonso Mateos Caballero UPM

Grupos de investigaci�n, Departamentos, Centros e Institutos de I+D+i relacionados

Creador: Grupo de Investigaci�n: Grupo de an�lisis de decisiones y estad�stica
Departamento: Inteligencia Artificial