UPM R&D&I Observatory (Observatorio de I+D+i UPM)

Research reports

Conference papers:

A Unified Framework for Linear Function Approximation of Value Functions in Stochastic Control

Year: 2013

Research areas

Electronic and communications technology

Details

Description
This paper contributes a unified formulation that merges previous analyses of the prediction of the performance (value function) of a certain sequence of actions (policy) when an agent operates a Markov decision process with a large state space. When the states are represented by features and the value function is linearly approximated, our analysis reveals a new relationship between two common cost functions used to obtain the optimal approximation. In addition, this analysis allows us to propose an efficient adaptive algorithm that provides an unbiased linear estimate. The performance of the proposed algorithm is illustrated by simulation, showing competitive results when compared with state-of-the-art solutions.
International	Yes
Conference name	EUSIPCO, Signal Processing Conference
Participation type	960
Conference location	Morocco
Reviewers	Yes
ISBN or ISSN	2219-5491
DOI
Conference start date	09/09/2013
Conference end date	13/09/2013
First page	1
Last page	5
Proceedings title	Proceedings of EUSIPCO

View publication in the UPM Digital Archive (Archivo Digital UPM)

This activity belongs to the research reports

Participants

Author: Santiago Zazo Bello (UPM)

Related research groups, departments, centers, and R&D&I institutes

Creator: Research group: Grupo de Aplicaciones del Procesado de Señal (GAPS)
Department: Señales, Sistemas y Radiocomunicaciones