Observatorio de I+D+i UPM

| Otras actividades
HOME

Proyectos Internacionales Art�culos Patentes UPM Software UPM Empresas UPM Otras actividades Memorias de investigaci�n

Memorias de investigación

Art�culos en revistas:

A Deep Reinforcement Learning Strategy for UAV Autonomous Landing on a Moving Platform

A�o:2019

�reas de investigaci�n

Inteligencia artificial (redes neuronales, l�gica borrosa, sistemas expertos, etc),
Robots a�reos,
Visi�n por computador

Datos

Descripci�n
The use of multi-rotor UAVs in industrial and civil applications has been extensively encouraged by the rapid innovation in all the technologies involved. In particular, deep learning techniques for motion control have recently taken a major qualitative step, since the successful application of Deep Q-Learning to the continuous action domain in Atari-like games. Based on these ideas, Deep Deterministic Policy Gradients (DDPG) algorithm was able to provide outstanding results with continuous state and action domains, which are a requirement in most of the robotics-related tasks. In this context, the research community is lacking the integration of realistic simulation systems with the reinforcement learning paradigm, enabling the application of deep reinforcement learning algorithms to the robotics field. In this paper, a versatile Gazebo-based reinforcement learning framework has been designed and validated with a continuous UAV landing task. The UAV landing maneuver on a moving platform has been solved by means of the novel DDPG algorithm, which has been integrated in our reinforcement learning framework. Several experiments have been performed in a wide variety of conditions for both simulated and real flights, demonstrating the generality of the approach. As an indirect result, a powerful work flow for robotics has been validated, where robots can learn in simulation and perform properly in real operation environments. To the best of the authors knowledge, this is the first work that addresses the continuous UAV landing maneuver on a moving platform by means of a state-of-the-art deep reinforcement learning algorithm, trained in simulation and tested in real flights.
Internacional	Si
JCR del ISI	Si
T�tulo de la revista	Journal of Intelligent & Robotic Systems
ISSN	0921-0296
Factor de impacto JCR	2,02
Informaci�n de impacto	Datos JCR del a�o 2018
Volumen	93
DOI	10.1007/s10846-018-0891-8
N�mero de revista	1-2
Desde la p�gina	351
Hasta la p�gina	366
Mes	SIN MES
Ranking

Ver publicaci�n en Archivo digital upm

Esta actividad pertenece a memorias de investigaci�n

Participantes

Autor: Alejandro Rodriguez Ramos UPM
Autor: Carlos Sampedro Perez UPM
Autor: Hriday Bavle Milind UPM
Autor: Paloma de la Puente Yusty UPM
Autor: Pascual Campoy Cervera UPM

Grupos de investigaci�n, Departamentos, Centros e Institutos de I+D+i relacionados

Creador: Grupo de Investigaci�n: Control Inteligente
Centro o Instituto I+D+i: Centro de Autom�tica y Rob�tica (CAR). Centro Mixto UPM-CSIC
Departamento: Autom�tica, Ingenier�a El�ctrica y Electr�nica e Inform�tica Industrial