Reinforcement learning control of robot manipulator
Date
2021
Is part of
Revista Brasileira de Computação Aplicada
Abstract
Since the establishment of robotics in industrial applications, industrial robot programming involves the repetitive and
time-consuming process of manually specifying a fixed trajectory, resulting in machine idle time in production and the
necessity of completely reprogramming the robot for different tasks. The increasing number of robotics applications
in unstructured environments requires not only intelligent but also reactive controllers due to the unpredictability
of the environment and safety measures, respectively. This paper presents a comparative analysis of two classes of
Reinforcement Learning algorithms, value iteration (Q-Learning/DQN) and policy iteration (REINFORCE), applied to
the discretized task of positioning a robotic manipulator in an obstacle-filled simulated environment, with no previous
knowledge of the obstacles’ positions or of the robot arm dynamics. The agent’s performance and algorithm convergence
are analyzed under different reward functions and on four increasingly complex test projects: 1-Degree of Freedom
(DOF) robot, 2-DOF robot, Kuka KR16 Industrial robot, Kuka KR16 Industrial robot with random setpoint/obstacle
placement. The DQN algorithm presented significantly better performance and reduced training time across all test
projects, and the third reward function generated better agents for both algorithms.
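As a rough illustration of the value-based approach on the simplest test project (the 1-DOF robot), the sketch below runs tabular Q-Learning on a discretized joint angle. It is not the authors' implementation: the number of cells, the target and obstacle indices, the start cell, the reward values, and the hyperparameters are all hypothetical choices made only for this example, and neither DQN nor REINFORCE is shown.

```python
import random

# All constants below are hypothetical, chosen only for illustration.
N_STATES = 12             # joint angle discretized into 12 cells
ACTIONS = (-1, +1)        # rotate one cell in either direction
TARGET, OBSTACLE = 9, 3   # cell indices; the agent is not told these
START = 6                 # fixed start cell between obstacle and target
ALPHA, GAMMA, EPSILON = 0.5, 0.95, 0.1


def step(state, action):
    """Toy environment: returns (next_state, reward, done)."""
    nxt = min(max(state + action, 0), N_STATES - 1)  # clamp to joint limits
    if nxt == TARGET:
        return nxt, 1.0, True      # reached the setpoint
    if nxt == OBSTACLE:
        return nxt, -1.0, True     # collision ends the episode
    return nxt, -0.01, False       # small cost per move


def train(episodes=500, max_steps=50, seed=0):
    """Tabular Q-Learning with an epsilon-greedy behaviour policy."""
    rng = random.Random(seed)
    q = [[0.0 for _ in ACTIONS] for _ in range(N_STATES)]
    for _ in range(episodes):
        state = START
        for _ in range(max_steps):
            if rng.random() < EPSILON:
                a = rng.randrange(len(ACTIONS))          # explore
            else:
                a = max(range(len(ACTIONS)), key=lambda i: q[state][i])
            nxt, reward, done = step(state, ACTIONS[a])
            # Bootstrapped target: no future value after a terminal step.
            td_target = reward if done else reward + GAMMA * max(q[nxt])
            q[state][a] += ALPHA * (td_target - q[state][a])
            state = nxt
            if done:
                break
    return q


def greedy_path(q, start=START, max_steps=20):
    """Follow the learned greedy policy and return the visited cells."""
    path, state = [start], start
    for _ in range(max_steps):
        a = max(range(len(ACTIONS)), key=lambda i: q[state][i])
        state, _, done = step(state, ACTIONS[a])
        path.append(state)
        if done:
            break
    return path
```

Calling `greedy_path(train())` returns the sequence of cells visited by the learned policy, which makes it easy to check whether the agent reaches the target cell without entering the obstacle cell. Changing the reward values in `step` is one simple way to see how reward design affects the learned behaviour in this toy setting.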
How to cite
COTRIM, LUCAS P.; JOSE, MARCOS M.; CABRAL, EDUARDO L.L. Reinforcement learning control of robot manipulator. Revista Brasileira de Computação Aplicada, v. 13, n. 3, p. 42-53, 2021. DOI: 10.5335/rbca.v13i3.12091. Available at: http://repositorio.ipen.br/handle/123456789/32793. Accessed: 27 Apr 2024.
This reference is generated automatically according to the IPEN/SP style (ABNT NBR 6023); a final check, with adjustments where necessary, is recommended.