Graduate student: Bo-Cheng Jiang (江柏承)
Thesis title: Study on Applications of Deep Reinforcement Learning Control to a Robot Manipulator with Series Elastic Actuator
Advisor: Yong-Lin Kuo (郭永麟)
Oral defense committee: Yong-Lin Kuo (郭永麟), Ming-Jong Tsai, Cheng-Hsiung Yang, Tsung-Liang Wu
Degree: Master's
Department: College of Engineering - Graduate Institute of Automation and Control
Year of publication: 2021
Academic year of graduation: 109 (ROC calendar)
Language: Chinese
Number of pages: 193
Keywords: Deep reinforcement learning, Position control, Torque control, Series elastic actuator, Disturbance observer
Owing to rapidly growing industrial demand and shifting consumer markets in recent years, high-precision, high-speed industrial robotic arms have become increasingly important in automation. However, increasingly diversified market demands challenge control approaches that rely on traditional control methods, manual operation, and operator experience, particularly for complex, difficult-to-control robotic arms.
Therefore, this thesis applies deep reinforcement learning to control a high-dimensional nonlinear manipulator with series elastic actuators. The aim is to improve the accuracy and robustness of position control and torque control and to reduce the difficulty of designing the control system, so that users no longer need time-consuming tuning methods that depend on their own experience.
This thesis uses the deep deterministic policy gradient (DDPG) and the twin delayed deep deterministic policy gradient (TD3) as controller algorithms and compares their control performance with that of PID control. In addition, a disturbance observer is added to reduce the effect of disturbances. Two hardware setups with series elastic actuators are used. The first connects a torsion spring and a brake in series with a motor. The second is a three-axis robotic arm in which each joint is driven by a motor connected in series with a torsion spring. Position control and torque control are performed separately on both setups. MATLAB is used to train the deep reinforcement learning controllers and to simulate PID control. In the experiments, the trained agent is converted into a code library and imported into LabVIEW to build the controllers, which send commands to the DC motors and receive feedback from the encoders, thereby realizing deep reinforcement learning control. PID control experiments are also carried out using the parameters obtained in simulation, and the results are discussed.
The experimental results reveal differences in how well deep reinforcement learning suits position control and torque control. For the high-dimensional nonlinear manipulator with series elastic actuators, control performance is strongly affected by the design of the reward function. In addition, the experimental trends are consistent with the simulations, which supports the feasibility of applying deep reinforcement learning to the control design of series elastic actuator systems. Finally, directions for future research are presented.
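For readers unfamiliar with the two algorithms named in the abstract, the following is a minimal sketch of how the critic's learning target differs between the deep deterministic policy gradient and the twin delayed deep deterministic policy gradient. It is written in Python with NumPy for illustration only; the thesis itself trains its agents in MATLAB, and all function names, parameter names, and default values here are hypothetical rather than taken from the thesis.

```python
import numpy as np

def ddpg_target(reward, done, next_q, gamma=0.99):
    """DDPG critic target: bootstrap from a single target critic."""
    return reward + gamma * (1.0 - done) * next_q

def smoothed_target_action(target_action, noise, noise_clip, act_low, act_high):
    """TD3 target policy smoothing: add clipped noise to the target action,
    then clip the result to the actuator's action bounds."""
    clipped_noise = np.clip(noise, -noise_clip, noise_clip)
    return np.clip(target_action + clipped_noise, act_low, act_high)

def td3_target(reward, done, next_q1, next_q2, gamma=0.99):
    """TD3 critic target: bootstrap from the smaller of two target critics
    (clipped double-Q learning) to curb value overestimation."""
    return reward + gamma * (1.0 - done) * np.minimum(next_q1, next_q2)
```

In the twin delayed variant, the two target critics would be evaluated at the smoothed target action, and the actor and target networks are updated less often than the critics. Those update schedules are omitted from this sketch.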

