Toward Obstacle Avoidance for Mobile Robots Using Deep Reinforcement Learning Algorithm
2021
The state-of-the-art deep reinforcement learning algorithm, i.e., the deep deterministic policy gradient (DDPG), has achieved good performance in continuous control problems in robotics. However, the conventional experience replay mechanism of DDPG stores the transitions explored by the mobile robot in a buffer pool and trains the neural network through uniform random sampling, without considering whether a transition is valuable, which can degrade network performance. To overcome this limitation, a DDPG framework with separated experience replay is developed in this study for collision-free navigation of mobile robots, in which valuable transitions and failed transitions are stored and replayed separately. In addition, an environment state vector covering the mobile robot and the obstacles is designed, together with the reward function and the action space. Simulation results show that the proposed model achieves collision-free navigation in the presence of multiple obstacles.
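The following is a minimal sketch of the separated experience replay idea described in the abstract, assuming two buffers (one for "valuable" transitions and one for failed, e.g. collision, transitions) and a mixed mini-batch drawn from both. The buffer sizes, the split ratio, and the criterion for labeling a transition "valuable" are illustrative assumptions, not the paper's exact settings.

```python
import random
from collections import deque

class SeparatedReplayBuffer:
    """Sketch of a replay buffer that keeps valuable and failed transitions apart."""

    def __init__(self, capacity=100_000):
        self.valuable = deque(maxlen=capacity)  # successful / high-reward transitions
        self.failed = deque(maxlen=capacity)    # collision / failure transitions

    def store(self, state, action, reward, next_state, done, collided):
        transition = (state, action, reward, next_state, done)
        # Assumed criterion: collisions go to the failure buffer,
        # everything else to the valuable buffer.
        if collided:
            self.failed.append(transition)
        else:
            self.valuable.append(transition)

    def sample(self, batch_size, failed_fraction=0.25):
        # Draw a fixed fraction of each mini-batch from the failure buffer
        # so rare collision experience is not drowned out by uniform sampling.
        n_failed = min(int(batch_size * failed_fraction), len(self.failed))
        n_valuable = min(batch_size - n_failed, len(self.valuable))
        batch = (random.sample(self.failed, n_failed)
                 + random.sample(self.valuable, n_valuable))
        random.shuffle(batch)
        return batch
```

In a DDPG training loop, `store` would be called after every environment step and `sample` would supply the mini-batch used to update the critic and actor networks; the `failed_fraction` parameter is a hypothetical knob for how strongly failure experience is emphasized during training.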