Double BP Q-Learning Algorithm for Local Path Planning of Mobile Robot - Journal of Computer and Communications

JCC > Vol.9 No.6, June 2021

Journal of Computer and Communications

Volume 9, Issue 6 (June 2021)

ISSN Print: 2327-5219 ISSN Online: 2327-5227

Google-based Impact Factor: 1.12 Citations

Double BP Q-Learning Algorithm for Local Path Planning of Mobile Robot ()

HTML XML

Download as PDF (Size: 1123KB) PP. 138-157

DOI: 10.4236/jcc.2021.96008 298 Downloads 1,304 Views Citations

Author(s)

Guoming Liu, Caihong Li^*, Tengteng Gao, Yongdi Li, Xiaopei He

Affiliation(s)

School of Computer Science and Technology, Shandong University of Technology, Zibo, China.

ABSTRACT

Aiming at the dimension disaster problem, poor model generalization ability and deadlock problem in special obstacles environment caused by the increase of state information in the local path planning process of mobile robot, this paper proposed a Double BP Q-learning algorithm based on the fusion of Double Q-learning algorithm and BP neural network. In order to solve the dimensional disaster problem, two BP neural network fitting value functions with the same network structure were used to replace the two Q value tables in Double Q-Learning algorithm to solve the problem that the Q value table cannot store excessive state information. By adding the mechanism of priority experience replay and using the parameter transfer to initialize the model parameters in different environments, it could accelerate the convergence rate of the algorithm, improve the learning efficiency and the generalization ability of the model. By designing specific action selection strategy in special environment, the deadlock state could be avoided and the mobile robot could reach the target point. Finally, the designed Double BP Q-learning algorithm was simulated and verified, and the probability of mobile robot reaching the target point in the parameter update process was compared with the Double Q-learning algorithm under the same condition of the planned path length. The results showed that the model trained by the improved Double BP Q-learning algorithm had a higher success rate in finding the optimal or sub-optimal path in the dense discrete environment, besides, it had stronger model generalization ability, fewer redundant sections, and could reach the target point without entering the deadlock zone in the special obstacles environment.

KEYWORDS

Mobile Robot, Local Path Planning, Double BP Q-Learning, BP Neural Network, Transfer Learning

Share and Cite:

Liu, G. , Li, C. , Gao, T. , Li, Y. and He, X. (2021) Double BP Q-Learning Algorithm for Local Path Planning of Mobile Robot. Journal of Computer and Communications, 9, 138-157. doi: 10.4236/jcc.2021.96008.

Cited by

[1]	Path Planning of Industrial Wheeled Robots Based on Wireless Communication and Machine Learning Algorithms
	Mobile Information Systems, 2022

[2]	Proximal policy optimization based dynamic path planning algorithm for mobile robots
	Electronics Letters, 2022

[3]	Reinforcement Learning for Optimal Energy Control in Data Center Operations
	2022

Journals Menu

Follow SCIRP

	+1 323-425-8868
	customer@scirp.org
	+86 18163351462(WhatsApp)
	1655362766

	Paper Publishing WeChat

Journals Menu

Home

About SCIRP

Service

Policies