Exploring Deep Reinforcement Learning with Multi Q-Learning - Intelligent Control and Automation

ICA > Vol.7 No.4, November 2016

Intelligent Control and Automation

Volume 7, Issue 4 (November 2016)

ISSN Print: 2153-0653 ISSN Online: 2153-0661

Google-based Impact Factor: 0.70 Citations

Exploring Deep Reinforcement Learning with Multi Q-Learning ()

HTML XML

Download as PDF (Size: 2849KB) PP. 129-144

DOI: 10.4236/ica.2016.74012 4,052 Downloads 9,540 Views Citations

Author(s)

Ethan Duryea, Michael Ganger, Wei Hu

Affiliation(s)

Department of Computer Science, Houghton College, Houghton, USA.

ABSTRACT

Q-learning is a popular temporal-difference reinforcement learning algorithm which often explicitly stores state values using lookup tables. This implementation has been proven to converge to the optimal solution, but it is often beneficial to use a function-approximation system, such as deep neural networks, to estimate state values. It has been previously observed that Q-learning can be unstable when using value function approximation or when operating in a stochastic environment. This instability can adversely affect the algorithm’s ability to maximize its returns. In this paper, we present a new algorithm called Multi Q-learning to attempt to overcome the instability seen in Q-learning. We test our algorithm on a 4 × 4 grid-world with different stochastic reward functions using various deep neural networks and convolutional networks. Our results show that in most cases, Multi Q-learning outperforms Q-learning, achieving average returns up to 2.5 times higher than Q-learning and having a standard deviation of state values as low as 0.58.

KEYWORDS

Reinforcement Learning, Deep Learning, Multi Q-Learning

Share and Cite:

Duryea, E. , Ganger, M. and Hu, W. (2016) Exploring Deep Reinforcement Learning with Multi Q-Learning. Intelligent Control and Automation, 7, 129-144. doi: 10.4236/ica.2016.74012.

Cited by

[1]	The research on intelligent cooperative combat of UAV cluster with multi-agent reinforcement learning
	Aerospace Systems, 2022

[2]	Autonomous and cooperative control of UAV cluster with multi-agent reinforcement learning
	The Aeronautical Journal, 2022

[3]	A Semi-Automatic Wheelchair with Navigation Based on Virtual-Real 2D Grid Maps and EEG Signals
	Applied Sciences, 2022

[4]	Comprehensive and Self‐Contained Introduction to Deep Reinforcement Learning
	… Techniques for Wireless …, 2022

[5]	Distribution Grid planning by self-designing with reinforcement learning
	2022

[6]	Reinforcement Learning with Deep Q-Networks
	2022

[7]	面向分布式电网的多区域协同控制方法研究.
	Electric Machines & …, 2021

[8]	Application of reinforcement learning algorithm in delivery order system under supply chain environment
	Mobile Information Systems, 2021

[9]	Computational Modeling of Multi-Agent, Continuous Decision Making in Competitive Contexts
	2021

[10]	Wind Farm Power Generation Control via Double-Network-Based Deep Reinforcement Learning
	2021

[11]	Several Reinforcement Learning Methods in Mean-Field Games with Binary Action Spaces
	2021

[12]	基于多智能体深度确定策略梯度算法的有功-无功协调调度模型
	2021

[13]	The wisdom of the crowd: reliable deep reinforcement learning through ensembles of Q-functions
	2021

[14]	A Q-values sharing framework for multi-agent reinforcement learning under budget constraint
	2021

[15]	Deep Reinforcement Learning for Energy-efficient Train Operation of Automatic Driving
	2020

[16]	IPro: An Approach for Intelligent SDN Monitoring
	2020

[17]	Accelerating Reinforcement Learning with Prioritized Experience Replay for Maze Game
	2020

[18]	Gradient boosting in crowd ensembles for Q-learning using weight sharing
	2020

[19]	Run-to-Run Control of Chemical Mechanical Polishing Process Based on Deep Reinforcement Learning
	2020

[20]	Deep Reinforcement Learning Based Optimal Schedule for a Battery Swapping Station Considering Uncertainties
	2020

[21]	Weighted Densely Connected Convolutional Networks for Reinforcement Learning
	2019

[22]	Quantum Multiple Q-Learning
	2019

[23]	Distributional Reinforcement Learning with Quantum Neural Networks
	2019

[24]	Reinforcement Learning with Deep Quantum Neural Networks
	2019

[25]	Morphing Control of a New Bionic Morphing UAV with Deep Reinforcement Learning
	2019

[26]	Approximate Policy-Based Accelerated Deep Reinforcement Learning
	2019

[27]	利用空间优化的增强学习 Sarsa 改进预取算法
	2019

[28]	Multi Pseudo Q-Learning-Based Deterministic Policy Gradient for Tracking Control of Autonomous Underwater Vehicles
	2018

[29]	Wisdom of the crowd: reliable deep reinforcement learning through ensembles of Q-functions, The
	2018

[30]	Empirical Analysis of Decision Making of an AI Agent on IBM's 5Q Quantum Computer
	2018

[31]	Architecture of Management Game for Reinforced Deep Learning
	Intelligent Systems and Applications, 2018

[32]	Bayesian Nonparametric Models Characterize Instantaneous Strategies in a Competitive Dynamic Game
	2018

[33]	基于加权密集连接卷积的深度强化学习
	2018

[34]	Aprendizado profundo: conceitos, técnicas e es-tudo de caso de análise de imagens com Java
	2017

[35]	深度强化学习在智能制造中的应用展望综述
	Computer Engineering …, 1972

Journals Menu

Follow SCIRP

	+1 323-425-8868
	customer@scirp.org
	+86 18163351462(WhatsApp)
	1655362766

	Paper Publishing WeChat

Journals Menu

Home

About SCIRP

Service

Policies