Intelligent Control and Automation

Volume 7, Issue 4 (November 2016)

ISSN Print: 2153-0653   ISSN Online: 2153-0661

Google-based Impact Factor: 0.70  Citations  

Exploring Deep Reinforcement Learning with Multi Q-Learning

HTML  XML Download Download as PDF (Size: 2849KB)  PP. 129-144  
DOI: 10.4236/ica.2016.74012    4,052 Downloads   9,540 Views  Citations

ABSTRACT

Q-learning is a popular temporal-difference reinforcement learning algorithm which often explicitly stores state values using lookup tables. This implementation has been proven to converge to the optimal solution, but it is often beneficial to use a function-approximation system, such as deep neural networks, to estimate state values. It has been previously observed that Q-learning can be unstable when using value function approximation or when operating in a stochastic environment. This instability can adversely affect the algorithm’s ability to maximize its returns. In this paper, we present a new algorithm called Multi Q-learning to attempt to overcome the instability seen in Q-learning. We test our algorithm on a 4 × 4 grid-world with different stochastic reward functions using various deep neural networks and convolutional networks. Our results show that in most cases, Multi Q-learning outperforms Q-learning, achieving average returns up to 2.5 times higher than Q-learning and having a standard deviation of state values as low as 0.58.

Share and Cite:

Duryea, E. , Ganger, M. and Hu, W. (2016) Exploring Deep Reinforcement Learning with Multi Q-Learning. Intelligent Control and Automation, 7, 129-144. doi: 10.4236/ica.2016.74012.

Cited by

[1] The research on intelligent cooperative combat of UAV cluster with multi-agent reinforcement learning
Aerospace Systems, 2022
[2] Autonomous and cooperative control of UAV cluster with multi-agent reinforcement learning
The Aeronautical Journal, 2022
[3] A Semi-Automatic Wheelchair with Navigation Based on Virtual-Real 2D Grid Maps and EEG Signals
Applied Sciences, 2022
[4] Comprehensive and Self‐Contained Introduction to Deep Reinforcement Learning
… Techniques for Wireless …, 2022
[5] Distribution Grid planning by self-designing with reinforcement learning
2022
[6] Reinforcement Learning with Deep Q-Networks
2022
[7] 面向分布式电网的多区域协同控制方法研究.
Electric Machines & …, 2021
[8] Application of reinforcement learning algorithm in delivery order system under supply chain environment
Mobile Information Systems, 2021
[9] Computational Modeling of Multi-Agent, Continuous Decision Making in Competitive Contexts
2021
[10] Wind Farm Power Generation Control via Double-Network-Based Deep Reinforcement Learning
2021
[11] Several Reinforcement Learning Methods in Mean-Field Games with Binary Action Spaces
2021
[12] 基于多智能体深度确定策略梯度算法的有功-无功协调调度模型
2021
[13] The wisdom of the crowd: reliable deep reinforcement learning through ensembles of Q-functions
2021
[14] A Q-values sharing framework for multi-agent reinforcement learning under budget constraint
2021
[15] Deep Reinforcement Learning for Energy-efficient Train Operation of Automatic Driving
2020
[16] IPro: An Approach for Intelligent SDN Monitoring
2020
[17] Accelerating Reinforcement Learning with Prioritized Experience Replay for Maze Game
2020
[18] Gradient boosting in crowd ensembles for Q-learning using weight sharing
2020
[19] Run-to-Run Control of Chemical Mechanical Polishing Process Based on Deep Reinforcement Learning
2020
[20] Deep Reinforcement Learning Based Optimal Schedule for a Battery Swapping Station Considering Uncertainties
2020
[21] Weighted Densely Connected Convolutional Networks for Reinforcement Learning
2019
[22] Quantum Multiple Q-Learning
2019
[23] Distributional Reinforcement Learning with Quantum Neural Networks
2019
[24] Reinforcement Learning with Deep Quantum Neural Networks
2019
[25] Morphing Control of a New Bionic Morphing UAV with Deep Reinforcement Learning
2019
[26] Approximate Policy-Based Accelerated Deep Reinforcement Learning
2019
[27] 利用空间优化的增强学习 Sarsa 改进预取算法
2019
[28] Multi Pseudo Q-Learning-Based Deterministic Policy Gradient for Tracking Control of Autonomous Underwater Vehicles
2018
[29] Wisdom of the crowd: reliable deep reinforcement learning through ensembles of Q-functions, The
2018
[30] Empirical Analysis of Decision Making of an AI Agent on IBM's 5Q Quantum Computer
2018
[31] Architecture of Management Game for Reinforced Deep Learning
Intelligent Systems and Applications, 2018
[32] Bayesian Nonparametric Models Characterize Instantaneous Strategies in a Competitive Dynamic Game
2018
[33] 基于加权密集连接卷积的深度强化学习
2018
[34] Aprendizado profundo: conceitos, técnicas e es-tudo de caso de análise de imagens com Java
2017
[35] 深度强化学习在智能制造中的应用展望综述
Computer Engineering …, 1972

Copyright © 2024 by authors and Scientific Research Publishing Inc.

Creative Commons License

This work and the related PDF file are licensed under a Creative Commons Attribution 4.0 International License.