Self-Play and Using an Expert to Learn to Play Backgammon with Temporal Difference Learning - Journal of Intelligent Learning Systems and Applications

JILSA > Vol.2 No.2, May 2010

Self-Play and Using an Expert to Learn to Play Backgammon with Temporal Difference Learning ()

HTML

Download as PDF (Size: 883KB) PP. 57-68

DOI: 10.4236/jilsa.2010.22009 7,480 Downloads 13,223 Views Citations

Author(s)

Marco A. Wiering

Affiliation(s)

ABSTRACT

A promising approach to learn to play board games is to use reinforcement learning algorithms that can learn a game position evaluation function. In this paper we examine and compare three different methods for generating training games: 1) Learning by self-play, 2) Learning by playing against an expert program, and 3) Learning from viewing ex-perts play against each other. Although the third possibility generates high-quality games from the start compared to initial random games generated by self-play, the drawback is that the learning program is never allowed to test moves which it prefers. Since our expert program uses a similar evaluation function as the learning program, we also examine whether it is helpful to learn directly from the board evaluations given by the expert. We compared these methods using temporal difference methods with neural networks to learn the game of backgammon.

KEYWORDS

Board Games, Reinforcement Learning, TD(λ), Self-Play, Learning From Demonstration

Share and Cite:

M. Wiering, "Self-Play and Using an Expert to Learn to Play Backgammon with Temporal Difference Learning," Journal of Intelligent Learning Systems and Applications, Vol. 2 No. 2, 2010, pp. 57-68. doi: 10.4236/jilsa.2010.22009.

Journals Menu

Follow SCIRP

	+1 323-425-8868
	customer@scirp.org
	+86 18163351462(WhatsApp)
	1655362766

	Paper Publishing WeChat

Journals Menu

Home

About SCIRP

Service

Policies