Review of the MuZero Algorithm with Implementation on Quoridor

Graduate Thesis uoadl:3393068 53 Read counter

Unit:
Department of Informatics and Telecommunications
Πληροφορική
Deposit date:
2024-03-27
Year:
2024
Author:
MYSTRIOTIS DIMITRIOS
Supervisors info:
Παναγιώτης Σταματόπουλος, Επίκουρος Καθηγητής, Τμήμα Πληροφορικής και Τηλεπικοινωνιών, Εθνικό και Καποδιστριακό Πανεπιστήμιο Αθηνών
Original Title:
Review of the MuZero Algorithm with Implementation on Quoridor
Languages:
English
Translated title:
Review of the MuZero Algorithm with Implementation on Quoridor
Summary:
This thesis discusses the development of the MuZero algorithm by DeepMind and its application in the game of Quoridor. The algorithm is a deep reinforcement learning algorithm that expands on previous algorithms to achieve exceptional performance in learning and planning. The key difference from its predecessors is the ability to operate in complex environments without any prior knowledge. All knowledge of game rules and dynamics is learned through interactions with the environment. The algorithm is trained through self-play, where it learns by playing games against itself, and uses the generated data to improve its performance. The thesis also discusses the environment of Quoridor, a competitive two-player strategy board game, and the application of the MuZero algorithm to it.
Main subject category:
Technology - Computer science
Keywords:
Machine learning, Reinforcement learning, deep learning, neural networks, Markov decision process, Monte Carlo tree search, deep reinforcement learning, board games
Index:
Yes
Number of index pages:
2
Contains images:
Yes
Number of references:
18
Number of pages:
54
DimitrisMystriotis_ptixiaki_v2.pdf (1 MB) Open in new window