
Vlad Ilie
Hard-working, ambitious and enthusiastic student, capable of learning quickly. With solid time management and strong organizational and multitasking skills, I am dedicated to completing tasks on time. At the moment I am working on my MSc thesis: implementing a Deep Reinforcement Learning agent in StarCraft II.
Related Authors
Amr Kayid
The German University in Cairo
Matt ffrench-Constant
University of Cambridge
Thanh Thi Nguyen
Monash University
Adil Khan
University Of Peshawar
Papers by Vlad Ilie
To overcome these challenges, the project uses the Reaver framework to set up an initial baseline Synchronous Advantage Actor-Critic (A2C) agent, which is then modified to use the RoE method. The project uses the game's non-spatial features (NSF), including the game score, to define an intrinsic reward function associated with the start, progress and completion of actions. Having the algorithm score its own ability to play the game based on environmental changes allows the agent to perform an automated curriculum learning process, where it attempts gradually more complex tasks.
The RoE method has already been tested and proven successful in the VizDoom environment; this project extends it to SC2 and implements three variations: Binary RoE, Quantitative RoE and Greedy RoE. Combinations of the three can be applied to the same agent and tailored for specific purposes, guiding the agent towards desired behaviours.
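The core of the intrinsic reward described above can be sketched as follows. This is a minimal illustration, not the project's actual implementation: it assumes events are detected as increases in the non-spatial feature (NSF) vector, and that each event's reward is weighted by the inverse of its running occurrence count, so rarer events are rewarded more strongly. The class name, decay parameter, and update rule are all assumptions for illustration.

```python
import numpy as np

class RarityOfEventsReward:
    """Hedged sketch of an RoE-style intrinsic reward.

    An "event" is assumed to be any positive change in a non-spatial
    feature between consecutive observations. Rewards are scaled by the
    inverse of a decayed running count per event type, so events that
    occur rarely yield larger intrinsic rewards.
    """

    def __init__(self, num_features, decay=0.99):
        # Start counts at 1 to avoid division by zero on first occurrence.
        self.counts = np.ones(num_features)
        self.decay = decay

    def intrinsic_reward(self, prev_nsf, nsf):
        # Detect which non-spatial features increased this step.
        events = (np.asarray(nsf) - np.asarray(prev_nsf)) > 0
        # Update the decayed occurrence counts with the new events.
        self.counts = self.decay * self.counts + events
        # Rarer events (lower counts) contribute more reward.
        return float(np.sum(events / self.counts))
```

Under this sketch, repeatedly triggering the same event yields a shrinking reward, which is what pushes the agent towards less-visited behaviours and produces the automated curriculum effect the abstract describes.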