File(s) under permanent embargo
An empirical study of reward structures for actor-critic reinforcement learning in air combat manoeuvring simulation
conference contribution
posted on 2019-01-01, 00:00 authored by B Kurniawan, P Vamplew, M Papasimeon, Richard DazeleyRichard Dazeley, C FoaleReinforcement learning techniques for solving complex problems are resource-intensive and take a long time to converge, prompting a need for methods that encourage faster learning. In this paper we show our successful application of actor-critic reinforcement learning to the air combat simulation domain and how reward structures affect the learning speed to find effective air combat tactics.