Deakin University
Browse

File(s) under permanent embargo

A human mixed strategy approach to deep reinforcement learning

Version 2 2024-06-06, 07:49
Version 1 2019-05-22, 09:49
conference contribution
posted on 2024-06-06, 07:49 authored by ND Nguyen, S Nahavandi, T Nguyen
In 2015, Google's Deepmind announced an advancement in creating an autonomous agent based on deep reinforcement learning (DRL) that could beat a professional player in a series of 49 Atari games. However, the current manifestation of DRL is still immature, and has significant drawbacks. One of DRL's imperfections is its lack of 'exploration' during the training process, especially when working with high-dimensional problems. In this paper, we propose a mixed strategy approach that mimics behaviors of human when interacting with environment, and create a 'thinking' agent that allows for more efficient exploration in the DRL training process. The simulation results based on the Breakout game show that our scheme achieves a higher probability of obtaining a maximum score than does the baseline DRL algorithm, i.e., the asynchronous advantage actor-critic method. The proposed scheme therefore can be applied effectively to solving a complicated task in a real-world application.

History

Pagination

4023-4028

Location

Miyazaki, Japan

Start date

2018-10-07

End date

2018-10-10

ISBN-13

9781538666500

Language

eng

Publication classification

E1 Full written paper - refereed

Copyright notice

2018, IEEE

Editor/Contributor(s)

[Unknown]

Title of proceedings

SMC 2018 : The making of a human-centered cyber world : Proceedings of the 2018 IEEE International Conference on Systems, Man, and Cybernetics

Event

IEEE Systems, Man, and Cybernetics Society. Conference (2018 : Miyazaki, Japan)

Publisher

Institute of Electrical and Electronics Engineers

Place of publication

Piscataway, N.J.

Series

IEEE Systems, Man, and Cybernetics Society Conference

Usage metrics

    Research Publications

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC