Softmax exploration strategies for multiobjective reinforcement learning

Vamplew, P; Dazeley, Richard; Foale, C

Softmax exploration strategies for multiobjective reinforcement learning

journal contribution

posted on 2017-11-08, 00:00 authored by P Vamplew, Richard DazeleyRichard Dazeley, C Foale

Despite growing interest over recent years in applying reinforcement learning to multiobjective problems, there has been little research into the applicability and effectiveness of exploration strategies within the multiobjective context. This work considers several widely-used approaches to exploration from the single-objective reinforcement learning literature, and examines their incorporation into multiobjective Q-learning. In particular this paper proposes two novel approaches which extend the softmax operator to work with vector-valued rewards. The performance of these exploration strategies is evaluated across a set of benchmark environments. Issues arising from the multiobjective formulation of these benchmarks which impact on the performance of the exploration strategies are identified. It is shown that of the techniques considered, the combination of the novel softmax–epsilon exploration with optimistic initialisation provides the most effective trade-off between exploration and exploitation.

History

Journal

Neurocomputing

Volume

263

Pagination

74-86

Location

Amsterdam, The Netherlands

Publisher DOI

https://doi.org/10.1016/j.neucom.2016.09.141

ISSN

0925-2312

eISSN

1872-8286

Language

eng

Publication classification

C Journal article, C1.1 Refereed article in a scholarly journal

Copyright notice

2017, Elsevier B.V.

Publisher

Elsevier

Usage metrics

Keywords

multiobjective reinforcement learning exploration ϵ-greedy exploration optimistic initialisation softmax School of Information Technology 4602 Artificial intelligence 4611 Machine learning

Softmax exploration strategies for multiobjective reinforcement learning

History

Journal

Volume

Pagination

Location

Publisher DOI

ISSN

eISSN

Language

Publication classification

Copyright notice

Publisher

Usage metrics

Categories

Keywords

Licence

Exports