Balanced Q-learning: Combining the influence of optimistic and pessimistic targets
Version 2 2024-06-03, 01:04Version 2 2024-06-03, 01:04
Version 1 2023-10-20, 03:32Version 1 2023-10-20, 03:32
journal contribution
posted on 2024-06-03, 01:04 authored by Thommen Karimpanal GeorgeThommen Karimpanal George, Hung LeHung Le, M Abdolshah, Santu RanaSantu Rana, Sunil GuptaSunil Gupta, Truyen TranTruyen Tran, Svetha VenkateshSvetha VenkateshBalanced Q-learning: Combining the influence of optimistic and pessimistic targets
History
Journal
Artificial IntelligenceVolume
325Article number
104021Pagination
104021-104021Location
Amsterdam, The NetherlandsISSN
0004-3702Language
enPublication classification
C1 Refereed article in a scholarly journalPublisher
Elsevier BVPublication URL
Usage metrics
Categories
Keywords
Licence
Exports
RefWorksRefWorks
BibTeXBibTeX
Ref. managerRef. manager
EndnoteEndnote
DataCiteDataCite
NLMNLM
DCDC