Deakin University
Browse
nahavandi-reinforcementlearning-2002.pdf (452.15 kB)

A reinforcement learning approach for robot control in an unknown environment

Download (452.15 kB)
conference contribution
posted on 2002-01-01, 00:00 authored by N F Xiao, Saeid Nahavandi
In this paper, a control approach based on reinforcement learning is present for a robot to complete a dynamic task in an unknown environment. First, a temporal difference-based reinforcement learning algorithm and its evaluation function are used to make the robot learn with its trials and errors as well as experiences. Second, the simulation are carried out to adjust the parameters of the learning algorithm and determine an optimal policy by using the models of a robot. Last, the effectiveness of the present approach is demonstrated by balancing an inverse pendulum in the unknown environment.

History

Title of proceedings

IEEE ICIT' 02 : 2002 IEEE International Conference on Industrial Technology : productivity reincarnation through robotics & automation : 11-14 December 2002, Shangri-La Hotel, Bangkok, Thailand

Event

IEEE International Conference on Industrial Technology (2002 : Bangkok, Thailand)

Pagination

1096 - 1099

Publisher

IEEE Xplore

Location

Bangkok, Thailand

Place of publication

Piscataway, N.J.

Start date

2002-12-11

End date

2002-12-14

ISBN-13

9780780376571

ISBN-10

0780376579

Language

eng

Notes

This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder.

Publication classification

E1 Full written paper - refereed

Copyright notice

2002, IEEE

Usage metrics

    Research Publications

    Categories

    No categories selected

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC