The application of a reinforcement learning agent to a multi-product manufacturing facility
Creighton, Douglas and Nahavandi, Saeid 2002, The application of a reinforcement learning agent to a multi-product manufacturing facility, in IEEE ICIT '02 : 2002 IEEE International Conference on Industrial Technology : productivity reincarnation through robotics & automation : 11-14 December 2002, Shangri-La Hotel, Bangkok, Thailand, IEEE Xplore, Piscataway, N.J., pp. 1229-1234.
An intelligent agent-based scheduling system, consisting of a reinforcement learning agent and a simulation model, has been developed and tested on a classic scheduling problem. The production facility studied is a multi-product serial line subject to stochastic failure. The agent's goal is to minimise total production costs through the selection of job sequence and batch size. The agent used reinforcement learning to explore the state space. By applying an independent inventory control policy for each product, the agent successfully identified optimal operating policies for a real production facility.
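The agent-plus-simulation loop described above can be illustrated with a minimal sketch. The abstract does not name the learning algorithm, the state encoding, or the cost figures, so everything below is an assumption made for illustration: tabular Q-learning stands in for the agent, a toy single-stage `step` function stands in for the simulation model, and the demand rate, failure probability, setup, holding and backlog costs are hypothetical. It does not reproduce the independent per-product inventory control policy used in the paper.

```python
import random
from collections import defaultdict

# All constants below are hypothetical and chosen only for illustration.
PRODUCTS = (0, 1)                 # two products sharing one line
BATCH_SIZES = (1, 2, 3)           # batch-size options available to the agent
ACTIONS = [(p, b) for p in PRODUCTS for b in BATCH_SIZES]
MAX_INV = 5                       # inventory clipped to [-MAX_INV, MAX_INV]; negative = backlog


def step(state, action, rng, fail_prob=0.1):
    """Toy stand-in for the simulation model: returns (next_state, cost)."""
    inventory, last_product = list(state[0]), state[1]
    product, batch = action
    cost = 0.0 if product == last_product else 2.0   # setup cost on changeover
    if rng.random() >= fail_prob:                    # line is up, batch completes
        inventory[product] += batch
    for p in PRODUCTS:
        if rng.random() < 0.7:                       # one unit of demand per product
            inventory[p] -= 1
        inventory[p] = max(-MAX_INV, min(MAX_INV, inventory[p]))
        # Holding cost for stock on hand, heavier penalty for backlog.
        cost += inventory[p] if inventory[p] >= 0 else -5.0 * inventory[p]
    return (tuple(inventory), product), cost


def train(episodes=300, horizon=50, alpha=0.1, gamma=0.95, epsilon=0.1, seed=0):
    """Tabular Q-learning on costs: Q estimates discounted cost-to-go."""
    rng = random.Random(seed)
    q = defaultdict(float)
    for _ in range(episodes):
        state = ((0, 0), 0)
        for _ in range(horizon):
            if rng.random() < epsilon:               # explore the state space
                action = rng.choice(ACTIONS)
            else:                                    # exploit: lowest estimated cost
                action = min(ACTIONS, key=lambda a: q[(state, a)])
            next_state, cost = step(state, action, rng)
            best_next = min(q[(next_state, a)] for a in ACTIONS)
            q[(state, action)] += alpha * (cost + gamma * best_next - q[(state, action)])
            state = next_state
    return q


def greedy_action(q, state):
    """Return the (product, batch_size) pair with the lowest learned cost."""
    return min(ACTIONS, key=lambda a: q.get((state, a), 0.0))


if __name__ == "__main__":
    q_table = train()
    print(greedy_action(q_table, ((0, 0), 0)))
```

Because the objective is a cost, the sketch takes the minimum over actions where a reward-based formulation would take the maximum. Each action jointly selects the next product (job sequence) and the batch size, which mirrors the two decisions the abstract attributes to the agent.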
Notes
This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder.