Potentials of Reinforcement Learning for Production

JournalIndustrie 4.0 Management
Issue Volume 37, 2021, Edition 2, Pages 25-29
Open Accesshttps://doi.org/10.30844/I40M_21-2_S25-29
Bibliography Share Cite Download

Abstract

Reinforcement learning (RL) can be more and more used for real-world decision problems in production. The article gives an introduction into the functionalities of RL as well as its preferred areas of application. It further describes project examples from everyday production. The presented knowledge of current research is intended to make this sub-area of artificial intelligence accessible to a broader audience and to increase the added value in production.

Bibliography

[1] D.Silver,J.Schrittwieser,K.Simonyan, I. Antonoglou, A. Huang, A. Guez, T. Hubert, L. Baker, M. Lai, A. Bolton, Y. Chen, T. Lillicrap, F. Hui, L. Sifre, G. van den Driessche, T. Graepel und D. Hassabis, „Mastering the game of Go without human knokwledge,“ Nature 550, Oktober 2017.
[2] A. Borghesi, A. Bartolini, M. Lombardi, M. Milano und L. Benini,„Anomaly Detection Using Autoencoders in High Performance Computing Systems,“ in The Thirty-First AAAI Conference on Innovative Applications of Artificial Intelligence, 2019.
[3] R.S.SuttonundA.G.Barto,Reinforcement learning: An introduction, MIT press, 2018.
[4] T. Haarnoja, A. Zhou, K. Hartikainen, G. Tucker, S. Ha, J. Tan, V. Kumar, H. Zhu, A. Gupta, P. Abbeel und S. Levine, „Soft Actor-Critic Algorithms and Applications,“ 2018.
[5] J.Schulman,F.Wolski,P.Dhariwal, A. Radford und O. Klimov, „Proximal Policy Optimization Algorithms,“ arXiv Preprint, 2017.
[6] X.Peng,W.Andrychowicz,W. Zaremba und P. Abbeel, „Sim-to-Real Transfer of Robotic Control with Dynamics Randomization,“ 2017.
[7] Y. Chebotar, A. Handa, V. Makoviychuk, M. Macklin, J. Issac, N. Ratliff und D. Fox, „Closing the Sim-To-Real Loop: Adapting Simulation, Randomization with Real World Experience,“ 2018.
[8] R. Dittmar und B.-M. Pfeiffer, Modellbasierte prädiktive Regelung: Eine Einführung für Ingenieure, Walter de Gruyter, 2009.
[9] B. Waschneck, Autonome Entscheidungsfindung in der Produktionssteuerung komplexer Werkstattfertigungen, Stuttgart: 2020.
[10]T. Altenmüller, T. Stüker, B. Waschneck, A. Kuhnle und G. Lanza, „Reinforcement learning for an intelligent and autonomous production control of complex job-shops under time constraints,“ Production Engineering 14, 2020.
[11] D. Silver und J. Veness, „Monte-Carlo Planning in Large POMDPs,“ (NIPS) Advances in Neural Information Processing Systems, 2010.
[12] M. El-Shamouty, K. Kleeberger, A. Lämmle und M. Huber, „Simulation-driven machine learning for robotics and automation,“ tm -Technisches Messen, pp. 673-684, August 2019.

Your downloads