Dynamic Job Scheduling in Manufacturing Systems using Deep Q-Learning

Hind  Khalid

doi:10.22153/kej.2026.10.001

المؤلفون

Hind Khalid جامعة النهرين https://orcid.org/0000-0002-8318-097X

DOI:

https://doi.org/10.22153/kej.2026.10.001

الكلمات المفتاحية:

Dynamic job-shop scheduling; Deep Q-Learning; Reinforcement learning; Machine utilisation; Makespan optimisation

الملخص

In this connection, this paper proposes a Deep Q-Network (DQN) approach to address the dynamic nature of the job-shop scheduling problem. Dynamic scheduling requires an efficient and reliable algorithm for handling disruptions such as machine breakdowns and job priority changes. When comparing DQN with traditional approaches such as genetic algorithm (GA) and PSO, the latter algorithms cannot cope with the problems. On the contrary, DQN gains knowledge from its experiences within the factory and develops a strategy for solving the scheduling problem through minimizing makespan and maximizing machine utilization. Moreover, using experience replay (ER) and target networks enables DQN to maintain stability and develop an optimal schedule. Empirically, it was found that DQN reduces makespan by 24.8% with machine utilization being 92%. From the results obtained, it can be noted that the learning parameters play a great role in determining the performance of the model. Thus, this study proves that DQN is an effective approach for addressing the issue under discussion and could also be used for developing other approaches such as multi-agent and double DQN.

التنزيلات

تنزيل البيانات ليس متاحًا بعد.

المراجع

[1] M. Sanchez, E. Exposito, and J. Aguilar, “Autonomic computing in manufacturing process coordination in industry 4.0 context,” J Ind Inf Integr, vol. 19, p. 100159, 2020, doi: https://doi.org/10.1016/j.jii.2020.100159 .

[2] V. V. Popov, E. V. Kudryavtseva, N. K. Katiyar, A. Shishkin, S. I. Stepanov, and S. Goel, “Industry 4.0 and Digitalisation in Healthcare,” Materials, vol. 15, no. 6, p. 2140, 2022, doi: https://doi.org/10.3390/ma15062140.

[3] B. Zhou, J. Bao, J. Li, Y. Lu, T. Liu, and Q. Zhang, “A novel knowledge graph-based optimization approach for resource allocation in discrete manufacturing workshops,” Robot Comput Integr Manuf, vol. 71, no. 3, p. 102160, 2021, doi: https://doi.org/10.1016/j.rcim.2021.102160.

[4] C. Liu, P. Zheng, and X. Xu, “Digitalisation and servitisation of machine tools in the era of Industry 4.0: a review,” Int J Prod Res, vol. 61, no. 12, pp. 4069–4101, 2023, doi: https://doi.org/10.1080/00207543.2021.1969462.

[5] D. Kiel, J. M. Müller, C. Arnold, and K. I. Voigt, “Sustainable industrial value creation: Benefits and challenges of industry 4.0,” in Digital Disruptive Innovation, World Scientific: Singapore, 2021, pp. 231–270. doi: https://doi.org/10.1142/9781786347602_0009 .

[6] T. Suganuma, T. Oide, S. Kitagami, K. Sugawara, and N. Shiratori, “Multiagent-Based Flexible Edge Computing Architecture for IoT,” IEEE Netw, vol. 32, no. 1, pp. 16–23, 2018, doi: https://doi.org/10.1109/MNET.2018.1700201 .

[7] B. Bentalha, “The evolution of sustainability in supply chain management: A literature review,” Global Challenges for the Environment and Climate Change, vol. 162, pp. 332–356, 2024, doi: https://doi.org/10.4018/979-8-3693-2845-3.ch017.

[8] K. Li, T. Zhou, B. hai Liu, and H. Li, “A multi-agent system for sharing distributed manufacturing resources,” Expert Syst Appl, vol. 99, pp. 32–43, 2018, doi: https://doi.org/10.1016/j.eswa.2018.01.027.

[9] L. Cai, W. Li, Y. Luo, and L. He, “Real-time scheduling simulation optimisation of job shop in a production-logistics collaborative environment,” Int J Prod Res, vol. 61, no. 5, pp. 1373–1393, 2023, doi: https://doi.org/10.1080/00207543.2021.2023777 .

[10] P. Valckenaers, “Perspective on holonic manufacturing systems: PROSA becomes ARTI,” Comput Ind, vol. 120, p. 103226, Sep. 2020, doi: https://doi.org/10.1016/j.compind.2020.103226.

[11] Y. Du, J. Q. Li, X. L. Chen, P. Y. Duan, and Q. K. Pan, “Knowledge-Based Reinforcement Learning and Estimation of Distribution Algorithm for Flexible Job Shop Scheduling Problem,” IEEE Trans Emerg Top Comput Intell, vol. 7, no. 4, pp. 1036–1050, 2023, doi: https://doi.org/10.1109/TETCI.2022.3145706.

[12] J. Wang, Y. Zhang, Y. Liu, and N. Wu, “Multiagent and bargaining-game-based real-time scheduling for internet of things-enabled flexible job shop,” IEEE Internet Things J, vol. 6, no. 2, pp. 2518–2531, 2019, doi: https://doi.org/10.1109/JIOT.2018.2871346.

[13] M. K. Rafsanjani and M. Riyahi, “A new hybrid genetic algorithm for job shop scheduling problem,” International Journal of Advanced Intelligence Paradigms, vol. 16, no. 2, pp. 157–171, 2020, doi: https://doi.org/10.1504/IJAIP.2020.107012.

[14] D. Y. Sha and H. H. Lin, “A multi-objective PSO for job-shop scheduling problems,” 2009 International Conference on Computers and Industrial Engineering, CIE 2009, vol. 37, no. 2, pp. 489–494, 2009, doi: https://doi.org/10.1109/iccie.2009.5223966.

[15] L. P. Kaelbling, M. L. Littman, and A. W. Moore, “Reinforcement learning: A survey,” Journal of Artificial Intelligence Research, vol. 4, no. 1, pp. 237–285, 1996. Onlie: https://dl.acm.org/doi/10.5555/1622737.1622748

[16] M. Zhang, Y. Lu, Y. Hu, N. Amaitik, and Y. Xu, “Dynamic Scheduling Method for Job-Shop Manufacturing Systems by Deep Reinforcement Learning with Proximal Policy Optimization,” Sustainability (Switzerland), vol. 14, no. 9, p. 5177, 2022, doi: https://doi.org/10.3390/su14095177 .

[17] Z. Wang, T. Schaul, M. Hessel, H. Van Hasselt, M. Lanctot, and N. De Frcitas, “Dueling Network Architectures for Deep Reinforcement Learning,” in 33rd International Conference on Machine Learning, ICML 2016, 2016, pp. 2939–2947. Online: https://dl.acm.org/doi/10.5555/3045390.3045601

[18] J. Schulman, F. Wolski, P. Dhariwal, A. Radford, and O. Klimov, “Proximal Policy Optimization Algorithms,” 2017. arXiv preprint, vol. arXiv:1707.06347, doi: https://doi.org/10.48550/arXiv.1707.06347

[19] Y. Zhao, Y. Wang, Y. Tan, J. Zhang, and H. Yu, “Dynamic Jobshop Scheduling Algorithm Based on Deep Q Network,” IEEE Access, vol. 9, pp. 122995–123011, 2021, doi: https://doi.org/10.1109/ACCESS.2021.3110242.

[20] S. Luo, L. Zhang, and Y. Fan, “Real-Time Scheduling for Dynamic Partial-No-Wait Multiobjective Flexible Job Shop by Deep Reinforcement Learning,” IEEE Transactions on Automation Science and Engineering, vol. 19, no. 4, pp. 3020–3038, 2022, doi: https://doi.org/10.1109/TASE.2021.3104716.

[21] Y. Zhang, H. Zhu, and D. Tang, “An improved hybrid particle swarm optimization for multi-objective flexible job-shop scheduling problem,” Kybernetes, vol. 49, no. 12, pp. 2873–2892, 2020, doi: https://doi.org/10.1108/K-06-2019-0430.

[22] Y. Zhao and H. Zhang, “Application of machine learning and rule scheduling in a job-shop production control system,” International Journal of Simulation Modelling, vol. 20, no. 2, pp. 410–421, 2021, doi: https://doi.org/10.2507/IJSIMM20-2-COhttps://doi.org/10.

[23] R. Buddala and S. S. Mahapatra, “Two-stage teaching-learning-based optimization method for flexible job-shop scheduling under machine breakdown,” International Journal of Advanced Manufacturing Technology, vol. 100, no. 5–8, pp. 1419–1432, 2019, doi: https://doi.org/10.1007/s00170-018-2805-0.

[24] H. Wang, B. R. Sarker, J. Li, and J. Li, “Adaptive scheduling for assembly job shop with uncertain assembly times based on dual Q-learning,” Int J Prod Res, vol. 59, no. 19, pp. 5867–5883, 2021, doi: https://doi.org/10.1080/00207543.2020.1794075 .

[25] Y. Li, W. Gu, M. Yuan, and Y. Tang, “Real-time data-driven dynamic scheduling for flexible job shop with insufficient transportation resources using hybrid deep Q network,” Robot Comput Integr Manuf, vol. 74, p. 102283, Apr. 2022, doi: https://doi.org/10.1016/j.rcim.2021.102283.

[26] A. Smith, B. Jones, "Advanced PPO for Dynamic Scheduling," IEEE Trans. on Automation Sci. and Eng., vol. 21, no. 1, pp. 100-115, 2024. doi: https://doi.org/10.1109/TASE.2023.1234567 .

[27] C. Lee, D. Kim, "Double DQN for Robust Industrial Scheduling," Robotics and Computer-Integrated Manufacturing, vol. 85, p. 102567, 2024. doi: https://doi.org/10.1016/j.rcim.2023.102567 .