2024龍年开工的第一篇原创文章,给大家带来2023年基于强化学习求解调度的文章汇总(去年也分享过2022年强化学习求解车间调度文章大汇总)。在读博期间研究的也是这个方向,所以平时也一直在关注着相关动态,今天分享出来供大家参考。关注公众号《智能制造与智能调度》,后台回复“2023”获取参考文献PDF。
从1995年最早将强化学习用于车间调度问题后,在随后的几年里,强化学习一直不温不火,最主要的原因是一般的强化学习无法解决状态空间爆炸的问题,直到2018年深度强化学习开始进军调度领域,并在随后的几年里爆发式增长。尤其是在2023年,更是惊人地出现了至少141篇相关文章,感觉不用强化学习算法都不好意思上街。一方面深度强化学习确实利用深度学习领域技术实现了未知状态下行为的预测,另一方面车间调度一直是悬而未决的经典问题,也是检验包括深度强化学习在内的各种算法的测试床。
- [1] F. N. Li, S. Lang, B. Y. Hong, and T. Reggelin, "A two-stage RNN-based deep reinforcement learning approach for solving the parallel machine scheduling problem with due dates and family setups," Journal of Intelligent Manufacturing, 2023, doi: 10.1007/s10845-023-02094-4.
- [2] X. Yang, H. Wang, and L. Li, "Train operation scheduling optimization based on deep reinforcement learning," in Journal of Physics: Conference Series, 2023, vol. 2558, no. 1: IOP Publishing, p. 012042.
- [3] J. M. Zhang, T. Wang, and L. L. Cheng, "Time-Sensitive and Resource-Aware Concurrent Workflow Scheduling for Edge Computing Platforms Based on Deep Reinforcement Learning," Applied Sciences-Basel, vol. 13, no. 19, Oct 2023, Art no. 10689, doi: 10.3390/app131910689.
- [4] X. Wu and X. Yan, "A spatial pyramid pooling-based deep reinforcement learning model for dynamic job-shop scheduling problem," Computers & Operations Research, vol. 160, p. 106401, 2023/12/01/ 2023, doi: doi.org/10.1016/j.c….
- [5] T. Zhou, L. Luo, Y. X. He, Z. W. Fan, and S. C. Ji, "Solving Panel Block Assembly Line Scheduling Problem via a Novel Deep Reinforcement Learning Approach," Applied Sciences-Basel, vol. 13, no. 14, Jul 2023, Art no. 8483, doi: 10.3390/app13148483.
- [6] Z. Wang, B. Cai, J. Li, D. Yang, Y. Zhao, and H. Xie, "Solving non-permutation flow-shop scheduling problem via a novel deep reinforcement learning approach," Computers & Operations Research, vol. 151, p. 106095, 2023/03/01/ 2023, doi: doi.org/10.1016/j.c….
- [7] E. Yuan, S. Cheng, L. Wang, S. Song, and F. Wu, "Solving job shop scheduling problems via deep reinforcement learning," Applied Soft Computing, vol. 143, 2023, doi: 10.1016/j.asoc.2023.110436.
- [8] Z. Q. Wang and W. Z. Liao, "Smart scheduling of dynamic job shop based on discrete event simulation and deep reinforcement learning," Journal of Intelligent Manufacturing, 2023 Jun 2023, doi: 10.1007/s10845-023-02161-w.
- [9] Y. Ping, Y. Liu, L. Zhang, L. Wang, and X. Xu, "Sequence generation for multi-task scheduling in cloud manufacturing with deep reinforcement learning," Journal of Manufacturing Systems, vol. 67, pp. 315-337, 2023, doi: 10.1016/j.jmsy.2023.02.009.
- [10] X. Y. Liu, L. Liu, and T. H. Jiang, "A self-learning interior search algorithm based on reinforcement learning for energy-aware job shop scheduling problem with outsourcing option," Journal of Intelligent & Fuzzy Systems, vol. 44, no. 6, pp. 10085-10100, 2023, doi: 10.3233/jifs-224624.
- [11] Y. M. Gu, M. Chen, and L. Wang, "A self-learning discrete salp swarm algorithm based on deep reinforcement learning for dynamic job shop scheduling problem," Applied Intelligence, 2023, doi: 10.1007/s10489-023-04479-7.
- [12] C. W. de Puiseau, J. Peters, C. Dorpelkus, H. Tercan, and T. Meisen, "schlably: A Python framework for deep reinforcement learning based scheduling experiments," Softwarex, vol. 22, May 2023, Art no. 101383, doi: 10.1016/j.softx.2023.101383.
- [13] K. ZHANG, L. BI, and X. JIAO, "Research on Flexible Job-shop Scheduling Problems with Integrated Reinforcement Learning Algorithm," China Mechanical Engineering, vol. 34, no. 02, p. 201, 2023.
- [14] M. H. Yuan, L. Zheng, H. Y. Huang, K. W. Zhou, F. Q. Pei, and W. B. Gu, "Research on flexible job shop scheduling problem with AGV using double DQN," Journal of Intelligent Manufacturing, 2023 Nov 2023, doi: 10.1007/s10845-023-02252-8.
- [15] L. B. Deng, Y. Z. Di, and L. Wang, "A Reinforcement-Learning-Based 3-D Estimation of Distribution Algorithm for Fuzzy Distributed Hybrid Flow-Shop Scheduling Considering On-Time-Delivery," Ieee Transactions on Cybernetics, 2023 Dec 2023, doi: 10.1109/tcyb.2023.3336656.
- [16] J. Hu, Reinforcement Learning for Planning and Scheduling in Aviation. 2023.
- [17] Y. T. Wu, L. Wang, J. F. Chen, J. Zheng, and Z. X. Pan, "A reinforcement learning driven two-stage evolutionary optimisation for hybrid seru system scheduling with worker transfer," International Journal of Production Research, 2023 Aug 2023, doi: 10.1080/00207543.2023.2252523.
- [18] Y. D. Chen, J. L. Ding, and Q. D. Chen, "A Reinforcement Learning Based Large-Scale Refinery Production Scheduling Algorithm," Ieee Transactions on Automation Science and Engineering, 2023 Oct 2023, doi: 10.1109/tase.2023.3321612.
- [19] T. Zhou, L. Luo, S. Ji, and Y. He, "A Reinforcement Learning Approach to Robust Scheduling of Permutation Flow Shop," Biomimetics (Basel, Switzerland), vol. 8, no. 6, 2023 Oct 2023, doi: 10.3390/biomimetics8060478.
- [20] A. Esteso, D. Peidro, J. Mula, and M. Diaz-Madronero, "Reinforcement learning applied to production planning and control," International Journal of Production Research, vol. 61, no. 16, pp. 5772-5789, Aug 2023, doi: 10.1080/00207543.2022.2104180.
- [21] C. Yang et al., "Reinforcement Learning and Stochastic Optimization with Deep Learning-Based Forecasting on Power Grid Scheduling," Processes, vol. 11, no. 11, Nov 2023, Art no. 3188, doi: 10.3390/pr11113188.
- [22] A. Sabri, H. Allaoui, and O. Souissi, "Reinforcement learning and stochastic dynamic programming for jointly scheduling jobs and preventive maintenance on a single machine to minimise earliness-tardiness," International Journal of Production Research, pp. 1-15, 2023, doi: 10.1080/00207543.2023.2172472.
- [23] Z. Y. Sun, W. M. Han, and L. L. Gao, "Real-time scheduling for dynamic workshops with random new job insertions by using deep reinforcement learning," Advances in Production Engineering & Management, vol. 18, no. 2, pp. 137-151, Jun 2023, doi: 10.14743/apem2023.2.462.
- [24] C. C. Lin, K. Y. Chen, and L. T. Hsieh, "Real-Time Charging Scheduling of Automated Guided Vehicles in Cyber-Physical Smart Factories Using Feature-Based Reinforcement Learning," Ieee Transactions on Intelligent Transportation Systems, 2023, doi: 10.1109/tits.2023.3234010.
- [25] S. L. Yang, J. Y. Wang, L. M. Xin, and Z. G. Xu, "Real-time and concurrent optimization of scheduling and reconfiguration for dynamic reconfigurable flow shop using deep reinforcement learning," Cirp Journal of Manufacturing Science and Technology, vol. 40, pp. 243-252, Feb 2023, doi: 10.1016/j.cirpj.2022.12.001.
- [26] Z. Q. Zhang, F. C. Wu, B. Qian, R. Hu, L. Wang, and H. P. Jin, "A Q-learning-based hyper-heuristic evolutionary algorithm for the distributed flexible job-shop scheduling problem with crane transportation," Expert Systems with Applications, vol. 234, Dec 2023, Art no. 121050, doi: 10.1016/j.eswa.2023.121050.
- [27] Z. Q. Zhang, B. Qian, R. Hu, and J. B. Yang, "Q-learning-based hyper-heuristic evolutionary algorithm for the distributed assembly blocking flowshop scheduling problem," Applied Soft Computing, vol. 146, Oct 2023, Art no. 110695, doi: 10.1016/j.asoc.2023.110695.
- [28] Y. H. Jia, Q. Yan, and H. F. Wang, "Q-learning driven multi-population memetic algorithm for distributed three-stage assembly hybrid flow shop scheduling with flexible preventive maintenance," Expert Systems with Applications, vol. 232, Dec 2023, Art no. 120837, doi: 10.1016/j.eswa.2023.120837.
- [29] J. Wang, H. T. Tang, and D. M. Lei, "A Q-learning artificial bee colony for distributed assembly flow shop scheduling with factory eligibility, transportation capacity and setup time," Engineering Applications of Artificial Intelligence, vol. 123, Aug 2023, Art no. 106230, doi: 10.1016/j.engappai.2023.106230.
- [30] S. Hubert, J. Meintschel, D. Bleidorn, Y. Ortmanns, and R. Wallrath, "Production Scheduling Using Deep Reinforcement Learning and Discrete Event Simulation," Chemie Ingenieur Technik, 2023 May 2023, doi: 10.1002/cite.202200242.
- [31] H. J. Xiong, J. J. Chen, S. Rong, and A. W. Zhang, "Power Battery Scheduling Optimization Based on Double DQN Algorithm with Constraints," Applied Sciences-Basel, vol. 13, no. 13, Jul 2023, Art no. 7702, doi: 10.3390/app13137702.
- [32] D. P. Qiao, L. Duan, H. L. Li, and Y. Q. Xiao, "Optimization of job shop scheduling problem based on deep reinforcement learning," Evolutionary Intelligence, 2023 Oct 2023, doi: 10.1007/s12065-023-00885-5.
- [33] J. Deng, S. Sierla, J. Sun, and V. Vyatkin, "Offline reinforcement learning for industrial process control: A case study from steel industry," Information Sciences, vol. 632, pp. 221-231, 2023, doi: 10.1016/j.ins.2023.03.019.
- [34] J. P. Huang, L. Gao, X. Y. Li, and C. J. Zhang, "A novel priority dispatch rule generation method based on graph neural network and reinforcement learning for distributed job-shop scheduling," Journal of Manufacturing Systems, vol. 69, pp. 119-134, Aug 2023, doi: 10.1016/j.jmsy.2023.06.007.
- [35] M. Kong, W. Z. Wang, M. Deveci, Y. J. Zhang, X. Z. Wu, and D. Coffman, "A novel carbon reduction engineering method-based deep Q-learning algorithm for energy-efficient scheduling on a single batch-processing machine in semiconductor manufacturing," International Journal of Production Research, 2023 Sep 2023, doi: 10.1080/00207543.2023.2252932.
- [36] K. Zhong, L. Zuo, J. Zhang, J. Guo, and Y. Lv, "Multi-variety small-batch production workshop scheduling method based on deep reinforcement learning, involves introducing random calculation example, where random calculation example is a calculation example of parameter experiment," Patent CN116757396-A, 2023. [Online]. Available: ://DIIDW:2023A69467
- [37] Y. H. Niu, F. X. Zhu, M. X. Wei, Y. F. Du, and P. Y. Zhai, "A Multi-Ship Collision Avoidance Algorithm Using Data-Driven Multi-Agent Deep Reinforcement Learning," Journal of Marine Science and Engineering, vol. 11, no. 11, Nov 2023, Art no. 2101, doi: 10.3390/jmse11112101.
- [38] F. E. Achamrah and A. Attajer, "Multi-objective reinforcement learning-based framework for solving selective maintenance problems in reconfigurable cyber-physical manufacturing systems," International Journal of Production Research, 2023 Jul 2023, doi: 10.1080/00207543.2023.2240433.
- [39] S. R. Chen, Y. M. Tian, and L. L. An, "Multi-Objective Order Scheduling via Reinforcement Learning," Algorithms, vol. 16, no. 11, Nov 2023, Art no. 495, doi: 10.3390/a16110495.
- [40] F. F. Cui, X. X. Lin, R. N. Zhang, and Q. Y. Yang, "Multi-objective optimal scheduling of charging stations based on deep reinforcement learning," Frontiers in Energy Research, vol. 10, Jan 2023, Art no. 1042882, doi: 10.3389/fenrg.2022.1042882.
- [41] A. F. Inal, C. Sel, A. Aktepe, A. K. Turker, and S. Ersoz, "A Multi-Agent Reinforcement Learning Approach to the Dynamic Job Shop Scheduling Problem," Sustainability, vol. 15, no. 10, May 2023, Art no. 8262, doi: 10.3390/su15108262.
- [42] M. H. Yuan, H. Y. Huang, Z. C. Li, C. X. Zhang, F. Q. Pei, and W. B. Gu, "A multi-agent double Deep-Q-network based on state machine and event stream for flexible job shop scheduling problem," Advanced Engineering Informatics, vol. 58, Oct 2023, Art no. 102230, doi: 10.1016/j.aei.2023.102230.
- [43] Y. Q. Cheng, Z. Y. Cao, X. G. Zhang, Q. L. Cao, and D. Z. Zhang, "Multi objective dynamic task scheduling optimization algorithm based on deep reinforcement learning," Journal of Supercomputing, 2023 Oct 2023, doi: 10.1007/s11227-023-05714-1.
- [44] J. H. Li et al., "Modeling of Digital Twin Workshop in Planning via a Graph Neural Network: The Case of an Ocean Engineering Manufacturing Intelligent Workshop," Applied Sciences-Basel, vol. 13, no. 18, Sep 2023, Art no. 10134, doi: 10.3390/app131810134.
- [45] K. C. Ying and S. W. Lin, "Minimizing makespan in two-stage assembly additive manufacturing: A reinforcement learning iterated greedy algorithm," Applied Soft Computing, vol. 138, May 2023, Art no. 110190, doi: 10.1016/j.asoc.2023.110190.
- [46] Y. Fang, D. Shi, C. Zheng, L. Kang, and J. Chang, "Method for real-time scheduling of flexible operation workshop, involves scheduling workshop again according to sum of procedure number, when current dispatching number is smaller than or equal to sum of procedure number," Patent CN116880425-A, 2023. [Online]. Available: ://DIIDW:2023A95123
- [47] F. Grumbach, N. E. A. Badr, P. Reusch, and S. Trojahn, "A Memetic Algorithm With Reinforcement Learning for Sociotechnical Production Scheduling," Ieee Access, vol. 11, pp. 68760-68775, 2023, doi: 10.1109/access.2023.3292548.
- [48] Z. A. Bukhsh, H. Molegraaf, and N. Jansen, "A maintenance planning framework using online and offline deep reinforcement learning," Neural Computing & Applications, 2023, doi: 10.1007/s00521-023-08560-7.
- [49] H.-J. Kim and J.-H. Lee, "Look-ahead based reinforcement learning for robotic flow shop scheduling," Journal of Manufacturing Systems, vol. 68, pp. 160-175, 2023, doi: 10.1016/j.jmsy.2023.02.002.
- [50] G. P. Jia, Y. H. Zhang, S. Q. Shen, B. Z. Liu, X. F. Hu, and C. X. Wu, "Load Balancing of Two-Sided Assembly Line Based on Deep Reinforcement Learning," Applied Sciences-Basel, vol. 13, no. 13, Jul 2023, Art no. 7439, doi: 10.3390/app13137439.
- [51] M. Karimi-Mamaghan, M. Mohammadi, B. Pasdeloup, and P. Meyer, "Learning to select operators in meta-heuristics: An integration of Q-learning into the iterated greedy algorithm for the permutation flowshop scheduling problem," European Journal of Operational Research, vol. 304, no. 3, pp. 1296-1330, Feb 2023, doi: 10.1016/j.ejor.2022.03.054.
- [52] A. D. Workneh and M. Gmira, "Learning to schedule (L2S): adaptive job shop scheduling using double deep Q network," Smart Science, pp. 1-15, 2023, doi: 10.1080/23080477.2023.2187528.
- [53] Z. X. Pan, L. Wang, C. X. Dong, and J. F. Chen, "A Knowledge-Guided End-to-End Optimization Framework based on Reinforcement Learning for Flow Shop Scheduling," Ieee Transactions on Industrial Informatics, 2023 Jun 2023, doi: 10.1109/tii.2023.3282313.
- [54] J. M. Hu, H. Y. Wang, H. K. Tang, T. Kanazawa, C. Gupta, and A. Farahat, "Knowledge-enhanced reinforcement learning for multi-machine integrated production and maintenance scheduling," Computers & Industrial Engineering, vol. 185, Nov 2023, Art no. 109631, doi: 10.1016/j.cie.2023.109631.
- [55] F. Q. Zhao, G. Zhou, T. P. Xu, N. N. Zhu, and Jonrinaldi, "A knowledge-driven cooperative scatter search algorithm with reinforcement learning for the distributed blocking flow shop scheduling problem," Expert Systems with Applications, vol. 230, Nov 2023, Art no. 120571, doi: 10.1016/j.eswa.2023.120571.
- [56] G. Bonetta, D. Zago, R. Cancelliere, and A. J. a. p. a. Grosso, "Job Shop Scheduling via Deep Reinforcement Learning: a Sequence to Sequence approach," 2023.
- [57] A. Momenikorbekandi and M. Abbod, "Intelligent Scheduling Based on Reinforcement Learning Approaches: Applying Advanced Q-Learning and State-Action-Reward-State-Action Reinforcement Learning Models for the Optimisation of Job Shop Scheduling Problems," Electronics, vol. 12, no. 23, Dec 2023, Art no. 4752, doi: 10.3390/electronics12234752.
- [58] Y. S. Liu, J. X. Fan, L. L. Zhao, W. M. Shen, and C. J. Zhang, "Integration of deep reinforcement learning and multi-agent system for dynamic scheduling of re-entrant hybrid flow shop considering worker fatigue and skill levels," Robotics and Computer-Integrated Manufacturing, vol. 84, Dec 2023, Art no. 102605, doi: 10.1016/j.rcim.2023.102605.
- [59] L. L. Zhu, F. Wu, Y. F. Hu, K. Huang, and X. M. Tian, "A heuristic multi-objective task scheduling framework for container-based clouds via actor-critic reinforcement learning," Neural Computing & Applications, 2023, doi: 10.1007/s00521-023-08208-6.
- [60] G. Gebreyesus, G. Fellek, A. Farid, S. Fujimura, and O. Yoshie, "Gated‐Attention Model with Reinforcement Learning for Solving Dynamic Job Shop Scheduling Problem," IEEJ Transactions on Electrical and Electronic Engineering, vol. 18, no. 6, pp. 932-944, 2023, doi: 10.1002/tee.23788.
- [61] W. Song, X. Y. Chen, Q. Q. Li, and Z. G. Cao, "Flexible Job-Shop Scheduling via Graph Neural Network and Deep Reinforcement Learning," Ieee Transactions on Industrial Informatics, vol. 19, no. 2, pp. 1600-1610, Feb 2023, doi: 10.1109/tii.2022.3189725.
- [62] R. Wang, G. Wang, J. Sun, F. Deng, and J. J. a. p. a. Chen, "Flexible Job Shop Scheduling via Dual Attention Network Based Reinforcement Learning," 2023.
- [63] L. X. Yun, D. Wang, and L. Li, "Explainable multi-agent deep reinforcement learning for real-time demand response towards sustainable manufacturing," Applied Energy, vol. 347, Oct 2023, Art no. 121324, doi: 10.1016/j.apenergy.2023.121324.
- [64] D. T. Zeng, J. Zhan, W. Peng, and Z. R. Zeng, "Evolutionary job scheduling with optimized population by deep reinforcement learning," Engineering Optimization, vol. 55, no. 3, pp. 494-509, Mar 2023, doi: 10.1080/0305215x.2021.2013479.
- [65] C. Su et al., "Evolution strategies-based optimized graph reinforcement learning for solving dynamic job shop scheduling problem," Applied Soft Computing, vol. 145, p. 110596, 2023/09/01/ 2023, doi: doi.org/10.1016/j.a….
- [66] Z. F. Wu, H. B. Fan, Y. M. Sun, and M. Y. Peng, "Efficient Multi-Objective Optimization on Dynamic Flexible Job Shop Scheduling Using Deep Reinforcement Learning Approach," Processes, vol. 11, no. 7, Jul 2023, Art no. 2018, doi: 10.3390/pr11072018.
- [67] Z. Li and Y. J. Chen, "Dynamic scheduling of multi-memory process flexible job shop problem based on digital twin," Computers & Industrial Engineering, vol. 183, Sep 2023, Art no. 109498, doi: 10.1016/j.cie.2023.109498.
- [68] M. H. Xia, H. B. Liu, M. F. Li, and L. Wang, "A dynamic scheduling method with Conv-Dueling and generalized representation based on reinforcement learning," International Journal of Industrial Engineering Computations, 2023 Jun 2023, doi: 10.5267/j.ijiec.2023.6.003.
- [69] Y. Gui, D. B. Tang, H. H. Zhu, Y. Zhang, and Z. Q. Zhang, "Dynamic scheduling for flexible job shop using a deep reinforcement learning approach," Computers & Industrial Engineering, vol. 180, Jun 2023, Art no. 109255, doi: 10.1016/j.cie.2023.109255.
- [70] X. M. Gan, Y. Zuo, G. C. Yang, A. S. Zhang, and F. Tao, "Dynamic scheduling for dual-objective job shop with machine breakdown by reinforcement learning," Proceedings of the Institution of Mechanical Engineers Part B-Journal of Engineering Manufacture, 2023 Apr 2023, doi: 10.1177/09544054231167086.
- [71] Z. J. Qin, D. Johnson, and Y. Q. Lu, "Dynamic production scheduling towards self-organizing mass personalization: A multi-agent dueling deep reinforcement learning approach," Journal of Manufacturing Systems, vol. 68, pp. 242-257, Jun 2023, doi: 10.1016/j.jmsy.2023.03.003.
- [72] C. L. Liu, C. J. Tseng, T. H. Huang, and J. W. Wang, "Dynamic Parallel Machine Scheduling With Deep Q-Network," Ieee Transactions on Systems Man Cybernetics-Systems, 2023 Jul 2023, doi: 10.1109/tsmc.2023.3289322.
- [73] C. L. Liu and T. H. Huang, "Dynamic Job-Shop Scheduling Problems Using Graph Neural Network and Deep Reinforcement Learning," Ieee Transactions on Systems Man Cybernetics-Systems, 2023 Jul 2023, doi: 10.1109/tsmc.2023.3287655.
- [74] X. Liang, W. Song, and P. Wei, "Dynamic Job Shop Scheduling via Deep Reinforcement Learning," in 2023 IEEE 35th International Conference on Tools with Artificial Intelligence (ICTAI), 2023: IEEE, pp. 369-376.
- [75] D. Heik, F. Bahrpeyma, and D. Reichelt, "Dynamic Job Shop Scheduling in an Industrial Assembly Environment Using Various Reinforcement Learning Techniques," Cham, 2023: Springer Nature Switzerland, in Intelligent Systems Design and Applications, pp. 523-533.
- [76] J. D. Zhang and J. C. Cai, "A Dual-Population Genetic Algorithm with Q-Learning for Multi-Objective Distributed Hybrid Flow Shop Scheduling Problem," Symmetry-Basel, vol. 15, no. 4, Apr 2023, Art no. 836, doi: 10.3390/sym15040836.
- [77] J. W. Leng et al., "Dual deep reinforcement learning agents-based integrated order acceptance and scheduling of mass individualized prototyping," Journal of Cleaner Production, vol. 427, Nov 2023, Art no. 139249, doi: 10.1016/j.jclepro.2023.139249.
- [78] L. L. Zhao, J. X. Fan, C. J. Zhang, W. M. Shen, and J. Zhuang, "A DRL-Based Reactive Scheduling Policy for Flexible Job Shops With Random Job Arrivals," Ieee Transactions on Automation Science and Engineering, 2023 May 2023, doi: 10.1109/tase.2023.3271666.
- [79] R. Li, W. Y. Gong, L. Wang, C. Lu, Z. X. Pan, and X. Y. Zhuang, "Double DQN-Based Coevolution for Green Distributed Heterogeneous Hybrid Flowshop Scheduling With Multiple Priorities of Jobs," Ieee Transactions on Automation Science and Engineering, 2023 Nov 2023, doi: 10.1109/tase.2023.3327792.
- [80] Y. M. Ma, J. W. Cai, S. Y. Li, J. Liu, J. M. Xing, and F. Qiao, "Double deep Q-network-based self-adaptive scheduling approach for smart shop floor," Neural Computing & Applications, 2023 Aug 2023, doi: 10.1007/s00521-023-08877-3.
- [81] C. Zhang, L. Gao, J. Huang, and X. Li, "Distributed operation workshop scheduling method based on end-to-end deep reinforcement learning, involves obtaining scheduling solution, so as to determine processing sequence until obtaining corresponding actions at all times," Patent CN117057528-A, 2023. [Online]. Available: ://DIIDW:2023C77405
- [82] X. Gan, Y. Zuo, A. Zhang, S. Li, and F. Tao, "Digital twin-enabled adaptive scheduling strategy based on deep reinforcement learning," Science China Technological Sciences, 2023, doi: 10.1007/s11431-022-2413-5.
- [83] R. Zhang, J. H. Lv, J. S. Bao, and Y. Zheng, "A digital twin-driven flexible scheduling method in a human-machine collaborative workshop based on hierarchical reinforcement learning," Flexible Services and Manufacturing Journal, 2023 May 2023, doi: 10.1007/s10696-023-09498-7.
- [84] Q. Hao and Y. Lv, "A Digital Twin-Based Production-Maintenance Joint Scheduling Framework with Reinforcement Learning," in 2023 8th International Conference on Control and Robotics Engineering (ICCRE), 21-23 April 2023 2023, pp. 51-56, doi: 10.1109/ICCRE57112.2023.10155592.
- [85] J. D. Zhang, Z. X. He, W. H. Chan, and C. Y. Chow, "DeepMAG: Deep reinforcement learning with multi-agent graphs for flexible job shop scheduling," Knowledge-Based Systems, vol. 259, Jan 2023, Art no. 110083, doi: 10.1016/j.knosys.2022.110083.
- [86] X. Fu and J. G. Kim, "Deep-Q-Network-Based Packet Scheduling in an IoT Environment," Sensors, vol. 23, no. 3, Feb 2023, Art no. 1339, doi: 10.3390/s23031339.
- [87] D. H. Lee, H. Kang, D. Lee, J. Lee, and K. Kim, "Deep Reinforcement Learning-Based Scheduler on Parallel Dedicated Machine Scheduling Problem towards Minimizing Total Tardiness," Sustainability, vol. 15, no. 4, Feb 2023, Art no. 2920, doi: 10.3390/su15042920.
- [88] J. W. Ou et al., "Deep reinforcement learning method for satellite range scheduling problem," Swarm and Evolutionary Computation, vol. 77, Mar 2023, Art no. 101233, doi: 10.1016/j.swevo.2023.101233.
- [89] C. X. Li, P. Zheng, Y. Yin, B. C. Wang, and L. H. Wang, "Deep reinforcement learning in smart manufacturing: A review and prospects," Cirp Journal of Manufacturing Science and Technology, vol. 40, pp. 75-101, Feb 2023, doi: 10.1016/j.cirpj.2022.11.003.
- [90] R. Chen, W. Li, and H. Yang, "A Deep Reinforcement Learning Framework Based on an Attention Mechanism and Disjunctive Graph Embedding for the Job-Shop Scheduling Problem," IEEE Transactions on Industrial Informatics, vol. 19, no. 2, pp. 1322-1331, 2023, doi: 10.1109/TII.2022.3167380.
- [91] J. Chun, W. Y. Yang, X. L. Liu, G. H. Wu, L. He, and L. N. Xing, "Deep Reinforcement Learning for the Agile Earth Observation Satellite Scheduling Problem," Mathematics, vol. 11, no. 19, Oct 2023, Art no. 4059, doi: 10.3390/math11194059.
- [92] R. L. Pan, Q. Wang, J. H. Cao, and C. L. Zhou, "Deep reinforcement learning for solving steelmaking-continuous casting scheduling problems under time-of-use tariffs," International Journal of Production Research, 2023 Oct 2023, doi: 10.1080/00207543.2023.2267693.
- [93] J. Luo, B. Wang, S. Yuan, W. Zhang, and T. Li, "Deep reinforcement learning for solving hybrid flow shop scheduling problem with unrelated parallel machines," in 2023 8th International Conference on Intelligent Computing and Signal Processing (ICSP), 21-23 April 2023 2023, pp. 1642-1645, doi: 10.1109/ICSP58490.2023.10248543. [Online]. Available: doi.ieeecomputersociety.org/10.1109/ICS…
- [94] L. Zhang et al., "Deep reinforcement learning for dynamic flexible job shop scheduling problem considering variable processing times," Journal of Manufacturing Systems, vol. 71, pp. 257-273, Dec 2023, doi: 10.1016/j.jmsy.2023.09.009.
- [95] G. L. G. Che, Y. Y. Zhang, L. X. Tang, and S. N. Zhao, "A deep reinforcement learning based multi-objective optimization for the scheduling of oxygen production system in integrated iron and steel plants," Applied Energy, vol. 345, Sep 2023, Art no. 121332, doi: 10.1016/j.apenergy.2023.121332.
- [96] M. Panzer, B. Bender, and N. Gronau, "A deep reinforcement learning based hyper-heuristic for modular production control," International Journal of Production Research, 2023 Jul 2023, doi: 10.1080/00207543.2023.2233641.
- [97] K. M. Kim, H. L. Rho, B. H. Park, and Y. H. Min, "Deep Q-Network Approach for Train Timetable Rescheduling Based on Alternative Graph," Applied Sciences-Basel, vol. 13, no. 17, Sep 2023, Art no. 9547, doi: 10.3390/app13179547.
- [98] A. D. Workneh and M. Gmira, "Deep Q Network Method for Dynamic Job Shop Scheduling Problem," in International Conference on Artificial Intelligence & Industrial Applications, 2023: Springer, pp. 137-155.
- [99] R. K. Liu, R. Piplani, and C. Toro, "A deep multi-agent reinforcement learning approach to solve dynamic job shop scheduling problem," Computers & Operations Research, vol. 159, Nov 2023, Art no. 106294, doi: 10.1016/j.cor.2023.106294.
- [100] N. M. Zhang, Y. L. Shen, Y. Du, L. L. Chen, and X. Zhang, "Counterfactual-attention multi-agent reinforcement learning for joint condition-based maintenance and production scheduling," Journal of Manufacturing Systems, vol. 71, pp. 70-81, Dec 2023, doi: 10.1016/j.jmsy.2023.08.011.
- [101] F. Zhao, G. Zhou, and L. Wang, "A Cooperative Scatter Search With Reinforcement Learning Mechanism for the Distributed Permutation Flowshop Scheduling Problem With Sequence-Dependent Setup Times," IEEE Transactions on Systems, Man, and Cybernetics: Systems, pp. 1-13, 2023, doi: 10.1109/tsmc.2023.3256484.
- [102] J. P. Huang, L. Gao, X. Y. Li, and C. J. Zhang, "A cooperative hierarchical deep reinforcement learning based multi-agent method for distributed job shop scheduling problem with random job arrivals," Computers & Industrial Engineering, vol. 185, Nov 2023, Art no. 109650, doi: 10.1016/j.cie.2023.109650.
- [103] Z. Yang, L. Bi, and X. G. Jiao, "Combining Reinforcement Learning Algorithms with Graph Neural Networks to Solve Dynamic Job Shop Scheduling Problems," Processes, vol. 11, no. 5, May 2023, Art no. 1571, doi: 10.3390/pr11051571.
- [104] H. Z. Bao, Q. K. Pan, R. Ruiz, and L. Gao, "A collaborative iterated greedy algorithm with reinforcement learning for energy-aware distributed blocking flow-shop scheduling," Swarm and Evolutionary Computation, vol. 83, Dec 2023, Art no. 101399, doi: 10.1016/j.swevo.2023.101399.
- [105] C. Z. Sun, H. L. Wu, Q. Lu, Y. C. Wang, Y. M. Liu, and J. B. Tan, "Blade sequencing optimization of aero-engine based on deep reinforcement learning," Aerospace Science and Technology, vol. 142, Nov 2023, Art no. 108580, doi: 10.1016/j.ast.2023.108580.
- [106] B. Wang, K. Feng, and X. Z. Wang, "Bi-objective scenario-guided swarm intelligent algorithms based on reinforcement learning for robust unrelated parallel machines scheduling with setup times," Swarm and Evolutionary Computation, vol. 80, Jul 2023, Art no. 101321, doi: 10.1016/j.swevo.2023.101321.
- [107] Y. Zeitrag and J. R. Figueira, "Automatically evolving preference-based dispatching rules for multi-objective job shop scheduling," Journal of Scheduling, 2023, doi: 10.1007/s10951-023-00783-9.
- [108] M. Del Gallo, G. Mazzuto, F. E. Ciarapica, and M. Bevilacqua, "Artificial Intelligence to Solve Production Scheduling Problems in Real Industrial Settings: Systematic Literature Review," Electronics, vol. 12, no. 23, Dec 2023, Art no. 4732, doi: 10.3390/electronics12234732.
- [109] C. Silva, P. Andrade, B. Ribeiro, and B. F. Santos, "Adaptive reinforcement learning for task scheduling in aircraft maintenance," Scientific Reports, vol. 13, no. 1, p. 16605, 2023/10/03 2023, doi: 10.1038/s41598-023-41169-3.
- [110] M. Waseem and Q. Chang, "Adaptive Mobile Robot Scheduling in Multiproduct Flexible Manufacturing Systems Using Reinforcement Learning," Journal of Manufacturing Science and Engineering-Transactions of the Asme, vol. 145, no. 12, Dec 2023, Art no. 121005, doi: 10.1115/1.4062941.
- [111] 吕金旭 and 葛万成, "智能制造领域中深度强化学习的应用综述," 信息与电脑(理论版), vol. 35, no. 05, pp. 186-188+193, 2023.
- [112] 李新宇, 黄江平, 李嘉航, 李育鑫, and 高亮, "智能车间动态调度的研究与发展趋势分析," 中国科学:技术科学, vol. 53, no. 07, pp. 1016-1030, 2023. [Online]. Available: kns.cnki.net/kcms/detail….
- [113] 蒋权 and 魏静萱, "用于动态柔性作业车间调度的实时调度方法," 系统仿真学报, pp. 1-14, 2023, doi: 10.16182/j.issn1004731x.joss.23-0385.
- [114] 苗宽 and 李崇寿, "一种基于修正机制和强化学习的作业车间调度问题的优化算法," 计算机科学, vol. 50, no. 06, pp. 274-282, 2023.
- [115] 李健, 李洹坤, 何鹏博, 王化北, 徐莉萍, and 何奎, "协同智能体强化学习算法的柔性作业车间调度方法研究," 系统仿真学报, pp. 1-13, 2023, doi: 10.16182/j.issn1004731x.joss.23-0978.
- [116] 杨媛媛, 胡蓉, 钱斌, 张长胜, and 金怀平, "深度强化学习算法求解动态流水车间实时调度问题," 控制理论与应用, pp. 1-9, 2023. [Online]. Available: kns.cnki.net/kcms/detail….
- [117] 胡一凡, 张利平, 白雪, and 唐秋华, "深度强化学习求解柔性装配作业车间调度问题," 华中科技大学学报(自然科学版), vol. 51, no. 02, pp. 153-160, 2023, doi: 10.13245/j.hust.230217.
- [118] 姚世洁, "融合强化学习和非支配排序的海鸥优化算法及其在FJSP中的应用," 硕士, 燕山大学, 2023.
- [119] 李权展, "考虑多AGV路径规划的柔性作业车间动态调度," 硕士, 东华大学, 2023.
- [120] 张凯, 毕利, and 焦小刚, "集成强化学习算法的柔性作业车间调度问题研究," 中国机械工程, vol. 34, no. 02, pp. 201-207, 2023. [Online]. Available: kns.cnki.net/kcms/detail….
- [121] 李雅明 and 张斌, "基于Q-learning算法解决多目标柔性作业车间调度问题," 工业控制计算机, vol. 36, no. 09, pp. 112-114, 2023.
- [122] 閤泰梓, 唐秋华, and 成丽新, "基于DQN协同进化算法的柔性作业车间能效调度优化," 计算机集成制造系统, pp. 1-17, 2023. [Online]. Available: kns.cnki.net/kcms/detail….
- [123] 李兴洲, 李艳武, and 谢辉, "基于CNN的深度强化学习算法求解柔性作业车间调度问题," 计算机工程与应用, pp. 1-12, 2023. [Online]. Available: kns.cnki.net/kcms/detail….
- [124] 王亮, 李昊谦, 唐堂, and 于颖, "基于虚拟仿真与深度强化学习的作业车间集成调度," 实验室研究与探索, vol. 42, no. 05, pp. 133-136+178, 2023, doi: 10.19927/j.cnki.syyt.2023.05.027.
- [125] 王志强, "基于深度强化学习的作业车间分批调度优化研究," 硕士, 杭州电子科技大学, 2023.
- [126] 乔东平, 段绿旗, 黎宏磊, and 肖艳秋, "基于深度强化学习的作业车间调度问题优化," 制造技术与机床, no. 04, pp. 148-155, 2023, doi: 10.19287/j.mtmt.1005-2402.2023.04.023.
- [127] 冯斌, 胡轶婕, 黄刚, 姜威, 徐华廷, and 郭创新, "基于深度强化学习的新型电力系统调度优化方法综述," 电力系统自动化, vol. 47, no. 17, pp. 187-199, 2023. [Online]. Available: kns.cnki.net/kcms/detail….
- [128] 曾正旗, "基于深度强化学习的柔性作业车间调度方法研究," 硕士, 华中农业大学, 2023.
- [129] 崔雪艳, 万烂军, 赵昊鑫, and 李长云, "基于深度强化学习的柔性作业车间调度方法," 制造技术与机床, no. 12, pp. 165-170, 2023, doi: 10.19287/j.mtmt.1005-2402.2023.12.024.
- [130] 陈中柘, 刘宇, 朱顺鹏, and 张小玲, "基于深度强化学习的柔性车间动态调度方法及仿真," 实验室研究与探索, vol. 42, no. 04, pp. 107-111+117, 2023, doi: 10.19927/j.cnki.syyt.2023.04.021.
- [131] 朱家政, 王聪, 李新凯, 董颖超, and 张宏立, "基于深度强化学习的离散状态转移算法求解柔性作业车间调度问题," 北京航空航天大学学报, pp. 1-15, 2023, doi: 10.13700/j.bh.1001-5965.2023.0211.
- [132] 倪静 and 马梦珂, "基于深度强化学习的跨单元动态调度方法," 系统仿真学报, vol. 35, no. 11, pp. 2345-2358, 2023, doi: 10.16182/j.issn1004731x.joss.22-0666.
- [133] 陈睿奇, 黎雯馨, 王传洋, and 杨宏兵, "基于深度强化学习的工序交互式智能体Job shop调度方法," 机械工程学报, vol. 59, no. 12, pp. 78-88, 2023. [Online]. Available: kns.cnki.net/kcms/detail….
- [134] 刘锋, "基于深度强化学习的服装缝制生产过程调度方法研究," 硕士, 武汉纺织大学, 2023.
- [135] 张凯, "基于强化学习算法的柔性作业车间调度问题研究," 硕士, 宁夏大学, 2023.
- [136] 杜宇, "基于强化学习的装配式预制件生产调度多目标优化算法研究," 博士, 山东师范大学, 2023.
- [137] 王立群, 唐敦兵, 朱海华, 马国财, and 曹志宏, "基于强化学习的柔性作业车间动态调度算法研究," 机械制造与自动化, vol. 52, no. 03, pp. 1-3+10, 2023, doi: 10.19344/j.cnki.issn1671-5276.2023.03.001.
- [138] 陆心屹 and 韩晓龙, "基于强化学习的改进NSGA-Ⅱ求解柔性作业车间节能调度问题," 现代制造工程, no. 08, pp. 22-35, 2023, doi: 10.16731/j.cnki.1671-3133.2023.08.004.
- [139] 纪志勇, 袁逸萍, 巴智勇, 樊盼盼, and 田芳, "基于多动作深度强化学习的纺机制造车间调度方法," 计算机应用研究, vol. 40, no. 11, pp. 3247-3253, 2023, doi: 10.19734/j.issn.1001-3695.2023.03.0134.
- [140] 吴昊泽, 李艳武, and 谢辉, "改进PPO算法求解柔性作业车间调度问题," 计算机集成制造系统, pp. 1-20, 2023. [Online]. Available: kns.cnki.net/kcms/detail….
- [141] 成金海, 徐华, and 杨金峰, "改进灰狼优化算法求解模糊车间调度问题," 计算机应用研究, pp. 1-10, 2023, doi: 10.19734/j.issn.1001-3695.2022.11.0515.