Two reinforcement learning strategies based meta-heuristics for scheduling partial reentrant distributed flow-shops.
| Author | Jia, Yanan |
| Author | Gao, Kaizhou |
| Author | Ren, Yaxian |
| Author | Suganthan, Ponnuthurai Nagaratnam |
| Author | Sang, Hongyan |
| Available date | 2025-11-09T11:43:06Z |
| Publication Date | 2025-10 |
| Publication Name | Journal of Industrial and Management Optimization |
| Identifier | http://dx.doi.org/10.3934/jimo.2025127 |
| Citation | Jia, Y., Gao, K., Ren, Y., Suganthan, P. N., & Sang, H. (2025). Two reinforcement learning strategies based meta-heuristics for scheduling partial reentrant distributed flow-shops. Journal of Industrial and Management Optimization, 21(10), 6167-6189. |
| ISSN | 1547-5816 |
| Abstract | Reentrant or partial reentrant widely exists in practical manufacturing scenarios, which is rarely considered in literature. This work investigates a distributed flow-shop scheduling problem with partial reentrant constraint (DFSP PR). The objective is to minimize the maximum completion time (makespan). First, a mathematical model for the DFSP PR is developed, which integrates the characteristics of partial reentrant and distributed manufacturing scenarios. Second, three meta-heuristics are employed and enhanced to solve the concerned problems. The Nawaz-Enscore-Ham (NEH) heuristic is used to initialize the population. Based on the nature of the DFSP PR, six local search strategies are designed to improve the convergence efficiency of meta-heuristics. Third, two cutting-edge reinforcement learning algorithms, Q-learning and state-action-reward-state’-action’ (SARSA), are integrated into the meta-heuristics to select the most effective local search strategy during iterations. Finally, comprehensive experiments on 48 benchmark instances with varying scales demonstrate the effectiveness of the proposed approaches, where Q-learning and SARSA significantly improving the performance of the meta-heuristics. |
| Language | en |
| Publisher | American Institute of Mathematical Sciences |
| Subject | Distributed scheduling flow-shop meta-heuristics partial reentrant Q-learning SARSA |
| Type | Article |
| Pagination | 6167-6189 |
| Issue Number | 10 |
| Volume Number | 21 |
Files in this item
This item appears in the following Collection(s)
-
Computer Science & Engineering [2491 items ]


