TY - JOUR
T1 - Learning Optimal Solutions via an LSTM-Optimization Framework
AU - Yilmaz, Dogacan
AU - Büyüktahtakın, I. Esra
N1 - Funding Information:
We gratefully acknowledge the support of the National Science Foundation CAREER Award co-funded by the CBET/ENG Environmental Sustainability program and the Division of Mathematical Sciences in MPS/NSF under Grant No. CBET-1554018. We acknowledge the support of the New Jersey Institute of Technology Academic & Research Computing Systems for the High-Performance Computing resources. We also thank the referees for their constructive reviews.
Publisher Copyright:
© 2023, The Author(s), under exclusive licence to Springer Nature Switzerland AG.
PY - 2023/6
Y1 - 2023/6
N2 - In this study, we present a deep learning-optimization framework to tackle dynamic mixed-integer programs. Specifically, we develop a bidirectional Long Short Term Memory (LSTM) framework that can process information forward and backward in time to learn optimal solutions to sequential decision-making problems. We demonstrate our approach in predicting the optimal decisions for the single-item capacitated lot-sizing problem (CLSP), where a binary variable denotes whether to produce in a period or not. Due to the dynamic nature of the production and inventory levels that must meet the periodic demand, the CLSP can be treated as a sequence labeling task where a recurrent neural network can capture the problem’s temporal dynamics. Computational results show that our LSTM-Optimization (LSTM-Opt) framework significantly reduces the solution time of benchmark CLSP problems without much loss in feasibility and optimality. For example, the predictions at the 85% level reduce the CPLEX solution time by a factor of 9 on average for over 240000 test instances with an optimality gap of less than 0.05% and 0.4% infeasibility in the test set. Also, models trained using shorter planning horizons can successfully predict the optimal solution of the instances with longer planning horizons. For the hardest data set, the LSTM predictions at the 25% level reduce the solution time of 70 CPU hours to less than 2 CPU minutes with an optimality gap of 0.8% and without any infeasibility. The LSTM-Opt framework outperforms classical ML algorithms, such as the logistic regression and random forest, in terms of the solution quality, and exact approaches, such as the (ℓ , S) and dynamic programming-based inequalities, with respect to the solution time improvement. Our machine learning approach could be beneficial in tackling sequential decision-making problems similar to CLSP, which need to be solved repetitively, frequently, and in a fast manner.
AB - In this study, we present a deep learning-optimization framework to tackle dynamic mixed-integer programs. Specifically, we develop a bidirectional Long Short Term Memory (LSTM) framework that can process information forward and backward in time to learn optimal solutions to sequential decision-making problems. We demonstrate our approach in predicting the optimal decisions for the single-item capacitated lot-sizing problem (CLSP), where a binary variable denotes whether to produce in a period or not. Due to the dynamic nature of the production and inventory levels that must meet the periodic demand, the CLSP can be treated as a sequence labeling task where a recurrent neural network can capture the problem’s temporal dynamics. Computational results show that our LSTM-Optimization (LSTM-Opt) framework significantly reduces the solution time of benchmark CLSP problems without much loss in feasibility and optimality. For example, the predictions at the 85% level reduce the CPLEX solution time by a factor of 9 on average for over 240000 test instances with an optimality gap of less than 0.05% and 0.4% infeasibility in the test set. Also, models trained using shorter planning horizons can successfully predict the optimal solution of the instances with longer planning horizons. For the hardest data set, the LSTM predictions at the 25% level reduce the solution time of 70 CPU hours to less than 2 CPU minutes with an optimality gap of 0.8% and without any infeasibility. The LSTM-Opt framework outperforms classical ML algorithms, such as the logistic regression and random forest, in terms of the solution quality, and exact approaches, such as the (ℓ , S) and dynamic programming-based inequalities, with respect to the solution time improvement. Our machine learning approach could be beneficial in tackling sequential decision-making problems similar to CLSP, which need to be solved repetitively, frequently, and in a fast manner.
KW - Bidirectional Long Short Term Memory (LSTM)
KW - Capacitated lot-sizing
KW - Machine learning
KW - Mixed integer programming
KW - Recurrent neural networks
KW - Sequential decision-making
UR - http://www.scopus.com/inward/record.url?scp=85160769001&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85160769001&partnerID=8YFLogxK
U2 - 10.1007/s43069-023-00224-5
DO - 10.1007/s43069-023-00224-5
M3 - Article
AN - SCOPUS:85160769001
SN - 2662-2556
VL - 4
JO - Operations Research Forum
JF - Operations Research Forum
IS - 2
M1 - 48
ER -