Graduate Student: Carlos Andres Palacios Caicedo
Thesis Title: A Deep Reinforcement Learning Model for Solving the Pollution Routing Problem in Sustainable Logistics Management
Advisor: Shih-Che Lo
Oral Defense Committee: Chao Ou-Yang, Shih-Hsien Tseng, Shih-Che Lo
Degree: Master's
Department: College of Management - Department of Industrial Management
Year of Publication: 2023
Academic Year of Graduation: 111
Language: English
Number of Pages: 59
Chinese Keywords: 深度強化學習、污染運途問題、潤滑油產業、永續物流管理、配銷管道最佳化
English Keywords: Deep Reinforcement Learning, Pollution-Routing Problem, Lubes Industry, Sustainable Logistics Management, Distribution Channel Optimization
Abstract

As global warming worsens and environmental pollution increases, new regulations and internal controls are pushing companies toward sustainability. Logistics transportation is a major source of CO2 emissions, which makes route optimization an important lever for reducing a company's carbon footprint. The main objective of this thesis is to help reduce carbon emissions by optimizing the routes used in companies' delivery processes. To that end, it presents a new methodology that uses Deep Reinforcement Learning (DRL) to solve the Pollution-Routing Problem (PRP). The model uses distance and weight information to optimize the fuel consumption of the trucks and thereby reduce CO2 emissions. It is trained on different instances and can then be called for inference on new instances without retraining. The model is based on an Actor-Critic method that uses an attention mechanism for the Actor. The hyperparameters of the algorithm were tuned through a rigorous design of experiments so that the model obtains near-optimal solutions. The proposed model is tested under various scenarios to obtain optimal and feasible solutions. A case study from the lubricants (Lubes) industry in Colombia illustrates the importance of implementing the proposed model. The model can be applied to delivery processes in the logistics industry where environmental concerns matter. Finally, it allows companies to track and control their carbon emissions through daily routing in order to comply with government regulations.

Table of Contents

Abstract
Chinese Abstract
Acknowledgment
Table of Contents
List of Tables
List of Figures
Chapter 1 Introduction
  1.1 Research Motivation
  1.2 Focus and Scope
  1.3 Research Objective
  1.4 Research Overview
Chapter 2 Literature Review
  2.1 DRL Concepts
    2.1.1 Reinforcement Learning Structure
    2.1.2 Actor-Critic Structure
    2.1.3 Attention Mechanisms
  2.2 Vehicle Routing Problem
    2.2.1 VRP Evolution and Variations
    2.2.2 DRL Models Applied to VRP
    2.2.3 Metaheuristics Applied to VRP
    2.2.4 Classic Heuristics Applied to VRP
    2.2.5 Exact Algorithms Applied to VRP
  2.3 Pollution Routing Problem
    2.3.1 PRP Evolution and Variations
    2.3.2 Metaheuristics Applied to PRP
    2.3.3 Exact Algorithms Applied to PRP
  2.4 Green Vehicle Routing Problem
    2.4.1 GVRP Evolution and Variations
    2.4.2 Metaheuristics Applied to GVRP
    2.4.3 Classic Heuristics Applied to GVRP
  2.5 Other Combinatorial Problems
    2.5.1 DRL Models Applied to Other Combinatorial Problems
    2.5.2 Metaheuristics Applied to Other Combinatorial Problems
    2.5.3 Classic Heuristics Applied to Other Combinatorial Problems
    2.5.4 Exact Algorithms Applied to Other Combinatorial Problems
Chapter 3 Research Methodology
  3.1 Mathematical Models
    3.1.1 Parameters
    3.1.2 Decision Variables
    3.1.3 Objective Function
    3.1.4 Constraints
  3.2 DRL Model Applied
Chapter 4 Computational Experiments
  4.1 Parameter Tuning
  4.2 Training
  4.3 Testing
Chapter 5 Case Study
  5.1 Lubes Industry
  5.2 Manufacturing Process
  5.3 Distribution Channels
  5.4 Company Overview
Chapter 6 Conclusions and Future Research
Appendix
References

