Multi-Echelon Inventory Optimization Using Deep Reinforcement Learning

Hammler, Patric; Riesterer, Nicolas; Mu, Gang; Braun, Torsten (2022). Multi-Echelon Inventory Optimization Using Deep Reinforcement Learning. In: Canci, Jung Kyu; Mekler, Philipp; Mu, Gang (eds.) Quantitative Models in Life Science Business. SpringerBriefs in economics (pp. 73-93). Springer 10.1007/978-3-031-11814-2_5

Multi-Echelon_Inventory_Optimization_Using_Deep_Reinforcement_Learning.pdf - Published Version
Available under License Creative Commons: Attribution (CC-BY).


In this chapter, we provide an overview of inventory management in the pharmaceutical industry and of how to model and optimize it. Inventory management is highly relevant because it incurs substantial holding, shortage, and reordering costs. A stock-out in particular can cause damage that goes beyond the monetary loss from lost sales. Minimizing these costs is the task of an optimized reorder policy. A reorder policy is optimal when it minimizes the accumulated cost in every situation. However, finding an optimal policy is not trivial. First, the problem is highly stochastic, as both demands and lead times vary. Second, the supply chain consists of several warehouses, including the factory, global distribution warehouses, and local affiliate warehouses, and the reorder policy of each warehouse affects the optimal reorder policy of related warehouses. In this context, we discuss the concept of multi-echelon inventory optimization and a methodology capable of capturing both the stochastic behavior of the environment and how it is affected by the reorder policy: Markov decision processes (MDPs). On this basis, we introduce reinforcement learning (RL), a methodology capable of finding (near-)optimal reorder policies for MDPs, together with its benefits and weaknesses. Furthermore, we present some simulation-based results and current research directions.
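As a rough illustration of the MDP view described in the abstract, the following minimal Python sketch models a single warehouse with zero lead time. It is not taken from the chapter: the cost parameters, the uniform demand distribution, and the names `step`, `base_stock_policy`, and `simulate` are all assumptions made for this example. The state is the stock level, the action is the order quantity, and the per-period cost combines the holding, shortage, and reordering costs mentioned above.

```python
import random

# Hypothetical cost parameters, chosen only for illustration.
HOLDING_COST = 1.0    # per unit left in stock at the end of a period
SHORTAGE_COST = 10.0  # per unit of unmet demand (lost sale)
REORDER_COST = 5.0    # fixed cost charged whenever an order is placed


def step(stock, order_qty, demand):
    """One MDP transition for a zero-lead-time warehouse.

    Returns (next_stock, cost) given the current stock (state),
    the order quantity (action), and the realized demand.
    """
    available = stock + order_qty
    sold = min(available, demand)
    next_stock = available - sold
    unmet = demand - sold
    cost = HOLDING_COST * next_stock + SHORTAGE_COST * unmet
    if order_qty > 0:
        cost += REORDER_COST
    return next_stock, cost


def base_stock_policy(stock, target=20):
    """Order up to a fixed target level (a simple baseline, not a learned policy)."""
    return max(0, target - stock)


def simulate(policy, periods=52, seed=0):
    """Accumulate the cost of a policy over a horizon with random demand."""
    rng = random.Random(seed)
    stock, total = 0, 0.0
    for _ in range(periods):
        demand = rng.randint(0, 30)  # assumed uniform demand, for illustration
        stock, cost = step(stock, policy(stock), demand)
        total += cost
    return total


if __name__ == "__main__":
    print(simulate(base_stock_policy))
```

An RL agent would replace `base_stock_policy` with a policy learned from simulated transitions such as those produced by `step`. A multi-echelon version would couple several such warehouses, so that orders placed by one become demand for the next.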

Item Type: Book Section (Book Chapter)

Division/Institute:
08 Faculty of Science > Institute of Computer Science (INF) > Communication and Distributed Systems (CDS)
08 Faculty of Science > Institute of Computer Science (INF)

UniBE Contributor: Hammler, Patric; Braun, Torsten

Subjects:
000 Computer science, knowledge & systems
500 Science > 510 Mathematics

ISSN: 2191-5512
ISBN: 978-3-031-11813-5
Series: SpringerBriefs in economics
Publisher: Springer
Language: English
Submitter: Dimitrios Xenakis
Date Deposited: 25 Jan 2023 11:16
Last Modified: 26 Aug 2023 23:10
Publisher DOI: 10.1007/978-3-031-11814-2_5
BORIS DOI: 10.48350/176625
URI: https://boris.unibe.ch/id/eprint/176625
