PlumX Metrics

Deep Reinforcement Learning for Dynamic Twin Automated Stacking Cranes Scheduling Problem

Electronics (Switzerland), ISSN: 2079-9292, Vol: 12, Issue: 15
2023
  • Citations: 1
  • Usage: 0
  • Captures: 4
  • Mentions: 2
  • Social Media: 0

Metrics Details

  • Citations: 1
  • Captures: 4
  • Mentions: 2
    • Blog Mentions: 1
    • News Mentions: 1

Most Recent Blog

Electronics, Vol. 12, Pages 3288: Deep Reinforcement Learning for Dynamic Twin Automated Stacking Cranes Scheduling Problem

Electronics, doi: 10.3390/electronics12153288. Authors: Xin Jin, Nan Mi, Wen

Most Recent News

Studies from Shandong University Yield New Data on Electronics (Deep Reinforcement Learning for Dynamic Twin Automated Stacking Cranes Scheduling Problem)

2023 AUG 10 (NewsRx) -- By a News Reporter-Staff News Editor at Electronics Daily -- Researchers detail new data in electronics. According to news reporting

Article Description

Effective dynamic scheduling of twin Automated Stacking Cranes (ASCs) is essential for improving the efficiency of automated storage yards. While Deep Reinforcement Learning (DRL) has shown promise in a variety of scheduling problems, the dynamic twin-ASC scheduling problem is challenging owing to its unique attributes, including the dynamic arrival of containers, sequence-dependent setup times, and potential ASC interference. A novel DRL method is proposed in this paper to minimize ASC run time and traffic congestion in the yard. To mitigate information interference from ineligible containers, dynamic masked self-attention (DMA) is designed to capture the location-related relationships between containers. Additionally, we propose local information complementary attention (LICA) to supplement congestion-related information for decision making. The embeddings produced by the LICA-DMA neural architecture can effectively represent the system state. Extensive experiments show that the agent learns high-quality scheduling policies. Compared with rule-based heuristics, the learned policies perform significantly better at reasonable time cost, and they also generalize well to unseen scenarios with different scales or distributions.
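The abstract only describes DMA at a high level; the sketch below is a minimal, hypothetical illustration of the masking idea (not the authors' implementation), assuming PyTorch, single-head attention, and made-up tensor names (`x`, `eligible`). Containers that are currently ineligible (e.g. not yet arrived) are masked out as attention keys, so their features cannot interfere with the embeddings of eligible containers.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class DynamicMaskedSelfAttention(nn.Module):
    """Single-head self-attention with a dynamic eligibility mask.

    Ineligible containers (mask == False) are excluded as attention keys,
    so they contribute nothing to any container's context vector.
    """

    def __init__(self, embed_dim: int):
        super().__init__()
        self.q_proj = nn.Linear(embed_dim, embed_dim)
        self.k_proj = nn.Linear(embed_dim, embed_dim)
        self.v_proj = nn.Linear(embed_dim, embed_dim)
        self.scale = embed_dim ** -0.5

    def forward(self, x: torch.Tensor, eligible: torch.Tensor) -> torch.Tensor:
        # x: (batch, n_containers, embed_dim) container feature embeddings
        # eligible: (batch, n_containers) boolean mask, True = eligible now
        q, k, v = self.q_proj(x), self.k_proj(x), self.v_proj(x)
        scores = torch.matmul(q, k.transpose(-2, -1)) * self.scale
        # Mask ineligible containers as keys before the softmax.
        key_mask = eligible.unsqueeze(1)                  # (batch, 1, n_containers)
        scores = scores.masked_fill(~key_mask, float("-inf"))
        weights = F.softmax(scores, dim=-1)
        return torch.matmul(weights, v)


if __name__ == "__main__":
    batch, n, d = 2, 5, 16
    x = torch.randn(batch, n, d)
    # Example eligibility: some containers have not arrived yet.
    eligible = torch.tensor([[1, 1, 0, 1, 0],
                             [1, 0, 1, 1, 1]], dtype=torch.bool)
    out = DynamicMaskedSelfAttention(d)(x, eligible)
    print(out.shape)  # torch.Size([2, 5, 16])
```

The mask would be rebuilt at every decision step as containers arrive, which is what makes the masking "dynamic"; how the paper combines this with the LICA branch to form the full state embedding is detailed in the article itself.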
