Simulated annealing with reinforcement learning for the set team orienteering problem with time windows

Citation DataExpert Systems with Applications, ISSN: 0957-4174, Vol: 238, Page: 121996

Publication Year2024

10
Citations
157
Usage
17
Captures
1
Mentions
0
Social Media

Metric Options: Counts1 Year3 Year

Metrics Details

Citations
10
- Citation Indexes
  10
Usage
157
- Downloads
  117
- Abstract Views
  40
Captures
17
- Readers
  17
Mentions
1
- News Mentions
  1

Most Recent News

Reports from Chang Gung University Add New Data to Findings in Technology (Simulated Annealing With Reinforcement Learning for the Set Team Orienteering Problem With Time Windows)

March 14, 2024
Sports Research Daily

2024 MAR 14 (NewsRx) -- By a News Reporter-Staff News Editor at Sports Research Daily -- Fresh data on Technology are presented in a new

Article Description

This research investigates the Set Team Orienteering Problem with Time Windows (STOPTW), a new variant of the well-known Team Orienteering Problem with Time Windows and Set Orienteering Problem. In the STOPTW, customers are grouped into clusters. Each cluster is associated with a profit attainable when a customer in the cluster is visited within the customer’s time window. A Mixed Integer Linear Programming model is formulated for STOPTW to maximizing total profit while adhering to time window constraints. Since STOPTW is an NP-hard problem, a Simulated Annealing with Reinforcement Learning (SA RL ) algorithm is developed. The proposed SA RL incorporates the core concepts of reinforcement learning, utilizing the ε-greedy algorithm to learn the fitness values resulting from neighborhood moves. Numerical experiments are conducted to assess the performance of SA RL, comparing the results with those obtained by CPLEX and Simulated Annealing (SA). For small instances, both SA RL and SA algorithms outperform CPLEX by obtaining eight optimal solutions and 12 better solutions. For large instances, both algorithms obtain better solutions to 28 out of 29 instances within shorter computational times compared to CPLEX. Overall, SA RL outperforms SA by resulting in lower gap percentages within the same computational times. Specifically, SA RL outperforms SA in solving 13 large STOPTW benchmark instances. Finally, a sensitivity analysis is conducted to derive managerial insights.

Bibliographic Details

DOI10.1016/j.eswa.2023.121996

REPOSITORY URLhttps://ink.library.smu.edu.sg/sis_research/8265

URL IDhttp://www.sciencedirect.com/science/article/pii/S0957417423024983; http://dx.doi.org/10.1016/j.eswa.2023.121996; http://www.scopus.com/inward/record.url?partnerID=HzOxMe3b&scp=85174162908&origin=inward; https://linkinghub.elsevier.com/retrieve/pii/S0957417423024983; https://ink.library.smu.edu.sg/sis_research/8265; https://ink.library.smu.edu.sg/cgi/viewcontent.cgi?article=9268&context=sis_research; https://dx.doi.org/10.1016/j.eswa.2023.121996

AUTHOR(S)

Vincent F. Yu; Nabila Yuraisyah Salsabila; Shih-Wei Lin; Aldy Gunawan

PUBLISHER(S)

Elsevier BV

TAG(S)

Engineering; Computer Science

Provide Feedback

Have ideas for a new metric? Would you like to see something else here?Let us know