Abstract
Multirobot patrolling systems with various sensing and communications devices are deployed to guarantee maritime safety. Patrolling path planning for multiple robots can be modeled as a multiobjective optimization problem. The positions of patrolling nodes impact the length of patrolling paths and execution efficiency of robots. To compute them, a huge solution space is encountered. Besides, multiple patrolling nodes on the same line lead to the same patrolling scheme. Thus, how to promote solution (population) diversity becomes a new challenge. To tackle it, this work proposes a learning-inspired immune algorithm. It uses the historical information in the previous generations during iterations to realize a learning process. Unlike saving all the individuals themselves and training a model for them, the useful historical information is extracted by using upper confidence bound-based and actor-critic-inspired methods. Both time consumption and storage space can be dramatically saved. The experimental results indicate that the proposed algorithm can generate multiple patrolling schemes for the decision makers and outperforms the state-of-the-art.
Original language | English (US) |
---|---|
Pages (from-to) | 9870-9881 |
Number of pages | 12 |
Journal | IEEE Internet of Things Journal |
Volume | 11 |
Issue number | 6 |
DOIs | |
State | Published - Mar 15 2024 |
Externally published | Yes |
All Science Journal Classification (ASJC) codes
- Signal Processing
- Information Systems
- Hardware and Architecture
- Computer Science Applications
- Computer Networks and Communications
Keywords
- Actorâ€Â"critic
- immune algorithm
- multiobjective optimization
- multirobot maritime patrolling
- upper confidence bound (UCB)