Learning-Based Multi-UAV Flocking Control With Limited Visual Field and Instinctive Repulsion

Chengchao Bai; Peng Yan; Haiyin Piao; Wei Pan; Jifeng Guo

doi:10.1109/TCYB.2023.3246985

Learning-Based Multi-UAV Flocking Control With Limited Visual Field and Instinctive Repulsion

Chengchao Bai, Peng Yan, Haiyin Piao, Wei Pan, Jifeng Guo

Robot Dynamics

Research output: Contribution to journal › Article › Scientific › peer-review

1 Citation (Scopus)

29 Downloads (Pure)

Abstract

This article explores deep reinforcement learning (DRL) for the flocking control of unmanned aerial vehicle (UAV) swarms. The flocking control policy is trained using a centralized-learning-decentralized-execution (CTDE) paradigm, where a centralized critic network augmented with additional information about the entire UAV swarm is utilized to improve learning efficiency. Instead of learning inter-UAV collision avoidance capabilities, a repulsion function is encoded as an inner-UAV 'instinct.' In addition, the UAVs can obtain the states of other UAVs through onboard sensors in communication-denied environments, and the impact of varying visual fields on flocking control is analyzed. Through extensive simulations, it is shown that the proposed policy with the repulsion function and limited visual field has a success rate of 93.8% in training environments, 85.6% in environments with a high number of UAVs, 91.2% in environments with a high number of obstacles, and 82.2% in environments with dynamic obstacles. Furthermore, the results indicate that the proposed learning-based methods are more suitable than traditional methods in cluttered environments.

Original language	English
Pages (from-to)	462-475
Journal	IEEE Transactions on Cybernetics
Volume	54
Issue number	1
DOIs	https://doi.org/10.1109/TCYB.2023.3246985
Publication status	Published - 2024

Bibliographical note

Green Open Access added to TU Delft Institutional Repository 'You share, we take care!' - Taverne project https://www.openaccess.nl/en/you-share-we-take-care
Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.

Keywords

Autonomous aerial vehicles
Collision avoidance
Deep reinforcement learning (DRL)
flocking control
inter-unmanned aerial vehicle (UAV) collision avoidance
limited visual field
Optimization
Reinforcement learning
Sensors
Training
UAVs
Visualization

Access to Document

10.1109/TCYB.2023.3246985

Learning-Based_Multi-UAV_Flocking_Control_With_Limited_Visual_Field_and_Instinctive_RepulsionFinal published version, 6.89 MB

Cite this

@article{9c9e744641fa4ba484c303575873e5b0,

title = "Learning-Based Multi-UAV Flocking Control With Limited Visual Field and Instinctive Repulsion",

abstract = "This article explores deep reinforcement learning (DRL) for the flocking control of unmanned aerial vehicle (UAV) swarms. The flocking control policy is trained using a centralized-learning-decentralized-execution (CTDE) paradigm, where a centralized critic network augmented with additional information about the entire UAV swarm is utilized to improve learning efficiency. Instead of learning inter-UAV collision avoidance capabilities, a repulsion function is encoded as an inner-UAV 'instinct.' In addition, the UAVs can obtain the states of other UAVs through onboard sensors in communication-denied environments, and the impact of varying visual fields on flocking control is analyzed. Through extensive simulations, it is shown that the proposed policy with the repulsion function and limited visual field has a success rate of 93.8% in training environments, 85.6% in environments with a high number of UAVs, 91.2% in environments with a high number of obstacles, and 82.2% in environments with dynamic obstacles. Furthermore, the results indicate that the proposed learning-based methods are more suitable than traditional methods in cluttered environments.",

keywords = "Autonomous aerial vehicles, Collision avoidance, Deep reinforcement learning (DRL), flocking control, inter-unmanned aerial vehicle (UAV) collision avoidance, limited visual field, Optimization, Reinforcement learning, Sensors, Training, UAVs, Visualization",

author = "Chengchao Bai and Peng Yan and Haiyin Piao and Wei Pan and Jifeng Guo",

note = "Green Open Access added to TU Delft Institutional Repository 'You share, we take care!' - Taverne project https://www.openaccess.nl/en/you-share-we-take-care Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.",

year = "2024",

doi = "10.1109/TCYB.2023.3246985",

language = "English",

volume = "54",

pages = "462--475",

journal = "IEEE Transactions on Cybernetics",

issn = "2168-2267",

publisher = "IEEE Advancing Technology for Humanity",

number = "1",

}

TY - JOUR

T1 - Learning-Based Multi-UAV Flocking Control With Limited Visual Field and Instinctive Repulsion

AU - Bai, Chengchao

AU - Yan, Peng

AU - Piao, Haiyin

AU - Pan, Wei

AU - Guo, Jifeng

N1 - Green Open Access added to TU Delft Institutional Repository 'You share, we take care!' - Taverne project https://www.openaccess.nl/en/you-share-we-take-care Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.

PY - 2024

Y1 - 2024

N2 - This article explores deep reinforcement learning (DRL) for the flocking control of unmanned aerial vehicle (UAV) swarms. The flocking control policy is trained using a centralized-learning-decentralized-execution (CTDE) paradigm, where a centralized critic network augmented with additional information about the entire UAV swarm is utilized to improve learning efficiency. Instead of learning inter-UAV collision avoidance capabilities, a repulsion function is encoded as an inner-UAV 'instinct.' In addition, the UAVs can obtain the states of other UAVs through onboard sensors in communication-denied environments, and the impact of varying visual fields on flocking control is analyzed. Through extensive simulations, it is shown that the proposed policy with the repulsion function and limited visual field has a success rate of 93.8% in training environments, 85.6% in environments with a high number of UAVs, 91.2% in environments with a high number of obstacles, and 82.2% in environments with dynamic obstacles. Furthermore, the results indicate that the proposed learning-based methods are more suitable than traditional methods in cluttered environments.

AB - This article explores deep reinforcement learning (DRL) for the flocking control of unmanned aerial vehicle (UAV) swarms. The flocking control policy is trained using a centralized-learning-decentralized-execution (CTDE) paradigm, where a centralized critic network augmented with additional information about the entire UAV swarm is utilized to improve learning efficiency. Instead of learning inter-UAV collision avoidance capabilities, a repulsion function is encoded as an inner-UAV 'instinct.' In addition, the UAVs can obtain the states of other UAVs through onboard sensors in communication-denied environments, and the impact of varying visual fields on flocking control is analyzed. Through extensive simulations, it is shown that the proposed policy with the repulsion function and limited visual field has a success rate of 93.8% in training environments, 85.6% in environments with a high number of UAVs, 91.2% in environments with a high number of obstacles, and 82.2% in environments with dynamic obstacles. Furthermore, the results indicate that the proposed learning-based methods are more suitable than traditional methods in cluttered environments.

KW - Autonomous aerial vehicles

KW - Collision avoidance

KW - Deep reinforcement learning (DRL)

KW - flocking control

KW - inter-unmanned aerial vehicle (UAV) collision avoidance

KW - limited visual field

KW - Optimization

KW - Reinforcement learning

KW - Sensors

KW - Training

KW - UAVs

KW - Visualization

UR - http://www.scopus.com/inward/record.url?scp=85149820607&partnerID=8YFLogxK

U2 - 10.1109/TCYB.2023.3246985

DO - 10.1109/TCYB.2023.3246985

M3 - Article

AN - SCOPUS:85149820607

SN - 2168-2267

VL - 54

SP - 462

EP - 475

JO - IEEE Transactions on Cybernetics

JF - IEEE Transactions on Cybernetics

IS - 1

ER -

Learning-Based Multi-UAV Flocking Control With Limited Visual Field and Instinctive Repulsion

Abstract

Bibliographical note

Keywords

Access to Document

Other files and links

Fingerprint

Cite this