Meta-Reinforcement Learning in Time-Varying UAV Communications: Adaptive Anti-Jamming Channel Selection

Hu, L.; Shao, Y.; Qian, Y.; Du, F.; Li, J.; Lin, Y.; Wang, Z.

doi:10.13164/re.2024.0417

Meta-Reinforcement Learning in Time-Varying UAV Communications: Adaptive Anti-Jamming Channel Selection

dc.contributor.author	Hu, L.
dc.contributor.author	Shao, Y.
dc.contributor.author	Qian, Y.
dc.contributor.author	Du, F.
dc.contributor.author	Li, J.
dc.contributor.author	Lin, Y.
dc.contributor.author	Wang, Z.
dc.coverage.issue	3	cs
dc.coverage.volume	33	cs
dc.date.accessioned	2025-04-04T11:12:54Z
dc.date.available	2025-04-04T11:12:54Z
dc.date.issued	2024-09	cs
dc.description.abstract	Unmanned Aerial Vehicle (UAV) communication networks are vulnerable to malicious jamming and co-channel interference, deteriorating the performance of the networks. Therefore, the exploration of anti-jamming methods to enhance communication security becomes a significant challenge. In this paper, we propose a novel anti-jamming channel selection scheme in a multi-channel multi-UAV network. We first formulate the anti-jamming problem as a Partially Observable Stochastic Game (POSG), where the UAV pairs with partial observability compete for a limited number of communication channels against a Markov jammer. To ensure rapid adaptation to the dynamic jamming environment, we propose a Meta-Mean-Field Q-learning (MMFQ) algorithm, which provides a Nash Equilibrium (NE) solution to the POSG problem. Furthermore, we derive the expressions of the upper bound for the loss function of MMFQ and prove the convergence of the proposed algorithm. Simulation results demonstrate that the proposed algorithm can achieve a superior average reward compared to the benchmark algorithms, facilitating throughput enhancement and resource utilization increase, especially for large-scale UAV communication networks.	en
dc.format	text	cs
dc.format.extent	417-431	cs
dc.format.mimetype	application/pdf	en
dc.identifier.citation	Radioengineering. 2024 vol. 33, iss. 3, s. 417-431. ISSN 1210-2512	cs
dc.identifier.doi	10.13164/re.2024.0417	en
dc.identifier.issn	1210-2512
dc.identifier.uri	https://hdl.handle.net/11012/250702
dc.language.iso	en	cs
dc.publisher	Radioengineering Society	cs
dc.relation.ispartof	Radioengineering	cs
dc.relation.uri	https://www.radioeng.cz/fulltexts/2024/24_03_0417_0431.pdf	cs
dc.rights	Creative Commons Attribution 4.0 International license	en
dc.rights.access	openAccess	en
dc.rights.uri	http://creativecommons.org/licenses/by/4.0/	en
dc.subject	Unmanned aerial vehicle (UAV) communication	en
dc.subject	anti-jamming	en
dc.subject	meta-reinforcement learning	en
dc.subject	mean field	en
dc.title	Meta-Reinforcement Learning in Time-Varying UAV Communications: Adaptive Anti-Jamming Channel Selection	en
dc.type.driver	article	en
dc.type.status	Peer-reviewed	en
dc.type.version	publishedVersion	en
eprints.affiliatedInstitution.faculty	Fakulta elektrotechniky a komunikačních technologií	cs

Files

Original bundle

Now showing 1 - 1 of 1

Name:: 24_03_0417_0431.pdf
Size:: 757.26 KB
Format:: Adobe Portable Document Format

Download

Collections

2024/3