Meta-Reinforcement Learning in Time-Varying UAV Communications: Adaptive Anti-Jamming Channel Selection
| dc.contributor.author | Hu, L. | |
| dc.contributor.author | Shao, Y. | |
| dc.contributor.author | Qian, Y. | |
| dc.contributor.author | Du, F. | |
| dc.contributor.author | Li, J. | |
| dc.contributor.author | Lin, Y. | |
| dc.contributor.author | Wang, Z. | |
| dc.coverage.issue | 3 | cs |
| dc.coverage.volume | 33 | cs |
| dc.date.accessioned | 2025-04-04T11:12:54Z | |
| dc.date.available | 2025-04-04T11:12:54Z | |
| dc.date.issued | 2024-09 | cs |
| dc.description.abstract | Unmanned Aerial Vehicle (UAV) communication networks are vulnerable to malicious jamming and co-channel interference, which degrade network performance. Developing anti-jamming methods that enhance communication security is therefore a significant challenge. In this paper, we propose a novel anti-jamming channel selection scheme for a multi-channel multi-UAV network. We first formulate the anti-jamming problem as a Partially Observable Stochastic Game (POSG), in which UAV pairs with partial observability compete for a limited number of communication channels against a Markov jammer. To ensure rapid adaptation to the dynamic jamming environment, we propose a Meta-Mean-Field Q-learning (MMFQ) algorithm, which provides a Nash Equilibrium (NE) solution to the POSG problem. Furthermore, we derive an upper bound on the loss function of MMFQ and prove the convergence of the proposed algorithm. Simulation results demonstrate that the proposed algorithm achieves a higher average reward than the benchmark algorithms, improving throughput and resource utilization, especially in large-scale UAV communication networks. | en |
| dc.format | text | cs |
| dc.format.extent | 417-431 | cs |
| dc.format.mimetype | application/pdf | en |
| dc.identifier.citation | Radioengineering. 2024, vol. 33, iss. 3, p. 417-431. ISSN 1210-2512 | cs |
| dc.identifier.doi | 10.13164/re.2024.0417 | en |
| dc.identifier.issn | 1210-2512 | |
| dc.identifier.uri | https://hdl.handle.net/11012/250702 | |
| dc.language.iso | en | cs |
| dc.publisher | Radioengineering Society | cs |
| dc.relation.ispartof | Radioengineering | cs |
| dc.relation.uri | https://www.radioeng.cz/fulltexts/2024/24_03_0417_0431.pdf | cs |
| dc.rights | Creative Commons Attribution 4.0 International license | en |
| dc.rights.access | openAccess | en |
| dc.rights.uri | http://creativecommons.org/licenses/by/4.0/ | en |
| dc.subject | Unmanned aerial vehicle (UAV) communication | en |
| dc.subject | anti-jamming | en |
| dc.subject | meta-reinforcement learning | en |
| dc.subject | mean field | en |
| dc.title | Meta-Reinforcement Learning in Time-Varying UAV Communications: Adaptive Anti-Jamming Channel Selection | en |
| dc.type.driver | article | en |
| dc.type.status | Peer-reviewed | en |
| dc.type.version | publishedVersion | en |
| eprints.affiliatedInstitution.faculty | Fakulta elektrotechniky a komunikačních technologií | cs |
Files
Original bundle
- Name: 24_03_0417_0431.pdf
- Size: 757.26 KB
- Format: Adobe Portable Document Format
