论文标题

启用AI的无线网络的单一和多代理深入学习学习:教程

Single and Multi-Agent Deep Reinforcement Learning for AI-Enabled Wireless Networks: A Tutorial

论文作者

Feriani, Amal, Hossain, Ekram

论文摘要

深度强化学习(DRL)最近见证了重大进步,这在解决各个领域的顺序决策问题方面取得了多种成功,尤其是在无线通信中。预计未来的第六代(6G)网络将提供可扩展的,低延迟的超级可靠服务,并通过数据驱动的人工智能(AI)赋予能力。未来6G网络的关键技术,例如智能的元曲面,空中网络和AI,涉及多个代理,这激发了多机构学习技术的重要性。此外,合作对于建立自我组织,自我维持和分散的网络至关重要。在这种情况下,本教程的重点是DRL的作用,重点是针对AI支持的6G网络的深层多机构增强学习(MARL)。本文的第一部分将介绍单代理RL和MARL的数学框架。这项工作的主要思想是激励RL超出近年来广泛采用的无模型观点的应用。因此,我们提供了RL算法的选择性描述,例如基于模型的RL(MBRL)和合作MARL,并重点介绍了它们在6G无线网络中的潜在应用。最后,我们在移动边缘计算(MEC),无人驾驶汽车(UAV)网络和无细胞的大型MIMO等领域中概述了MARL的最先进,并确定了有希望的未来研究指示。我们希望该教程能够刺激更多的研究努力,以建立基于MARL的可扩展和分散的系统。

Deep Reinforcement Learning (DRL) has recently witnessed significant advances that have led to multiple successes in solving sequential decision-making problems in various domains, particularly in wireless communications. The future sixth-generation (6G) networks are expected to provide scalable, low-latency, ultra-reliable services empowered by the application of data-driven Artificial Intelligence (AI). The key enabling technologies of future 6G networks, such as intelligent meta-surfaces, aerial networks, and AI at the edge, involve more than one agent which motivates the importance of multi-agent learning techniques. Furthermore, cooperation is central to establishing self-organizing, self-sustaining, and decentralized networks. In this context, this tutorial focuses on the role of DRL with an emphasis on deep Multi-Agent Reinforcement Learning (MARL) for AI-enabled 6G networks. The first part of this paper will present a clear overview of the mathematical frameworks for single-agent RL and MARL. The main idea of this work is to motivate the application of RL beyond the model-free perspective which was extensively adopted in recent years. Thus, we provide a selective description of RL algorithms such as Model-Based RL (MBRL) and cooperative MARL and we highlight their potential applications in 6G wireless networks. Finally, we overview the state-of-the-art of MARL in fields such as Mobile Edge Computing (MEC), Unmanned Aerial Vehicles (UAV) networks, and cell-free massive MIMO, and identify promising future research directions. We expect this tutorial to stimulate more research endeavors to build scalable and decentralized systems based on MARL.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源