论文标题
启用AI的无线网络的单一和多代理深入学习学习:教程
Single and Multi-Agent Deep Reinforcement Learning for AI-Enabled Wireless Networks: A Tutorial
论文作者
论文摘要
深度强化学习(DRL)最近见证了重大进步,这在解决各个领域的顺序决策问题方面取得了多种成功,尤其是在无线通信中。预计未来的第六代(6G)网络将提供可扩展的,低延迟的超级可靠服务,并通过数据驱动的人工智能(AI)赋予能力。未来6G网络的关键技术,例如智能的元曲面,空中网络和AI,涉及多个代理,这激发了多机构学习技术的重要性。此外,合作对于建立自我组织,自我维持和分散的网络至关重要。在这种情况下,本教程的重点是DRL的作用,重点是针对AI支持的6G网络的深层多机构增强学习(MARL)。本文的第一部分将介绍单代理RL和MARL的数学框架。这项工作的主要思想是激励RL超出近年来广泛采用的无模型观点的应用。因此,我们提供了RL算法的选择性描述,例如基于模型的RL(MBRL)和合作MARL,并重点介绍了它们在6G无线网络中的潜在应用。最后,我们在移动边缘计算(MEC),无人驾驶汽车(UAV)网络和无细胞的大型MIMO等领域中概述了MARL的最先进,并确定了有希望的未来研究指示。我们希望该教程能够刺激更多的研究努力,以建立基于MARL的可扩展和分散的系统。
Deep Reinforcement Learning (DRL) has recently witnessed significant advances that have led to multiple successes in solving sequential decision-making problems in various domains, particularly in wireless communications. The future sixth-generation (6G) networks are expected to provide scalable, low-latency, ultra-reliable services empowered by the application of data-driven Artificial Intelligence (AI). The key enabling technologies of future 6G networks, such as intelligent meta-surfaces, aerial networks, and AI at the edge, involve more than one agent which motivates the importance of multi-agent learning techniques. Furthermore, cooperation is central to establishing self-organizing, self-sustaining, and decentralized networks. In this context, this tutorial focuses on the role of DRL with an emphasis on deep Multi-Agent Reinforcement Learning (MARL) for AI-enabled 6G networks. The first part of this paper will present a clear overview of the mathematical frameworks for single-agent RL and MARL. The main idea of this work is to motivate the application of RL beyond the model-free perspective which was extensively adopted in recent years. Thus, we provide a selective description of RL algorithms such as Model-Based RL (MBRL) and cooperative MARL and we highlight their potential applications in 6G wireless networks. Finally, we overview the state-of-the-art of MARL in fields such as Mobile Edge Computing (MEC), Unmanned Aerial Vehicles (UAV) networks, and cell-free massive MIMO, and identify promising future research directions. We expect this tutorial to stimulate more research endeavors to build scalable and decentralized systems based on MARL.