启用AI的无线网络的单一和多代理深入学习学习：教程

论文标题

启用AI的无线网络的单一和多代理深入学习学习：教程

Single and Multi-Agent Deep Reinforcement Learning for AI-Enabled Wireless Networks: A Tutorial

论文作者

Feriani, Amal, Hossain, Ekram

论文摘要

深度强化学习（DRL）最近见证了重大进步，这在解决各个领域的顺序决策问题方面取得了多种成功，尤其是在无线通信中。预计未来的第六代（6G）网络将提供可扩展的，低延迟的超级可靠服务，并通过数据驱动的人工智能（AI）赋予能力。未来6G网络的关键技术，例如智能的元曲面，空中网络和AI，涉及多个代理，这激发了多机构学习技术的重要性。此外，合作对于建立自我组织，自我维持和分散的网络至关重要。在这种情况下，本教程的重点是DRL的作用，重点是针对AI支持的6G网络的深层多机构增强学习（MARL）。本文的第一部分将介绍单代理RL和MARL的数学框架。这项工作的主要思想是激励RL超出近年来广泛采用的无模型观点的应用。因此，我们提供了RL算法的选择性描述，例如基于模型的RL（MBRL）和合作MARL，并重点介绍了它们在6G无线网络中的潜在应用。最后，我们在移动边缘计算（MEC），无人驾驶汽车（UAV）网络和无细胞的大型MIMO等领域中概述了MARL的最先进，并确定了有希望的未来研究指示。我们希望该教程能够刺激更多的研究努力，以建立基于MARL的可扩展和分散的系统。

Deep Reinforcement Learning (DRL) has recently witnessed significant advances that have led to multiple successes in solving sequential decision-making problems in various domains, particularly in wireless communications. The future sixth-generation (6G) networks are expected to provide scalable, low-latency, ultra-reliable services empowered by the application of data-driven Artificial Intelligence (AI). The key enabling technologies of future 6G networks, such as intelligent meta-surfaces, aerial networks, and AI at the edge, involve more than one agent which motivates the importance of multi-agent learning techniques. Furthermore, cooperation is central to establishing self-organizing, self-sustaining, and decentralized networks. In this context, this tutorial focuses on the role of DRL with an emphasis on deep Multi-Agent Reinforcement Learning (MARL) for AI-enabled 6G networks. The first part of this paper will present a clear overview of the mathematical frameworks for single-agent RL and MARL. The main idea of this work is to motivate the application of RL beyond the model-free perspective which was extensively adopted in recent years. Thus, we provide a selective description of RL algorithms such as Model-Based RL (MBRL) and cooperative MARL and we highlight their potential applications in 6G wireless networks. Finally, we overview the state-of-the-art of MARL in fields such as Mobile Edge Computing (MEC), Unmanned Aerial Vehicles (UAV) networks, and cell-free massive MIMO, and identify promising future research directions. We expect this tutorial to stimulate more research endeavors to build scalable and decentralized systems based on MARL.

下载PDF全文

下载文献需遵守相关版权规定

论文标题