Enhancing User Throughput in Multi-panel mmWave Radio Access Networks for Beam-based MU-MIMO Using a DRL Method
AI 摘要
该论文提出了一种基于DRL的波束管理策略,优化毫米波MU-MIMO系统的用户吞吐量和降低延迟。
主要贡献
- 提出了一种基于DRL的自适应波束管理策略
- 利用空间域特征(波束相关性,RSRP,波束使用统计)进行波束选择
- 在多面板毫米波无线接入网络中验证了方案的有效性
方法论
利用深度强化学习,将通信过程建模为MDP,基于实时观测优化波束选择,提升频谱效率,降低端到端延迟。
原文摘要
Millimeter-wave (mmWave) communication systems, particularly those leveraging multi-user multiple-input and multiple-output (MU-MIMO) with hybrid beamforming, face challenges in optimizing user throughput and minimizing latency due to the high complexity of dynamic beam selection and management. This paper introduces a deep reinforcement learning (DRL) approach for enhancing user throughput in multi-panel mmWave radio access networks in a practical network setup. Our DRL-based formulation utilizes an adaptive beam management strategy that models the interaction between the communication agent and its environment as a Markov decision process (MDP), optimizing beam selection based on real-time observations. The proposed framework exploits spatial domain (SD) characteristics by incorporating the cross-correlation between the beams in different antenna panels, the measured reference signal received power (RSRP), and the beam usage statistics to dynamically adjust beamforming decisions. As a result, the spectral efficiency is improved and end-to-end latency is reduced. The numerical results demonstrate an increase in throughput of up to 16% and a reduction in latency by factors 3-7x compared to baseline (legacy beam management).