A Collaborative Safety Shield for Safe and Efficient CAV Lane Changes in Congested On-Ramp Merging
AI 摘要
提出了一种基于多智能体强化学习和安全盾的协同自动驾驶车辆变道策略。
主要贡献
- 提出了Multi-Agent Safety Shield (MASS),利用Control Barrier Functions (CBFs) 确保安全。
- 将MASS集成到多智能体强化学习 (MARL) 控制器中,平衡安全和效率。
- 设计了定制的奖励函数,提高了MARL策略的稳定性。
方法论
采用多智能体强化学习(MARL)训练变道策略,并利用基于Control Barrier Functions (CBFs)的安全盾保证安全性。
原文摘要
Lane changing in dense traffic is a significant challenge for Connected and Autonomous Vehicles (CAVs). Existing lane change controllers primarily either ensure safety or collaboratively improve traffic efficiency, but do not consider these conflicting objectives together. To address this, we propose the Multi-Agent Safety Shield (MASS), designed using Control Barrier Functions (CBFs) to enable safe and collaborative lane changes. The MASS enables collaboration by capturing multi-agent interactions among CAVs through interaction topologies constructed as a graph using a simple algorithm. Further, a state-of-the-art Multi-Agent Reinforcement Learning (MARL) lane change controller is extended by integrating MASS to ensure safety and defining a customised reward function to prioritise efficiency improvements. As a result, we propose a lane change controller, known as MARL-MASS, and evaluate it in a congested on-ramp merging simulation. The results demonstrate that MASS enables collaborative lane changes with safety guarantees by strictly respecting the safety constraints. Moreover, the proposed custom reward function improves the stability of MARL policies trained with a safety shield. Overall, by encouraging the exploration of a collaborative lane change policy while respecting safety constraints, MARL-MASS effectively balances the trade-off between ensuring safety and improving traffic efficiency in congested traffic. The code for MARL-MASS is available with an open-source licence at https://github.com/hkbharath/MARL-MASS