Quantum Reinforcement Learning with Transformers for the Capacitated Vehicle Routing Problem
AI 摘要
论文比较了经典和量子强化学习解决带容量约束车辆路径问题(CVRP),混合量子方法性能最佳。
主要贡献
- 比较了经典、全量子和混合量子强化学习方法在CVRP上的表现
- 将Transformer架构集成到强化学习智能体中,用于捕捉车辆、客户和车场之间的关系
- 实验结果表明混合量子方法在CVRP上优于经典方法,并生成更鲁棒的路径
方法论
采用经典、全量子和混合量子A2C智能体,使用Transformer架构,在多车辆容量约束的CVRP问题上进行实验对比。
原文摘要
This paper addresses the Capacitated Vehicle Routing Problem (CVRP) by comparing classical and quantum Reinforcement Learning (RL) approaches. An Advantage Actor-Critic (A2C) agent is implemented in classical, full quantum, and hybrid variants, integrating transformer architectures to capture the relationships between vehicles, clients, and the depot through self- and cross-attention mechanisms. The experiments focus on multi-vehicle scenarios with capacity constraints, considering 20 clients and 4 vehicles, and are conducted over ten independent runs. Performance is assessed using routing distance, route compactness, and route overlap. The results show that all three approaches are capable of learning effective routing policies. However, quantum-enhanced models outperform the classical baseline and produce more robust route organization, with the hybrid architecture achieving the best overall performance across distance, compactness, and route overlap. In addition to quantitative improvements, qualitative visualizations reveal that quantum-based models generate more structured and coherent routing solutions. These findings highlight the potential of hybrid quantum-classical reinforcement learning models for addressing complex combinatorial optimization problems such as the CVRP.