Towards Sustainable Investment Policies Informed by Opponent Shaping
AI 摘要
论文利用对手塑造算法,改善投资行为,促进可持续投资政策的制定。
主要贡献
- 形式化了InvestESG中的社会困境
- 应用Advantage Alignment算法影响agent学习
- 理论解释了Advantage Alignment促进合作均衡的原因
方法论
构建InvestESG多智能体仿真环境,应用Advantage Alignment算法,并通过理论分析验证算法效果。
原文摘要
Addressing climate change requires global coordination, yet rational economic actors often prioritize immediate gains over collective welfare, resulting in social dilemmas. InvestESG is a recently proposed multi-agent simulation that captures the dynamic interplay between investors and companies under climate risk. We provide a formal characterization of the conditions under which InvestESG exhibits an intertemporal social dilemma, deriving theoretical thresholds at which individual incentives diverge from collective welfare. Building on this, we apply Advantage Alignment, a scalable opponent shaping algorithm shown to be effective in general-sum games, to influence agent learning in InvestESG. We offer theoretical insights into why Advantage Alignment systematically favors socially beneficial equilibria by biasing learning dynamics toward cooperative outcomes. Our results demonstrate that strategically shaping the learning processes of economic agents can result in better outcomes that could inform policy mechanisms to better align market incentives with long-term sustainability goals.