LLM Reasoning 相关度: 6/10

UrbanFM: Scaling Urban Spatio-Temporal Foundation Models

Wei Chen, Yuqian Wu, Junle Chen, Xiaofang Zhou, Yuxuan Liang
arXiv: 2602.20677v1 发布: 2026-02-24 更新: 2026-02-24

AI 摘要

构建大规模城市时空基础模型,实现跨城市、跨任务的零样本泛化。

主要贡献

  • 构建了包含全球城市数据的WorldST数据集
  • 提出了MiniST单元,统一网格和传感器数据表示
  • 提出了轻量级自注意力架构UrbanFM,学习时空依赖
  • 构建了大规模城市时空基准EvalST

方法论

通过数据、计算和架构的scaling,解决城市时空数据的异质性、相关性和动态性问题,训练轻量级自注意力模型。

原文摘要

Urban systems, as dynamic complex systems, continuously generate spatio-temporal data streams that encode the fundamental laws of human mobility and city evolution. While AI for Science has witnessed the transformative power of foundation models in disciplines like genomics and meteorology, urban computing remains fragmented due to "scenario-specific" models, which are overfitted to specific regions or tasks, hindering their generalizability. To bridge this gap and advance spatio-temporal foundation models for urban systems, we adopt scaling as the central perspective and systematically investigate two key questions: what to scale and how to scale. Grounded in first-principles analysis, we identify three critical dimensions: heterogeneity, correlation, and dynamics, aligning these principles with the fundamental scientific properties of urban spatio-temporal data. Specifically, to address heterogeneity through data scaling, we construct WorldST. This billion-scale corpus standardizes diverse physical signals, such as traffic flow and speed, from over 100 global cities into a unified data format. To enable computation scaling for modeling correlations, we introduce the MiniST unit, a novel split mechanism that discretizes continuous spatio-temporal fields into learnable computational units to unify representations of grid-based and sensor-based observations. Finally, addressing dynamics via architecture scaling, we propose UrbanFM, a minimalist self-attention architecture designed with limited inductive biases to autonomously learn dynamic spatio-temporal dependencies from massive data. Furthermore, we establish EvalST, the largest-scale urban spatio-temporal benchmark to date. Extensive experiments demonstrate that UrbanFM achieves remarkable zero-shot generalization across unseen cities and tasks, marking a pivotal first step toward large-scale urban spatio-temporal foundation models.

标签

城市计算 时空数据 基础模型 零样本学习

arXiv 分类

cs.LG cs.AI