ACE-Brain-0: Spatial Intelligence as a Shared Scaffold for Universal Embodiments
AI 摘要
ACE-Brain-0通过空间智能统一多种具身智能任务,提升泛化能力和特定领域性能。
主要贡献
- 提出ACE-Brain-0通用基础模型
- 提出Scaffold-Specialize-Reconcile (SSR) 范式
- 采用Group Relative Policy Optimization (GRPO) 提升模型能力
方法论
利用空间智能作为通用支架,构建统一的MLLM,通过SSR范式和GRPO进行优化。
原文摘要
Universal embodied intelligence demands robust generalization across heterogeneous embodiments, such as autonomous driving, robotics, and unmanned aerial vehicles (UAVs). However, existing embodied brain in training a unified model over diverse embodiments frequently triggers long-tail data, gradient interference, and catastrophic forgetting, making it notoriously difficult to balance universal generalization with domain-specific proficiency. In this report, we introduce ACE-Brain-0, a generalist foundation brain that unifies spatial reasoning, autonomous driving, and embodied manipulation within a single multimodal large language model~(MLLM). Our key insight is that spatial intelligence serves as a universal scaffold across diverse physical embodiments: although vehicles, robots, and UAVs differ drastically in morphology, they share a common need for modeling 3D mental space, making spatial cognition a natural, domain-agnostic foundation for cross-embodiment transfer. Building on this insight, we propose the Scaffold-Specialize-Reconcile~(SSR) paradigm, which first establishes a shared spatial foundation, then cultivates domain-specialized experts, and finally harmonizes them through data-free model merging. Furthermore, we adopt Group Relative Policy Optimization~(GRPO) to strengthen the model's comprehensive capability. Extensive experiments demonstrate that ACE-Brain-0 achieves competitive and even state-of-the-art performance across 24 spatial and embodiment-related benchmarks.