Hierarchical Latent Structure Learning through Online Inference
AI 摘要
HOLMES模型结合在线推理和层级贝叶斯,实现了序列数据中层级结构的发现和学习。
主要贡献
- 提出了HOLMES模型,用于在线学习层级潜在结构。
- 验证了HOLMES模型在预测性能和表示紧凑性方面的优势。
- 提供了一个可处理的计算框架,用于发现序列数据中的层级结构。
方法论
结合嵌套的中国餐馆过程先验和序列蒙特卡洛推理,实现对层级潜在表示的在线推理。
原文摘要
Learning systems must balance generalization across experiences with discrimination of task-relevant details. Effective learning therefore requires representations that support both. Online latent-cause models support incremental inference but assume flat partitions, whereas hierarchical Bayesian models capture multilevel structure but typically require offline inference. We introduce the Hierarchical Online Learning of Multiscale Experience Structure (HOLMES) model, a computational framework for hierarchical latent structure learning through online inference. HOLMES combines a variation on the nested Chinese Restaurant Process prior with sequential Monte Carlo inference to perform tractable trial-by-trial inference over hierarchical latent representations without explicit supervision over the latent structure. In simulations, HOLMES matched the predictive performance of flat models while learning more compact representations that supported one-shot transfer to higher-level latent categories. In a context-dependent task with nested temporal structure, HOLMES also improved outcome prediction relative to flat models. These results provide a tractable computational framework for discovering hierarchical structure in sequential data.