FactorEngine: A Program-level Knowledge-Infused Factor Mining Framework for Quantitative Investment
AI 摘要
FactorEngine利用程序级知识挖掘框架,提升量化投资中alpha因子的发现效率和效果。
主要贡献
- 提出程序级因子发现框架FactorEngine
- 知识引导的bootstrapping模块,从非结构化金融报告生成可执行因子程序
- 经验知识库支持轨迹感知改进,从失败中学习
方法论
将因子建模为图灵完备代码,分离逻辑修订和参数优化,LLM引导搜索与贝叶斯超参数搜索,结合知识引导和经验学习。
原文摘要
We study alpha factor mining, the automated discovery of predictive signals from noisy, non-stationary market data-under a practical requirement that mined factors be directly executable and auditable, and that the discovery process remain computationally tractable at scale. Existing symbolic approaches are limited by bounded expressiveness, while neural forecasters often trade interpretability for performance and remain vulnerable to regime shifts and overfitting. We introduce FactorEngine (FE), a program-level factor discovery framework that casts factors as Turing-complete code and improves both effectiveness and efficiency via three separations: (i) logic revision vs. parameter optimization, (ii) LLM-guided directional search vs. Bayesian hyperparameter search, and (iii) LLM usage vs. local computation. FE further incorporates a knowledge-infused bootstrapping module that transforms unstructured financial reports into executable factor programs through a closed-loop multi-agent extraction-verification-code-generation pipeline, and an experience knowledge base that supports trajectory-aware refinement (including learning from failures). Across extensive backtests on real-world OHLCV data, FE produces factors with substantially stronger predictive stability and portfolio impact-for example, higher IC/ICIR (and Rank IC/ICIR) and improved AR/Sharpe, than baseline methods, achieving state-of-the-art predictive and portfolio performance.