AI Agents 相关度: 9/10

FAMOSE: A ReAct Approach to Automated Feature Discovery

Keith Burghardt, Jienan Liu, Sadman Sakib, Yuning Hao, Bo Li
arXiv: 2602.17641v1 发布: 2026-02-19 更新: 2026-02-19

AI 摘要

FAMOSE利用ReAct框架,自主进行特征工程,在表格数据上实现了自动化特征发现。

主要贡献

  • 首次将ReAct框架应用于自动化特征工程
  • 提出了自动特征增强和选择的智能体架构FAMOSE
  • 在回归和分类任务上取得了SOTA或接近SOTA的效果

方法论

构建基于ReAct范式的智能体,迭代探索、生成和优化特征,并集成特征选择和评估工具。

原文摘要

Feature engineering remains a critical yet challenging bottleneck in machine learning, particularly for tabular data, as identifying optimal features from an exponentially large feature space traditionally demands substantial domain expertise. To address this challenge, we introduce FAMOSE (Feature AugMentation and Optimal Selection agEnt), a novel framework that leverages the ReAct paradigm to autonomously explore, generate, and refine features while integrating feature selection and evaluation tools within an agent architecture. To our knowledge, FAMOSE represents the first application of an agentic ReAct framework to automated feature engineering, especially for both regression and classification tasks. Extensive experiments demonstrate that FAMOSE is at or near the state-of-the-art on classification tasks (especially tasks with more than 10K instances, where ROC-AUC increases 0.23% on average), and achieves the state-of-the-art for regression tasks by reducing RMSE by 2.0% on average, while remaining more robust to errors than other algorithms. We hypothesize that FAMOSE's strong performance is because ReAct allows the LLM context window to record (via iterative feature discovery and evaluation steps) what features did or did not work. This is similar to a few-shot prompt and guides the LLM to invent better, more innovative features. Our work offers evidence that AI agents are remarkably effective in solving problems that require highly inventive solutions, such as feature engineering.

标签

自动化特征工程 ReAct AI Agent 表格数据

arXiv 分类

cs.LG cs.AI