LLM Reasoning 相关度: 6/10

xplainfi: Feature Importance and Statistical Inference for Machine Learning in R

Lukas Burk, Fiona Katharina Ewald, Giuseppe Casalicchio, Marvin N. Wright, Bernd Bischl
arXiv: 2603.15306v1 发布: 2026-03-16 更新: 2026-03-16

AI 摘要

xplainfi是一个R包,提供多种特征重要性方法和统计推断,增强机器学习模型的可解释性。

主要贡献

  • 实现多种特征重要性方法
  • 提供基于高斯分布等的条件抽样架构
  • 支持多种统计推断方法

方法论

该软件包基于mlr3生态系统,实现了置换特征重要性、条件特征重要性等方法,并提供统计推断工具。

原文摘要

We introduce xplainfi, an R package built on top of the mlr3 ecosystem for global, loss-based feature importance methods for machine learning models. Various feature importance methods exist in R, but significant gaps remain, particularly regarding conditional importance methods and associated statistical inference procedures. The package implements permutation feature importance, conditional feature importance, relative feature importance, leave-one-covariate-out, and generalizations thereof, and both marginal and conditional Shapley additive global importance methods. It provides a modular conditional sampling architecture based on Gaussian distributions, adversarial random forests, conditional inference trees, and knockoff-based samplers, which enable conditional importance analysis for continuous and mixed data. Statistical inference is available through multiple approaches, including variance-corrected confidence intervals and the conditional predictive impact framework. We demonstrate that xplainfi produces importance scores consistent with existing implementations across multiple simulation settings and learner types, while offering competitive runtime performance. The package is available on CRAN and provides researchers and practitioners with a comprehensive toolkit for feature importance analysis and model interpretation in R.

标签

特征重要性 机器学习 R语言 模型可解释性

arXiv 分类

cs.LG