How to Optimize Multispecies Set Predictions in Presence-Absence Modeling ?
AI 摘要
提出了MaxExp和SSE两种方法,用于优化多物种存在-缺席模型的二值化预测。
主要贡献
- 提出了MaxExp二值化框架,通过最大化评估指标选择最佳物种组合
- 提出了SSE方法,基于预期物种丰富度预测组合,计算效率高
- 通过多个案例研究验证了MaxExp和SSE的有效性,尤其是在类别不平衡和稀有物种情况下
方法论
提出两种二值化方法,MaxExp通过直接优化评估指标选择物种组合,SSE基于预期物种丰富度进行预测,并使用实际数据进行验证。
原文摘要
Species distribution models (SDMs) commonly produce probabilistic occurrence predictions that must be converted into binary presence-absence maps for ecological inference and conservation planning. However, this binarization step is typically heuristic and can substantially distort estimates of species prevalence and community composition. We present MaxExp, a decision-driven binarization framework that selects the most probable species assemblage by directly maximizing a chosen evaluation metric. MaxExp requires no calibration data and is flexible across several scores. We also introduce the Set Size Expectation (SSE) method, a computationally efficient alternative that predicts assemblages based on expected species richness. Using three case studies spanning diverse taxa, species counts, and performance metrics, we show that MaxExp consistently matches or surpasses widely used thresholding and calibration methods, especially under strong class imbalance and high rarity. SSE offers a simpler yet competitive option. Together, these methods provide robust, reproducible tools for multispecies SDM binarization.