AI Agents 相关度: 9/10

ELISA: An Interpretable Hybrid Generative AI Agent for Expression-Grounded Discovery in Single-Cell Genomics

Omar Coser
arXiv: 2603.11872v1 发布: 2026-03-12 更新: 2026-03-12

AI 摘要

ELISA是一个可解释的混合生成式AI Agent,用于单细胞基因组学中的表达驱动发现。

主要贡献

  • 提出了ELISA框架,整合scGPT嵌入、BioBERT语义检索和LLM解释。
  • 实现了自动查询分类和多种分析模块,直接操作嵌入数据。
  • 在多个单细胞数据集上验证了ELISA的有效性,优于CellWhisperer。

方法论

ELISA结合了scGPT嵌入、BioBERT检索和LLM解释,构建混合AI Agent,进行单细胞数据的分析和生物学假设生成。

原文摘要

Translating single-cell RNA sequencing (scRNA-seq) data into mechanistic biological hypotheses remains a critical bottleneck, as agentic AI systems lack direct access to transcriptomic representations while expression foundation models remain opaque to natural language. Here we introduce ELISA (Embedding-Linked Interactive Single-cell Agent), an interpretable framework that unifies scGPT expression embeddings with BioBERT-based semantic retrieval and LLM-mediated interpretation for interactive single-cell discovery. An automatic query classifier routes inputs to gene marker scoring, semantic matching, or reciprocal rank fusion pipelines depending on whether the query is a gene signature, natural language concept, or mixture of both. Integrated analytical modules perform pathway activity scoringacross 60+ gene sets, ligand--receptor interaction prediction using 280+ curated pairs, condition-aware comparative analysis, and cell-type proportion estimation all operating directly on embedded data without access to the original count matrix. Benchmarked across six diverse scRNA-seq datasets spanning inflammatory lung disease, pediatric and adult cancers, organoid models, healthy tissue, and neurodevelopment, ELISA significantly outperforms CellWhisperer in cell type retrieval (combined permutation test, $p < 0.001$), with particularly large gains on gene-signature queries (Cohen's $d = 5.98$ for MRR). ELISA replicates published biological findings (mean composite score 0.90) with near-perfect pathway alignment and theme coverage (0.98 each), and generates candidate hypotheses through grounded LLM reasoning, bridging the gap between transcriptomic data exploration and biological discovery. Code available at: https://github.com/omaruno/ELISA-An-AI-Agent-for-Expression-Grounded-Discovery-in-Single-Cell-Genomics.git (If you use ELISA in your research, please cite this work).

标签

single-cell genomics AI Agent LLM scRNA-seq interpretability

arXiv 分类

q-bio.GN cs.AI