LLM Memory & RAG 相关度: 8/10

Retrieval-Augmented Question Answering over Scientific Literature for the Electron-Ion Collider

Tina. J. Jat, T. Ghosh, Karthik Suresh
arXiv: 2604.02259v1 发布: 2026-04-02 更新: 2026-04-02

AI 摘要

该论文构建了一个基于本地LLaMA模型的Electron-Ion Collider (EIC) 领域RAG问答系统。

主要贡献

  • 构建了本地部署的EIC相关文档RAG系统
  • 使用开源LLaMA模型进行答案生成
  • 提供了一种数据隐私保护的解决方案

方法论

使用本地arXiv EIC文章数据库作为知识库,结合LLaMA模型,通过RAG流程实现领域问答。

原文摘要

To harness the power of Language Models in answering domain specific specialized technical questions, Retrieval Augmented Generation (RAG) is been used widely. In this work, we have developed a Q\&A application inspired by the Retrieval Augmented Generation (RAG), which is comprised of an in-house database indexed on the arXiv articles related to the Electron-Ion Collider (EIC) experiment - one of the largest international scientific collaboration and incorporated an open-source LLaMA model for answer generation. This is an extension to it's proceeding application built on proprietary model and Cloud-hosted external knowledge-base for the EIC experiment. This locally-deployed RAG-system offers a cost-effective, resource-constraint alternative solution to build a RAG-assisted Q\&A application on answering domain-specific queries in the field of experimental nuclear physics. This set-up facilitates data-privacy, avoids sending any pre-publication scientific data and information to public domain. Future improvement will expand the knowledge base to encompass heterogeneous EIC-related publications and reports and upgrade the application pipeline orchestration to the LangGraph framework.

标签

RAG LLaMA Electron-Ion Collider 领域问答 本地部署

arXiv 分类

hep-ex cs.AI physics.ins-det