LLM-Driven 3D Scene Generation of Agricultural Simulation Environments
AI 摘要
利用多LLM流水线从自然语言提示生成农业模拟环境的3D场景,提高效率和精度。
主要贡献
- 提出了一种模块化的多LLM流水线用于生成农业3D模拟环境。
- 结合了3D资产检索、领域知识注入和代码生成技术。
- 采用混合策略优化LLM,包括few-shot prompting, RAG, finetuning和validation。
方法论
构建多LLM流水线,利用LLM生成Unreal引擎代码,并采用RAG和微调等技术提升LLM性能,最后通过用户研究和专家对比评估。
原文摘要
Procedural generation techniques in 3D rendering engines have revolutionized the creation of complex environments, reducing reliance on manual design. Recent approaches using Large Language Models (LLMs) for 3D scene generation show promise but often lack domain-specific reasoning, verification mechanisms, and modular design. These limitations lead to reduced control and poor scalability. This paper investigates the use of LLMs to generate agricultural synthetic simulation environments from natural language prompts, specifically to address the limitations of lacking domain-specific reasoning, verification mechanisms, and modular design. A modular multi-LLM pipeline was developed, integrating 3D asset retrieval, domain knowledge injection, and code generation for the Unreal rendering engine using its API. This results in a 3D environment with realistic planting layouts and environmental context, all based on the input prompt and the domain knowledge. To enhance accuracy and scalability, the system employs a hybrid strategy combining LLM optimization techniques such as few-shot prompting, Retrieval-Augmented Generation (RAG), finetuning, and validation. Unlike monolithic models, the modular architecture enables structured data handling, intermediate verification, and flexible expansion. The system was evaluated using structured prompts and semantic accuracy metrics. A user study assessed realism and familiarity against real-world images, while an expert comparison demonstrated significant time savings over manual scene design. The results confirm the effectiveness of multi-LLM pipelines in automating domain-specific 3D scene generation with improved reliability and precision. Future work will explore expanding the asset hierarchy, incorporating real-time generation, and adapting the pipeline to other simulation domains beyond agriculture.