Agent Tuning & Optimization 相关度: 6/10

Pareto Optimal Benchmarking of AI Models on ARM Cortex Processors for Sustainable Embedded Systems

Pranay Jain, Maximilian Kasper, Göran Köber, Axel Plinge, Dominik Seuß
arXiv: 2602.17508v1 发布: 2026-02-19 更新: 2026-02-19

AI 摘要

针对嵌入式系统,研究ARM Cortex处理器上AI模型的能效优化,提出了Pareto最优基准测试框架。

主要贡献

  • 构建了自动化测试平台,评估不同处理器和AI模型的性能指标
  • 揭示了浮点运算(FLOPs)与推理时间的线性关系
  • 通过Pareto分析,平衡能耗和模型精度之间的权衡

方法论

设计自动化测试平台,在ARM Cortex处理器上运行AI模型,收集KPI数据,进行Pareto分析,找出最优配置。

原文摘要

This work presents a practical benchmarking framework for optimizing artificial intelligence (AI) models on ARM Cortex processors (M0+, M4, M7), focusing on energy efficiency, accuracy, and resource utilization in embedded systems. Through the design of an automated test bench, we provide a systematic approach to evaluate across key performance indicators (KPIs) and identify optimal combinations of processor and AI model. The research highlights a nearlinear correlation between floating-point operations (FLOPs) and inference time, offering a reliable metric for estimating computational demands. Using Pareto analysis, we demonstrate how to balance trade-offs between energy consumption and model accuracy, ensuring that AI applications meet performance requirements without compromising sustainability. Key findings indicate that the M7 processor is ideal for short inference cycles, while the M4 processor offers better energy efficiency for longer inference tasks. The M0+ processor, while less efficient for complex AI models, remains suitable for simpler tasks. This work provides insights for developers, guiding them to design energy-efficient AI systems that deliver high performance in realworld applications.

标签

嵌入式系统 AI模型优化 ARM Cortex 能效 Pareto分析

arXiv 分类

cs.AI