Book your room in the Turing Hotel! A symmetric and distributed Turing Test with multiple AIs and humans
AI 摘要
论文提出一种新型图灵测试“图灵酒店”,在多智能体和人类混合社区中进行,所有参与者既是裁判又是参与者。
主要贡献
- 提出新的图灵测试框架“图灵酒店”
- 设计并实现UNaIVERSE平台用于实验
- 首次在分布式环境中进行图灵测试
方法论
构建包含人类和LLM的混合社区,在UNaIVERSE平台上进行时间限制的讨论,分析判断结果。
原文摘要
In this paper, we report our experience with ``TuringHotel'', a novel extension of the Turing Test based on interactions within mixed communities of Large Language Models (LLMs) and human participants. The classical one-to-one interaction of the Turing Test is reinterpreted in a group setting, where both human and artificial agents engage in time-bounded discussions and, interestingly, are both judges and respondents. This community is instantiated in the novel platform UNaIVERSE (https://unaiverse.io), creating a ``World'' which defines the roles and interaction dynamics, facilitated by the platform's built-in programming tools. All communication occurs over an authenticated peer-to-peer network, ensuring that no third parties can access the exchange. The platform also provides a unified interface for humans, accessible via both mobile devices and laptops, that was a key component of the experience in this paper. Results of our experimentation involving 17 human participants and 19 LLMs revealed that current models are still sometimes confused as humans. Interestingly, there are several unexpected mistakes, suggesting that human fingerprints are still identifiable but not fully unambiguous, despite the high-quality language skills of artificial participants. We argue that this is the first experiment conducted in such a distributed setting, and that similar initiatives could be of national interest to support ongoing experiments and competitions aimed at monitoring the evolution of large language models over time.