Human tests for machine models: What lies “Beyond the Imitation Game”?
Abstract: Benchmarking large language models (LLMs) is a key practice for evaluating their capabilities and risks. This paper considers the development of “BIG Bench,” a crowdsourced benchmark designed to test LLMs “Beyond the Imitation Game.” Drawing on linguistic anthropological and ethnographic analysis of the project's GitHub repository, we examine ...
Noya Kohavi, Anna Weichselbraun
When algorithmic managers fail to fulfill their promises: The role of anthropomorphism in shaping justice perceptions.
Fousiani K +3 more
Why should I use social chatbots? On potential users' acceptance and the role of anthropomorphism.
Rüth M, Eifler JM, Schneider AC.
Beyond the Machine: An Integrative Framework of Anthropomorphism in AI.
Curșeu PL, Radu Ș.
Understanding Chinese Consumers' Purchase Resistance in Virtual Live Streaming Rooms: The Role of Negative Anthropomorphism Disconfirmation and Service Guarantees.
Qin F, Li L, Mi J.
Effect of AI empathy perception on employees' prosocial behavior: mediating role of warmth and moderating role of AI anthropomorphism.
Xue J, Liu Y, Ren Z, Wu Y.
Partner or burden? The dual pathways linking perceived attributes of intelligent cockpits to human-machine collaboration willingness via cognitive load.
Li S.
The Lady who served the potion: the Eleusinian sacrament personified
Ruck, Carl A.
One size fits all? Transferring social mindfulness measures to HRI.
Nientimp D +3 more

