Human tests for machine models: What lies “Beyond the Imitation Game”?
Abstract Benchmarking large language models (LLMs) is a key practice for evaluating their capabilities and risks. This paper considers the development of “BIG Bench,” a crowdsourced benchmark designed to test LLMs “Beyond the Imitation Game.” Drawing on linguistic anthropological and ethnographic analysis of the project's GitHub repository, we examine ...
Noya Kohavi, Anna Weichselbraun
wiley +1 more source
AI-Assisted Tools for Scientific Review Writing: Opportunities and Cautions.
Silva JCMC +6 more
europepmc +1 more source
Abstract This paper asks how LLM‐based systems can produce text that is taken as contextually appropriate by humans without having seen text in its broader context. To understand how this is possible, context and co‐text have to be distinguished. Co‐text is input to LLMs during training and at inference as well as the primary resource of sense‐making ...
Ole Pütz
wiley +1 more source
Children's Numerical Estimation Is Biased by Male Informants More Than Female Informants
ABSTRACT Numerical estimation is one of the key early math skills and predicts children's long‐term math achievement. While children are born with an intuitive “number sense,” they do not form a mapping between nonverbal numerical representations and symbolic numbers until about 5 years of age. This protracted learning process is embedded in children's
Kathleen Cracknell +5 more
wiley +1 more source
Geriatric core competencies for non-geriatricians and nurses: a scoping review.
Yang DC +5 more
europepmc +1 more source
Body height and the excess cancer risk in men
What's new? In cancers that affect both sexes, men usually have a higher risk than women. While this is often attributed to behavioral factors, such as exposure to environmental carcinogens, there may be an intrinsic biological mechanism involved. Tall stature has been associated with increased cancer risk.
Cecilia Radkiewicz +6 more
wiley +1 more source
A self-correcting Agentic Graph RAG for clinical decision support in hepatology.
Hu Y +6 more
europepmc +1 more source
ABSTRACT Background In the Information Age, prospective teachers increasingly rely on online sources for research and lesson preparation. This entails dynamic, situation‐specific interactions within digital environments, which are increasingly shaped by artificial intelligence.
Carla Schelle +4 more
wiley +1 more source
ABSTRACT This meta‐analysis examines whether stereotype threat (ST) influences consumer‐related outcomes—domains traditionally excluded from performance‐based ST research. Drawing on 247 effects from 83 experimental studies (N = 11,683), we found a marginal to small overall effect when comparing ST conditions to control groups.
Yuri Marcel Dallabrida +2 more
wiley +1 more source

