Human tests for machine models: What lies “Beyond the Imitation Game”?
Abstract Benchmarking large language models (LLMs) is a key practice for evaluating their capabilities and risks. This paper considers the development of “BIG Bench,” a crowdsourced benchmark designed to test LLMs “Beyond the Imitation Game.” Drawing on linguistic anthropological and ethnographic analysis of the project's GitHub repository, we examine ...
Noya Kohavi, Anna Weichselbraun
wiley +1 more source
Detecting Beta-amyloid Plaque via Low Rank Based Orthogonal Projection and Spatial-spectrum Detector Using High-resolution Quantitative Susceptibility Mapping for Preclinical Studies. [PDF]
Chen J +10 more
europepmc +1 more source
More Than Regulation: Challenging Habermas on the Future of the Public Sphere
Journal of Social Philosophy, EarlyView.
Bernardo Ferro
wiley +1 more source
Lability in Hittite and Indo‐European: A Diachronic Perspective
ABSTRACT Lability is defined as the possibility of a verb to enter a valency alternation without undergoing any change in its form. Labile verbs were common in ancient Indo‐European languages, including Hittite, which mostly features anticausative lability, with reflexive and reciprocal lability being less prominent.
Guglielmo Inglese
wiley +1 more source
Unveiling optimal mother wavelets by COPRAS Method Analyzing speech signals despite face mask and shield obstacles. [PDF]
Marxim Rahula Bharathi B +6 more
europepmc +1 more source
Equivariant toric geometry and Euler–Maclaurin formulae
Abstract We first investigate torus‐equivariant motivic characteristic classes of toric varieties, and then apply them via the equivariant Riemann–Roch formalism to prove very general Euler–Maclaurin‐type formulae for full‐dimensional simple lattice polytopes.
Sylvain E. Cappell +3 more
wiley +1 more source
Ultrasonic Localization of Transformer Patrol Robot Based on Wavelet Transform and Narrowband Beamforming. [PDF]
Ji H +5 more
europepmc +1 more source
Bayesian Inference for Spatially‐Temporally Misaligned Data Using Predictive Stacking
ABSTRACT Air pollution remains a major environmental risk factor that is often associated with adverse health outcomes. However, quantifying and evaluating its effects on human health is challenging due to the complex nature of exposure data. Recent technological advances have led to the collection of various indicators of air pollution at increasingly
Soumyakanti Pan, Sudipto Banerjee
wiley +1 more source
Carbon market price prediction in the Yangtze River Basin based on improved deep learning ensemble model with CEEMDAN and Attention-RNN. [PDF]
Lu Z, Cao Z, Xiang Z, Li J, Li M.
europepmc +1 more source
Forecasting Carbon Prices: A Literature Review
ABSTRACT Carbon emissions trading is utilized by a growing number of states as a significant tool for addressing greenhouse gas emissions (GHG), global warming problem and the climate crisis. Accurate forecasting of carbon prices is essential for effective policy design and investment strategies in climate change mitigation.
Konstantinos Bisiotis +2 more
wiley +1 more source

