Benchmarking LLMs' Judgments with No Gold Standard [PDF]
We introduce the GEM (Generative Estimator for Mutual Information), an evaluation metric for assessing language generation by Large Language Models (LLMs), particularly in generating informative judgments, without the need for a gold standard reference.
arxiv
Biopsy is the gold standard of diagnosis of celiac sprue [PDF]
Joseph A. Murray, Peter H.R. Green
openalex +1 more source
Assessing the Ecological Value: Monetizing Process Innovations in Tailored Forming
This article introduces a method for evaluating the sustainability of innovations, even with limited data. The method is illustrated through an analysis of the “Tailored Forming” technology, which explores the impact of sustainability on economic value added.
Jonas Schneider +4 more
wiley +1 more source
Conversational Gold: Evaluating Personalized Conversational Search System using Gold Nuggets [PDF]
The rise of personalized conversational search systems has been driven by advancements in Large Language Models (LLMs), enabling these systems to retrieve and generate answers for complex information needs. However, the automatic evaluation of responses generated by Retrieval Augmented Generation (RAG) systems remains an understudied challenge. In this
arxiv
Breast ultrasound ‐ the ‘gold standard’ and other problems [PDF]
B.-Joachim Hackelöer
openalex +1 more source
The Gold Standard and the Great Depression [PDF]
This paper, written primarily for historians, attempts to explain why political leaders and central bankers continued to adhere to the gold standard as the Great Depression intensified.
Barry Eichengreen, Peter Temin
core
Mechanisms of De‐icing by Surface Rayleigh and Plate Lamb Acoustic Waves
Ice accretion impacts daily life, renewable energy generation, maintenance, and security in industries and aeronautics. Acoustic waves (AW) are a promising method for ice removal, although de‐icing mechanisms require further investigation to optimize energy efficiency.
Shilpi Pandey +15 more
wiley +1 more source
Liver biopsy in the long-term follow-up of liver transplant patients: Still the gold standard
C A Riely, Santiago Vera
openalex +1 more source
Is the Crohnʼs Disease Activity Index Outdated? Yes. Is It the Gold Standard that Clinicians Should Use? No. [PDF]
Burton I. Korelitz
openalex +1 more source