Some of the following articles may not be open access.
A Survey on Evaluation of Large Language Models [PDF]
Large language models (LLMs) are gaining increasing popularity in both academia and industry, owing to their unprecedented performance in various applications. As LLMs continue to play a vital role in both research and daily use, their evaluation becomes
Yu-Chu Chang +15 more
semanticscholar +1 more source
MME: A Comprehensive Evaluation Benchmark for Multimodal Large Language Models [PDF]
Multimodal Large Language Model (MLLM) relies on the powerful LLM to perform multimodal tasks, showing amazing emergent abilities in recent studies, such as writing poems based on an image. However, it is difficult for these case studies to fully reflect
Chaoyou Fu +12 more
semanticscholar +1 more source
CLIPScore: A Reference-free Evaluation Metric for Image Captioning [PDF]
Image captioning has conventionally relied on reference-based automatic evaluations, where machine captions are compared against captions written by humans. This is in contrast to the reference-free manner in which humans assess caption quality.
Jack Hessel +4 more
semanticscholar +1 more source
Holistic Evaluation of Language Models
Language models (LMs) like GPT‐3, PaLM, and ChatGPT are the foundation for almost all major language technologies, but their capabilities, limitations, and risks are not well understood. We present Holistic Evaluation of Language Models (HELM) to improve
Percy Liang +49 more
semanticscholar +1 more source
A research agenda for malaria eradication: monitoring, evaluation, and surveillance. [PDF]
Monitoring, evaluation, and surveillance measure how well public health programs operate over time and achieve their goals. As countries approach malaria elimination, these activities will need to shift from measuring reductions in morbidity and ...
malERA Consultative Group on Monitoring, Evaluation, and Surveillance
doaj +1 more source
Risk of herpes zoster following mRNA COVID-19 vaccine administration
Background: Adverse events following mRNA COVID-19 vaccines, including herpes zoster (HZ), have been reported. We conducted a cohort study to evaluate the association between mRNA COVID-19 vaccination and subsequent HZ at Kaiser Permanente Southern ...
Ana Florea +7 more
doaj +1 more source
An evaluation of course evaluations [PDF]
Student ratings of teaching have been used, studied, and debated for almost a century. This article examines student ratings of teaching from a statistical perspective. The common practice of relying on averages of student teaching evaluation scores as the primary measure of teaching effectiveness for promotion and tenure decisions should be ...
Philip Stark, Richard Freishtat
openaire +2 more sources

