Reliability of Gemini 2.5 Pro, ChatGPT 4.1, DeepSeek V3, and Claude Opus 4 in generating standardized CMR protocols. [PDF]
Licu RA +11 more
europepmc +1 more source
Divergent creativity in humans and large language models. [PDF]
Bellemare-Pepin A +7 more
europepmc +1 more source
Abstract: LLM detectors aim to detect text generated by an LLM. They fall into two main types: specific detectors and general detectors. Specific detectors target a particular type of language or context, such as hate speech or spam. In contrast, general detectors aim to identify a broad range of problematic language, such as
openaire +1 more source
Open- and closed-source LLMs in medical and engineering education. [PDF]
Sun L +9 more
europepmc +1 more source
Can Artificial Intelligence Models Appropriately Recommend Knee Arthroplasty Surgeons? [PDF]
Emrich CM +5 more
europepmc +1 more source
Bayesian teaching enables probabilistic reasoning in large language models. [PDF]
Qiu L +5 more
europepmc +1 more source
Uptake of Large Language Models by London Medical Students: Exploratory Qualitative Interview Study. [PDF]
Alazzawi M, Lam K.
europepmc +1 more source
This project focuses on developing a system designed to evaluate and compare the efficiency of LLMs (large language models) such as GPT and Cohere, the two used in this project. The main objective was to create a tool that allows interaction with an LLM under test and uses a reference LLM to evaluate
Ramos González, Gonzalo +2 more
openaire +1 more source
Human and large language model judgments of cognitive impairment from language: An explainable artificial intelligence approach. [PDF]
Zadok M +5 more
europepmc +1 more source
Evaluating reasoning large language models with human-like thinking in ophthalmic question answering. [PDF]
Wang Z +9 more
europepmc +1 more source

