Moving LLM evaluation forward: lessons from human judgment research. [PDF]
Polonioli A.
europepmc +1 more source
Evaluating Large Language Models for Burning Mouth Syndrome Diagnosis. [PDF]
Suga T, Uehara O, Abiko Y, Toyofuku A.
europepmc +1 more source
Semantic embeddings reveal and address taxonomic incommensurability in psychological measurement. [PDF]
Wulff DU, Mata R.
europepmc +1 more source
Comparative evaluation of OpenAI O1 and human performance in higher order cognition. [PDF]
Latif E +8 more
europepmc +1 more source
The effect of generative artificial intelligence literacy on academic achievement: the mediating role of academic self-efficacy and the moderating role of critical thinking. [PDF]
Wang Y.
europepmc +1 more source
An experimental study of classical truth logic on multi-propositions consistent and incompatible: Dual-process theories and modal syllogistic of deduction. [PDF]
Waheed S, Waheed A, Habib S.
europepmc +1 more source
Tracking vaccine effectiveness in an evolving pandemic, countering misleading hot takes and epidemiologic fallacies. [PDF]
Morris JS.
europepmc +1 more source
Affective Reactions When Learning That Our Answer Is Biased: The Role of Negative Feedback in the Arousal of Epistemic Emotions. [PDF]
Nerantzaki K +2 more
europepmc +1 more source
Ensuring scientific rigour: essential recommendations on how to identify prior evidence in medicine-author's response. [PDF]
GuimarĂ£es JSF, Leite MF, Oliveira AG.
europepmc +1 more source

