Benchmarking Large Language Models Against Psychiatry Residents Using Traditional Institutional Assessments.
Sethi MIS +7 more
europepmc +1 more source
Enhancing large language model clinical support information with machine learning risk and explainability: a feasibility study.
Yeh YC +5 more
europepmc +1 more source
Zero-shot performance of selected large language and multimodal models on the 2023 Brazilian Portuguese medical residency exam.
Truyts CAM +9 more
europepmc +1 more source
Evaluating cognitive depth of AI-generated multiple-choice questions with Bloom's Taxonomy.
Nguyen TT +4 more
europepmc +1 more source
PsychiatryBench: a multi-task benchmark for LLMs in psychiatry.
Fouda AE +3 more
europepmc +1 more source
ChatGPT, Gemini, and Claude in clinical and dermoscopic image analysis of basal cell carcinoma and its common mimickers: A comparative performance analysis.
Boostani M +13 more
europepmc +1 more source
Impact of Large Language Model Assistance on Radiologists' Diagnostic Performance for Brain Tumors by Experience Level.
Song CW +8 more
europepmc +1 more source
Folgore da San Gimignano, Cenne de la Chitarra d’Arezzo, Couronnes et autres sonnets
Hélène Basso
doaj +1 more source
Evaluation of validity, reliability, and readability of AI chatbots for gestational diabetes mellitus: a multi-model comparative study.
Wang X +5 more
europepmc +1 more source
A comparative accuracy study of multimodal LLMs, VLM and agent-based framework for pulmonary nodule detection on chest radiographs.
Khovanova D +5 more
europepmc +1 more source

