Performance of large language models on sleep medicine certification examination: a comprehensive multi-model analysis. [PDF]
Koç A +3 more
europepmc +1 more source
Multi-center benchmarking of large language models for clinical decision support in lung cancer screening. [PDF]
Duan Z +14 more
europepmc +1 more source
Comparative Evaluation of Responses from ChatGPT-5, Gemini 2.5 Flash, Grok 4, and Claude Sonnet-4 Chatbots to Questions About Endodontic Iatrogenic Events. [PDF]
Taşyürek M, Adıgüzel Ö, Ortaç H.
europepmc +1 more source
Four decades of ADHD: a systematic AI-assisted analysis of conceptual shifts across six DSM editions. [PDF]
Ophir Y, Shir-Raz Y, Tikochinski R.
europepmc +1 more source
Challenges and Limitations of Multimodal Large Language Models in Interpreting Pediatric Panoramic Radiographs. [PDF]
Mine Y +9 more
europepmc +1 more source
Benchmarking Large Language Models for Drug Combination Alerts: Achieving Expert-Level Reliability via Knowledge Grounding and Contextual Reasoning. [PDF]
Hu H +6 more
europepmc +1 more source
Diagnostic accuracy of generative large language artificial intelligence models for the assessment of dental crowding. [PDF]
Wafaie K +5 more
europepmc +1 more source
Comparative evaluation of viral hepatitis question responses: ChatGPT-4.5 outperforms three established models. [PDF]
Ma J +8 more
europepmc +1 more source

