Results 221 to 230 of about 176,171 (277)
The search criteria for three systematic reviews were replicated using ChatGPTv3.5 and Bard. Outputs included author, title, publication year, and journal and were cross‐referenced with medical databases and compared to the original reviews. Large language models failed to replicate peer‐reviewed methodologies.
Ajibola B. Bakare +4 more
wiley +1 more source
Leveraging Large Language Models to Improve the Readability of German Online Medical Texts: Evaluation Study. [PDF]
Miftaroski A +3 more
europepmc +1 more source
A Survey on Medical Competence Evaluation Benchmarks for Large Language Models
Our study presents a comprehensive review of the established methodologies and benchmarks for evaluating the medical competence of LLMs, encompassing a thorough analysis of current assessment practices across medical knowledge, clinical practice competence, and ethical–safety considerations.
Qiting Wang +5 more
wiley +1 more source
In a nationwide survey of 1247 Chinese dermatologists, ChatGPT‐4o was the most preferred large language model (LLM) for psoriasis‐related clinical tasks. Accuracy emerged as the top evaluation criterion for LLM‐generated responses. All icons were made by Freepik and Maan Icons from https://www.flaticon.com/ ABSTRACT Background Large language models ...
Jungang Yang +8 more
wiley +1 more source
Leveraging generative AI to enhance Synthea model development. [PDF]
Kramer MA +3 more
europepmc +1 more source
Abstract To determine the prevalence of postpartum depression (PPD) and postpartum stress (PPS) and identify associated risk factors among mothers of preterm and low birth weight (LBW) infants. We conducted a secondary analysis of data collected from 255 mothers with preterm and LBW infants admitted to the neonatal intensive care unit (NICU) at Korle ...
John Pellegrino +6 more
wiley +1 more source
Automated information extraction from plant specimen labels using OCR and large language models. [PDF]
Wen J.
europepmc +1 more source
Abstract Introduction To evaluate the accuracy and completeness of responses across common obstetrical and gynecologic topics generated by the large language models (LLMs) ChatGPT and Google Gemini, which have become increasingly popular for patients seeking medical information before physician consultations.
Madeline West +6 more
wiley +1 more source

