Results 81 to 90 of about 2,156,357 (213)
Disagreement between human and AI evaluation of treatment plans. [PDF]
Sengupta D, Panda S.
europepmc +1 more source
Evaluating Medical Text Summaries Using Automatic Evaluation Metrics and LLM-as-a-Judge Approach: A Pilot Study. [PDF]
Vasilev Y +8 more
europepmc +1 more source
Evaluating LLMs' divergent thinking capabilities for scientific idea generation with minimal context. [PDF]
Ruan K +5 more
europepmc +1 more source
Plagiarism: bringing economics and education together (with a little help from IT) [PDF]
Judge, Guy
core
Spontaneous Expulsion of a Sebaceous Cyst: A Case Report of a Rare Surgical Outcome. [PDF]
Gill MS, Cheema P.
europepmc +1 more source
Automating expert-level medical reasoning evaluation of large language models. [PDF]
Zhou S +18 more
europepmc +1 more source
Equal Employment Opportunity Commission, Plaintiff, v. Wal-Mart Stores, Inc., Defendant. [PDF]
Reinhard, Judge
core +1 more source
A Suite of LMs Comprehend Puzzle Statements as Well or Better Than Humans. [PDF]
Rakshit S +3 more
europepmc +1 more source

