Job Search in Thick Markets: Evidence from Italy [PDF]
I analyze empirically the effects of both urban and industrial agglomeration on men's and women's search behavior and on the efficiency of matching. The analysis is based on a unique panel data set from the Italian Labor Force Survey micro-data, which ...
Sabrina Di Addario
core +3 more sources
Aim: Accurate diagnosis in emergency departments relies heavily on clinical decision-making, yet cognitive errors contribute to a significant proportion of diagnostic mistakes.
Banu Arslan +4 more
doaj +1 more source
LLMs Judging LLMs: A Simplex Perspective
Given the challenge of automatically evaluating free-form outputs from large language models (LLMs), an increasingly common solution is to use LLMs themselves as the judging mechanism, without any gold-standard scores. Implicitly, this practice accounts for only sampling variability (aleatoric uncertainty) and ignores uncertainty about judge quality ...
Vossler, Patrick +4 more
openaire +2 more sources
Evaluating the Utilities of Foundation Models in Single‐Cell Data Analysis
This study delivers the first systematic, task‐level evaluation of single‐cell foundation models across eight core analytical tasks. By benchmarking 10 leading models with the scEval framework, it reveals where foundation models truly add value, where task‐specific methods still dominate, and provides concrete, reproducible guidelines to steer the next
Tianyu Liu +4 more
wiley +1 more source
Foregrounding doctoral knowledge and knower in the age of Generative Artificial Intelligence
While Generative Artificial Intelligence (AI) presents new challenges for doctoral education, it also offers an opportunity to refocus doctoral programmes on their fundamental purposes: contributing to knowledge and developing critical researchers.
Sioux McKenna
doaj +1 more source
LLM-SQL-Solver: Can LLMs Determine SQL Equivalence?
Judging the equivalence between two SQL queries is a fundamental problem with many practical applications in data management and SQL generation (i.e., evaluating the quality of generated SQL queries in the text-to-SQL task). While the research community has reasoned about SQL equivalence for decades, it poses considerable difficulties and no complete ...
Zhao, Fuheng +5 more
openaire +2 more sources
This study generates high‐fidelity synthetic longitudinal records for a million‐patient diabetes cohort, successfully replicating clinical predictive performance. However, deeper analysis reveals algorithmic biases and trajectory inconsistencies that escape standard quality metrics. These findings challenge current validation norms, demonstrating why a
Francisco Ortuño +5 more
wiley +1 more source
LLM-AutoDiff: Auto-Differentiate Any LLM Workflow
Large Language Models (LLMs) have reshaped natural language processing, powering applications from multi-hop retrieval and question answering to autonomous agent workflows. Yet, prompt engineering -- the task of crafting textual inputs to effectively direct LLMs -- remains difficult and labor-intensive, particularly for complex pipelines that combine ...
Yin, Li, Wang, Zhangyang
openaire +2 more sources
Causal Prediction of TP53 Variant Pathogenicity Using a Perturbation‐Informed Protein Language Model
A TP53‐specific predictor, CaVepP53, is developed by fine‐tuning ESMC on experimentally validated variants, quantifying pathogenicity via Euclidean distances. It outperforms general‐purpose models and extends to five cancer genes, enabling interpretable variant classification for precision medicine.
Huiying Chen +15 more
wiley +1 more source
Digital Friends and Empathy Blindness
Can chatbot-based virtual relationships replace physical ones? One possible bottleneck is lack of empathy in chatbots, as well as the attraction of physical relationships.
Bangsgaard Alberte Romme +3 more
doaj +1 more source

