Identifying recurrent stone formers with machine learning: A single-centre observational study. [PDF]
Amado P +7 more
europepmc +1 more source
The Challenge of Handling Structured Missingness in Integrated Data Sources
As data integration becomes ever more prevalent, a new research question that emerges is how to handle missing values that will inevitably arise in these large‐scale integrated databases? This missingness can be described as structured missingness, encompassing scenarios involving multivariate missingness mechanisms and deterministic, nonrandom ...
James Jackson +6 more
wiley +1 more source
Assessing imputation techniques for missing data in small and multicollinear datasets: insights from craniofacial morphometry. [PDF]
Abdullah NA +3 more
europepmc +1 more source
This study integrates random matrix theory (RMT) and principal component analysis (PCA) to improve the identification of correlated regions in HIV protein sequences for vaccine design. PCA validation enhances the reliability of RMT‐derived correlations, particularly in small‐sample, high‐dimensional datasets, enabling more accurate detection of ...
Mariyam Siddiqah +3 more
wiley +1 more source
A reference panel for linkage disequilibrium and genotype imputation using whole-genome sequencing data from 2,680 participants across India. [PDF]
Li Z +13 more
europepmc +1 more source
An Autonomous Large Language Model‐Agent Framework for Transparent and Local Time Series Forecasting
Architecture of the proposed large language model (LLM)‐based agent framework for autonomous time series forecasting in thermal power generation systems. The framework operates through a vertical pipeline initiated by natural language queries from users, which are processed by the LLM Agent Core powered by Llama.cpp and a ReAct loop with persistent ...
William Gouvêa Buratto +5 more
wiley +1 more source
Multi-output learning for systematic missing value imputation in DNA methylation arrays. [PDF]
Ma T +5 more
europepmc +1 more source
AI‐Driven Cancer Multi‐Omics: A Review From the Data Pipeline Perspective
The exponential growth of cancer multi‐omics data brings opportunities and challenges for precision oncology. This review systematically examines AI's role in addressing these challenges, covering generative models, integration architectures, Explainable AI for clinical trust, clinical applications, and key directions for clinical translation.
Shilong Liu, Shunxiang Li, Kun Qian
wiley +1 more source
Benchmarking imputation strategies for missing time-series data in critical care using real-world-inspired scenarios. [PDF]
Poette M +5 more
europepmc +1 more source
This study provides an introduction to Bayesian optimisation targeted for experimentalists. It explains core concepts, surrogate modelling, and acquisition strategies, and addresses common real‐world challenges such as noise, constraints, mixed variables, scalability, and automation.
Chuan He +2 more
wiley +1 more source

