The Challenge of Handling Structured Missingness in Integrated Data Sources
As data integration becomes ever more prevalent, a new research question that emerges is how to handle missing values that will inevitably arise in these large‐scale integrated databases? This missingness can be described as structured missingness, encompassing scenarios involving multivariate missingness mechanisms and deterministic, nonrandom ...
James Jackson +6 more
wiley +1 more source
Subset binding enables detection of multimodal patient subgroup patterns and drug target discovery in idiopathic pulmonary fibrosis. [PDF]
Natsume-Kitatani Y +32 more
europepmc +1 more source
This study introduces a tree‐based machine learning approach to accelerate USP8 inhibitor discovery. The best‐performing model identified 100 high‐confidence repurposable compounds, half already approved or in clinical trials, and uncovered novel scaffolds not previously studied. These findings offer a solid foundation for rapid experimental follow‐up,
Yik Kwong Ng +4 more
wiley +1 more source
Statistical inference and effect measures in abstracts of major HIV and AIDS journals, 1987-2022: A systematic review. [PDF]
Stang A +4 more
europepmc +1 more source
A Generalized Framework for Data‐Efficient and Extrapolative Materials Discovery for Gas Separation
This study introduces an iterative supervised machine learning framework for metal‐organic framework (MOF) discovery. The approach identifies over 97% of the best performing candidates while using less than 10% of available data. It generalizes across diverse MOF databases and gas separation scenarios.
Varad Daoo, Jayant K. Singh
wiley +1 more source
Identifying psychosis episodes in psychiatric admission notes via rule-based methods, machine learning, and pre-trained language models. [PDF]
Hua Y +5 more
europepmc +1 more source
Beyond Online Association Rules: Governing Semantic Drift in Numerical Stream Mining
This preprint proposes a governance-oriented reframing of online numerical association rule mining. While existing approaches have achieved substantial progress in efficiency, incremental updating, and metric optimization, they typically assume semantic stability and equate statistical persistence with epistemic validity.
openaire +1 more source
This paper presents a computer vision (deep learning) pipeline integrating YOLOv8 and YOLOv9 for automated detection, segmentation, and analysis of rosette cellulose synthase complexes in freeze‐fracture electron microscopy images. The study explores curated dataset expansion for model improvement and highlights pipeline accuracy, speed ...
Siri Mudunuri +6 more
wiley +1 more source
Data‐Guided Photocatalysis: Supervised Machine Learning in Water Splitting and CO2 Conversion
This review highlights recent advances in supervised machine learning (ML) for photocatalysis, emphasizing methods to optimize photocatalyst properties and design materials for solar‐driven water splitting and CO2 reduction. Key applications, challenges, and future directions are discussed, offering a practical framework for integrating ML into the ...
Paul Rossener Regonia +1 more
wiley +1 more source
Anti-epileptic medication induced disturbed calcium-vitamin D metabolism: A behavioral analysis using association rule mining technique. [PDF]
Dabla PK +7 more
europepmc +1 more source

