This study reveals that sampling strategy (i.e., sampling size and approach) is a foundational prerequisite for building accurate and generalizable AI models in peptide discovery. Reaching a threshold of 7.5% of the total tetrapeptide sequence space was essential to ensure reliable predictions.
Meiru Yan +3 more
wiley +1 more source
Combining psychoanalytic concepts and computer science methodologies: an empirical study of the relationship between emotions and the Lacanian discourses. [PDF]
Gadalla M, Nikoletseas S, Amazonas JRA.
europepmc +1 more source
LLM‐Based Scientific Assistants for Knowledge Extraction: Which Design Choices Matter?
A comprehensive framework for optimizing Large Language Models in domain‐specific applications is introduced. The LLM Playground integrates Prompt Engineering, knowledge augmentation, and advanced reasoning strategies to enable systematic comparison of architectures and base models.
David Exler +7 more
wiley +1 more source
Sex-Specific Cardiometabolic Phenotypes of Metabolic Syndrome Identified by Latent Class Analysis in Indian Adults. [PDF]
Sheth ND, Shaker IA, Ranade J.
europepmc +1 more source
Predictive models successfully screen nanoparticles for toxicity and cellular uptake. Yet, complex biological dynamics and sparse, nonstandardized data limit their accuracy. The field urgently needs integrated artificial intelligence/machine learning, systems biology, and open‐access data protocols to bridge the gap between materials science and safe ...
Mariya L. Ivanova +4 more
wiley +1 more source
An improved differential evolution algorithm based on reinforcement learning and its application. [PDF]
Yang G, Sun P, Zhang J, Zhang Y, Li T.
europepmc +1 more source
Majority‐Voting Overlapping Method for Error Correction in DNA Data Storage
We propose an overlapping‐based majority‐voting method for DNA data storage error correction. By aligning multiple reads and choosing the most frequent base per position, it suppresses substitution errors without prior models. Validated on synthetic and real sequencing data, it achieves high‐fidelity, scalable, and cost‐effective reconstruction ...
Thi Bich Ngoc Nguyen +5 more
wiley +1 more source
Risk factors analysis and nomogram model construction of refeeding syndrome after esophageal cancer surgery. [PDF]
Shi Y +8 more
europepmc +1 more source
Composition‐Aware Cross‐Sectional Integration for Spatial Transcriptomics
Multi‐section spatial transcriptomics demands coherent cell‐type deconvolution, domain detection, and batch correction, yet existing pipelines treat these tasks separately. FUSION unifies them within a composition‐aware latent framework, modeling reads as cell‐type–specific topics and clustering in embedding space.
Qishi Dong +5 more
wiley +1 more source
Genetic genealogy of the Piast dynasty and related European royal families. [PDF]
Zenczak M +16 more
europepmc +1 more source

