A Primer of Data Cleaning in Quantitative Research: Handling Missing Values and Outliers. [PDF]
Sharifnia AM +3 more
europepmc +1 more source
Multimodal Data‐Driven Microstructure Characterization
A self‐consistent autonomous workflow for EBSP‐based microstructure segmentation by integrating PCA, GMM clustering, and cNMF with information‐theoretic parameter selection, requiring no user input. An optimal ROI size related to characteristic grain size is identified.
Qi Zhang +4 more
wiley +1 more source
The effects of mismatched train and test data cleaning pipelines on regression models: lessons for practice. [PDF]
Nevin J, Lees M, Groth P.
europepmc +1 more source
Is it time to stop sweeping data cleaning under the carpet? A novel algorithm for outlier management in growth data. [PDF]
Woolley CSC +4 more
europepmc +1 more source
Tailoring Functional Properties of Ti–Ni–Cu Shape Memory Alloy Thin Films for MEMS Actuators
A comprehensive study of critical parameters required to develop well‐performing Ti–Ni–Cu thin film shape memory alloy microactuators is provided. Materials science and device integration aspects are integrated by addressing structural and physical relationships using complementary characterization techniques as well as a practical fabrication solution
Elaheh Akbarnejad +6 more
wiley +1 more source
Reliability-enhanced data cleaning in biomedical machine learning using inductive conformal prediction. [PDF]
Zhan X, Xu Q, Zheng Y, Lu G, Gevaert O.
europepmc +1 more source
PASTA‐ELN: Simplifying Research Data Management for Experimental Materials Science
Research data management faces ongoing hurdles as many ELNs remain complex and restrictive. PASTA‐ELN offers an open‐source, cross‐platform solution that prioritizes simplicity, offline access, and user control. Its in tuitive folder structure, modular Python add‐ons, and open formats enable seamless documentation, FAIR data practices, and easy ...
S. Brinckmann, G. Winkens, R. Schwaiger
wiley +1 more source
Data cleaning and enrichment through data integration: networking the Italian academia. [PDF]
Finocchi I +3 more
europepmc +1 more source
Automated data cleaning of paediatric anthropometric data from longitudinal electronic health records: protocol and application to a large patient cohort. [PDF]
Phan HTT +5 more
europepmc +1 more source

