Human‐in‐the‐Loop Object Segmentation for 3D Gaussian Splatting via Finger‐based VR Interface
This study introduces a human‐in‐the‐loop segmentation framework for 3D Gaussian Splatting that integrates real‐time optimization with intuitive VR‐based finger prompting. Compared with existing automatic, learning‐based methods, it achieves significantly higher accuracy and reduced segmentation time.
Yongseok Lee +5 more
wiley +1 more source
Graph attention network-based multimodal approach for lung diseases classification.
Rahman M, YongZhong C, Bin L.
europepmc +1 more source
An Attention‐Assisted Machine Learning System for Deep Microorganism Image Classification
An attention‐assisted DenseNet201 framework was developed for the classification of eight microorganism classes from microscopic images. The proposed model improved classification performance and achieved an accuracy of 87.38%. Advances in microbiology and environmental health fundamentally depend on precise and timely microorganism identification ...
Yujie Li +6 more
wiley +1 more source
The Power of Multimodality in Multimodal Large Language Models, Unimodal ChatGPT 5.0, and Human Clinical Experts on a Wound Care Certification Examination: Cross-Sectional Comparative Study.
Ucdal M +5 more
europepmc +1 more source
This paper presents a lidar‐based sensor node design and a rule‐based state observer for edge‐based traffic participant tracking. Unlike other state‐of‐the‐art methods, this state observer enables real‐time, CPU‐only edge processing without relying on machine learning approaches.
Simon Schäfer +2 more
wiley +1 more source
Artificial Intelligence in Medical Assessment: Reliability and Performance of Multimodal Large Language Models in a High-Stakes Licensing Examination.
Güler I +6 more
europepmc +1 more source
This article proposes NIRGB‐GS, a multimodal 3DGS variant that enables reliable 3D reconstruction and normal‐light novel‐view synthesis for extremely low‐light scenes by fusing paired near‐infrared and noisy RGB captures. High‐SNR near‐infrared modality and modality‐specific appearance encoding together resolve the issues of unstable pose/geometry ...
Chengyun Yang +3 more
wiley +1 more source
Diagnostic Accuracy of GPT-4 With Vision in Neuroradiology Board-Style Exam Questions: Cross-Sectional Case-Based Study.
Sussan TT +4 more
europepmc +1 more source
Four decades of retinal vessel segmentation research (1982–2025) are synthesized, spanning classical image processing, machine learning, and deep learning paradigms. A meta‐analysis of 428 studies establishes a unified taxonomy and highlights performance trends, generalization capabilities, and clinical relevance.
Avinash Bansal +6 more
wiley +1 more source
Predictive analysis of student engagement in university physical education courses based on a multimodal transformer algorithm.
Li J.
europepmc +1 more source

