Multimodal text - Open Access .click

Results 231 to 240 of about 96,015 (310)

Human‐in‐the‐Loop Object Segmentation for 3D Gaussian Splatting via Finger‐based VR Interface

Advanced Intelligent Systems, EarlyView.
This study introduces a human‐in‐the‐loop segmentation framework for 3D Gaussian Splatting that integrates real‐time optimization with intuitive VR‐based finger prompting. Compared with existing automatic, learning‐based methods, it achieves significantly higher accuracy and reduced segmentation time.
Yongseok Lee +5 more
wiley +1 more source

Graph attention network-based multimodal approach for lung diseases classification. [PDF]

Sci Rep
Rahman M, YongZhong C, Bin L.
europepmc +1 more source

An Attention‐Assisted Machine Learning System for Deep Microorganism Image Classification

Advanced Intelligent Systems, EarlyView.
An attention‐assisted DenseNet201 framework was developed for the classification of eight microorganism classes from microscopic images. The proposed model improved classification performance and achieved an accuracy of 87.38%. Advances in microbiology and environmental health fundamentally depend on precise and timely microorganism identification ...
Yujie Li +6 more
wiley +1 more source

The Power of Multimodality in Multimodal Large Language Models, Unimodal ChatGPT 5.0, and Human Clinical Experts on a Wound Care Certification Examination: Cross-Sectional Comparative Study. [PDF]

JMIR Form Res
Ucdal M +5 more
europepmc +1 more source

Lidar‐Based Object Tracking of Traffic Participants with Sensor Nodes in Existing Urban Infrastructure

Advanced Intelligent Systems, EarlyView.
This paper presents a lidar‐based sensor node design and a rule‐based state observer for edge‐based traffic participant tracking. Unlike other state‐of‐the‐art methods, this state observer enables real‐time, CPU‐only edge processing without relying on machine learning approaches.
Simon Schäfer, Bassam Alrifaee, Ehsan Hashemi +2 more
wiley +1 more source

Artificial Intelligence in Medical Assessment: Reliability and Performance of Multimodal Large Language Models in a High-Stakes Licensing Examination. [PDF]

Behav Sci (Basel)
Güler I +6 more
europepmc +1 more source

NIRGB‐GS: Near‐Infrared Assisted Low‐Light Scene Reconstruction and Enhancement via Gaussian Splatting

Advanced Intelligent Systems, EarlyView.
This article proposes NIRGB‐GS, a multimodal 3DGS variant that enables reliable 3D reconstruction and normal‐light novel‐view synthesis for extremely low‐light scenes by fusing paired near‐infrared and noisy RGB captures. High‐SNR near‐infrared modality and modality‐specific appearance encoding together resolve the issues of unstable pose/geometry ...
Chengyun Yang, Yi Zhang, Yu Lei, Qiaofeng Li +3 more
wiley +1 more source

Diagnostic Accuracy of GPT-4 With Vision in Neuroradiology Board-Style Exam Questions: Cross-Sectional Case-Based Study. [PDF]

JMIR Neurotechnol
Sussan TT +4 more
europepmc +1 more source

Retinal Vessel Segmentation: A Comprehensive Review From Classical Methods to Deep Learning Advances (1982–2025)

Advanced Intelligent Systems, EarlyView.
Four decades of retinal vessel segmentation research (1982–2025) are synthesized, spanning classical image processing, machine learning, and deep learning paradigms. A meta‐analysis of 428 studies establishes a unified taxonomy and highlights performance trends, generalization capabilities, and clinical relevance.
Avinash Bansal +6 more
wiley +1 more source

Predictive analysis of student engagement in university physical education courses based on a multimodal transformer algorithm. [PDF]

Sci Rep
Li J.
europepmc +1 more source

4. education
fos: computer and information sciences
computer vision and pattern recognition cs.cv

multimodality
10. no inequality
literacy

attention mechanism
artificial intelligence cs.ai
computation and language cs.cl