Results 161 to 170 of about 518,705 (322)
Robust Dysarthric Speech Recognition with GAN Enhancement and LLM Correction
This study tackles dysarthric speech recognition by combining generative adversarial network (GAN)‐generated synthetic data with large language model (LLM)‐based error correction. The approach integrates three key elements: an improved CycleGAN to generate synthetic dysarthric speech for data augmentation, a multimodal automatic speech recognition core
Yibo He +3 more
wiley +1 more source
VISTA: A Visual Analytics Framework to Enhance Foundation Model-Generated Data Labels. [PDF]
Xuan X +6 more
europepmc +1 more source
Feature Disentangling and Combination Implemented by Spin–Orbit Torque Magnetic Tunnel Junctions
Spin–orbit torque magnetic tunnel junctions (SOT‐MTJs) enable efficient feature disentangling and integration in image data. A proposed algorithm leverages SOT‐MTJs as true random number generators to disentangle and recombine features in real time, with experimental validation on emoji and facial datasets.
Xiaohan Li +15 more
wiley +1 more source
Research on group type theory and its functorial semantic models in category logic. [PDF]
Tang JG, Aishan Y, Liu JY, Peng JY.
europepmc +1 more source
: In this work, Voxel‐SLAM (simultaneous localization and mapping) is introduced: a complete, accurate, and versatile LiDAR (light detection and ranging) ‐inertial SLAM system consisting of five modules: initialization, odometry, local mapping (LM), loop closure (LC), and global mapping (GM), all employing the same map representation, an adaptive voxel
Zheng Liu +9 more
wiley +1 more source
Enhancing trust in news media: A multimodality approach to detecting fake news with social constructs. [PDF]
Ali I +4 more
europepmc +1 more source
Feature from recent image foundation models (DINOv2) are useful for vision tasks (segmentation, object localization) with little or no human input. Once upsampled, they can be used for weakly supervised micrograph segmentation, achieving strong results when compared to classical features (blurs, edge detection) across a range of material systems.
Ronan Docherty +2 more
wiley +1 more source
Leveraging large language models and embedding representations for enhanced word similarity computation. [PDF]
Peng X, Jiang H, Chen J, Liu M, Chen X.
europepmc +1 more source
This paper presents an integrated AI‐driven cardiovascular platform unifying multimodal data, predictive analytics, and real‐time monitoring. It demonstrates how artificial intelligence—from deep learning to federated learning—enables early diagnosis, precision treatment, and personalized rehabilitation across the full disease lifecycle, promoting a ...
Mowei Kong +4 more
wiley +1 more source

