Results 141 to 150 of about 226,897 (274)
A hierarchical multimodal framework coupling a large language model for task decomposition and semantic mapping with a fine‐tuned vision‐language model for semantic perception, enhanced by GridMask, is presented. An aerial‐ground robot team exploits the semantic map for global and local planning.
Haokun Liu +6 more
wiley +1 more source
This paper presents a degeneracy‐aware light detection and ranging (LiDAR)‐inertial framework that enhances LiDAR simultaneous localization and mapping performance in challenging environments. The proposed system integrates a dual‐layer robust odometry frontend with a Scan‐Context‐based loop‐closure detection backend.
Haoming Yang +4 more
wiley +1 more source
Musculoskeletal humanoids exhibit rich biomechanical properties that remain insufficiently unified in prior discussions. This article systematically categorizes muscle characteristics into five properties: redundancy, independency, anisotropy, variable moment arm, and nonlinear elasticity, and analyzes their combined effects on control.
Kento Kawaharazuka +2 more
wiley +1 more source
FTGRN introduces an LLM‐enhanced framework for gene regulatory network inference through a two‐stage workflow. It combines a Transformer‐based model, pretrained on GPT‐4 derived gene embeddings and regulatory knowledge, with a fine‐tuning stage utilizing single‐cell RNA‐seq data.
Guangzheng Weng +7 more
wiley +1 more source
Adaptive multi‐indicator contrastive predictive coding is introduced as a self‐supervised pretraining framework for multivariate EHR time series. An adaptive sliding‐window algorithm and 2D convolutional neural network encoder capture localized temporal patterns and global indicator dependencies, enabling label‐efficient disease prediction that ...
Hongxu Yuan +3 more
wiley +1 more source
Robust Dysarthric Speech Recognition with GAN Enhancement and LLM Correction
This study tackles dysarthric speech recognition by combining generative adversarial network (GAN)‐generated synthetic data with large language model (LLM)‐based error correction. The approach integrates three key elements: an improved CycleGAN to generate synthetic dysarthric speech for data augmentation, a multimodal automatic speech recognition core
Yibo He +3 more
wiley +1 more source
Research on deep learning architecture optimization method for intelligent scheduling of structural space. [PDF]
Ying W, Hui L.
europepmc +1 more source
Feature Disentangling and Combination Implemented by Spin–Orbit Torque Magnetic Tunnel Junctions
Spin–orbit torque magnetic tunnel junctions (SOT‐MTJs) enable efficient feature disentangling and integration in image data. A proposed algorithm leverages SOT‐MTJs as true random number generators to disentangle and recombine features in real time, with experimental validation on emoji and facial datasets.
Xiaohan Li +15 more
wiley +1 more source
From Speech Semantics to Brain Activity-Timescales Are Key in Their Information Transfer. [PDF]
Kumar S +5 more
europepmc +1 more source

