Results 171 to 180 of about 237,519 (286)

BMPCQA: Bioinspired Metaverse Point Cloud Quality Assessment Based on Large Multimodal Models

open access: yesAdvanced Intelligent Systems, EarlyView.
This study presents a bioinspired metaverse point cloud quality assessment metric, which simulates the human visual evaluation process to perform the point cloud quality assessment task. It first extracts rendering projection video features, normal image features, and point cloud patch features, which are then fed into a large multimodal model to ...
Huiyu Duan   +7 more
wiley   +1 more source

Multitarget Recognition of Flower Images Based on Lightweight Deep Neural Network and Transfer Learning

open access: yesAdvanced Intelligent Systems, EarlyView.
This article proposes a lightweight YOLOv4‐based detection model using MobileNetV3 or CSPDarknet53_tiny, achieving 30+ FPS and higher mAP. It also presents a ShuffleNet‐based classification model with transfer learning and GAN‐augmented images, improving generalization and accuracy.
Qingyang Liu, Yanrong Hu, Hongjiu Liu
wiley   +1 more source

Fine-grained multiclass nuclei segmentation with molecular empowered all-in-SAM model. [PDF]

open access: yesJ Med Imaging (Bellingham)
Li X   +9 more
europepmc   +1 more source

Hierarchical Language Models for Semantic Navigation and Manipulation in an Aerial‐Ground Robotic System

open access: yesAdvanced Intelligent Systems, EarlyView.
A hierarchical multimodal framework coupling a large language model for task decomposition and semantic mapping with a fine‐tuned vision‐language model for semantic perception, enhanced by GridMask, is presented. An aerial‐ground robot team exploits the semantic map for global and local planning.
Haokun Liu   +6 more
wiley   +1 more source

Robust Dysarthric Speech Recognition with GAN Enhancement and LLM Correction

open access: yesAdvanced Intelligent Systems, EarlyView.
This study tackles dysarthric speech recognition by combining generative adversarial network (GAN)‐generated synthetic data with large language model (LLM)‐based error correction. The approach integrates three key elements: an improved CycleGAN to generate synthetic dysarthric speech for data augmentation, a multimodal automatic speech recognition core
Yibo He   +3 more
wiley   +1 more source

Speech and Language Disorders Associated With 7q31 Deletions Implicating FOXP2

open access: yesAmerican Journal of Medical Genetics Part A, EarlyView.
ABSTRACT Some 7q31 deletions encompass FOXP2, a gene long associated with speech and language disorders. Intragenic pathogenic FOXP2 variants cause FOXP2‐related speech and language disorder, which has been well characterized in the literature. Conversely, the phenotype associated with 7q31 deletions is neglected.
Lottie D. Morison   +3 more
wiley   +1 more source

Semantic classification of Indonesian consumer health questions. [PDF]

open access: yesJ Biomed Semantics
Hanami RN, Mahendra R, Wicaksono AF.
europepmc   +1 more source

Home - About - Disclaimer - Privacy