Results 111 to 120 of about 119,156 (290)

Multimodal Human–Robot Interaction Using Human Pose Estimation and Local Large Language Models

open access: yesAdvanced Robotics Research, EarlyView.
A multimodal human–robot interaction framework integrates human pose estimation (HPE) and a large language model (LLM) for gesture‐ and voice‐based robot control. Speech‐to‐text (STT) enables voice command interpretation, while a safety‐aware arbitration mechanism prioritizes gesture input for rapid intervention.
Nasiru Aboki   +2 more
wiley   +1 more source

Pallet recognition and localization using an RGB-D camera

open access: yesInternational Journal of Advanced Robotic Systems, 2017
This article reports our research results on an autonomous forklift, with the focus on pallet recognition and localization using an RGB-D camera. It is a fundamental issue for unmanned storehouses, which enables the forklift to insert the forks within ...
Junhao Xiao   +3 more
doaj   +1 more source

Miniaturized Magnetic Tip Design for Endoluminal Vine Robot Navigation

open access: yesAdvanced Robotics Research, EarlyView.
A magnetic tip mount is designed for a miniaturized 7 mm soft‐growing vine robot to enable wireless magnetic steering and onboard imaging, while preserving a 3 mm working channel. The internal–external ring magnets design balances magnetic attachment with low eversion pressure. Experiments demonstrate ±90° steering, 34 mm bending radius, and successful
Andrea Yanez Trujillo   +4 more
wiley   +1 more source

Analyzing interference between RGB-D cameras for human motion tracking [PDF]

open access: yes, 2018
Multi-camera RGB-D systems are becoming popular as sensor setups in Computer Vision applications but they are prone to cause interference between them, compromising their accuracy. This paper extends previous works on the analysis of the noise introduced
Fernandez, Mailys   +4 more
core  

Long‐Tea‐CLIP: An Expert‐Level Multimodal AI Framework for Fine‐Grained Green Tea Grading Across Five Sensory Dimensions

open access: yesAdvanced Science, EarlyView.
Long‐Tea‐CLIP (Contrastive Language‐Image Pre‐training) presents a multimodal AI framework that integrates visual, metabolomic, and sensory knowledge to grade green tea across appearance, soup color, aroma, taste, and infused leaf. By combining expert‐guided modeling with CLIP‐supervised learning, the system delivers fine‐grained quality evaluation and
Yanqun Xu   +9 more
wiley   +1 more source

Shadow‐Calibrated Stereo Vision for Colorimetric Sweat Analysis

open access: yesAdvanced Science, EarlyView.
By establishing a mathematical model that reconstructs 3D structures through geometric features of object shadows under controlled illumination, and combining it with Convolutional Neural Network‐based 2D image analysis for volumetric calibration, this work enables highly accurate 3D morphological reconstruction.
Ting Xiao   +7 more
wiley   +1 more source

A Correlative SICM‐OPM Platform for Surface and Volumetric Imaging in Live Cells

open access: yesAdvanced Science, EarlyView.
A multifunctional correlative imaging platform integrating Scanning Ion Conductance Microscopy (SICM) with Oblique Plane Microscopy (OPM) enables simultaneous surface topography, mechanical mapping, and 3D volumetric fluorescence imaging in live cells.
Wenzhi Hong   +13 more
wiley   +1 more source

Volume-based Human Re-identification with RGB-D Cameras

open access: yesProceedings of the 12th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications, 2017
This paper presents an RGB-D based human re-identification approach using novel biometrics features from the body's volume. Existing work based on RGB images or skeleton features have some limitations for real-world robotic applications, most notably in dealing with occlusions and orientation of the user.
Cosar S., Coppola C., Bellotto N.
openaire   +2 more sources

People Counting and Positioning Using Low‐Resolution Infrared Images for FeFET‐Based In‐Memory Computing

open access: yesAdvanced Electronic Materials, EarlyView.
In this work, low‐resolution infrared imaging is combined with a 28 nm FeFET IMC architecture to enable compact, energy‐efficient edge inference. MLC FeFET devices are experimentally characterized, and controlled multi‐level current accumulation is validated at crossbar array level.
Alptekin Vardar   +9 more
wiley   +1 more source

Indoor 3D Reconstruction of Buildings via Azure Kinect RGB-D Camera. [PDF]

open access: yesSensors (Basel), 2022
Delasse C   +4 more
europepmc   +1 more source

Home - About - Disclaimer - Privacy