Multimodal Human–Robot Interaction Using Human Pose Estimation and Local Large Language Models
A multimodal human–robot interaction framework integrates human pose estimation (HPE) and a large language model (LLM) for gesture‐ and voice‐based robot control. Speech‐to‐text (STT) enables voice command interpretation, while a safety‐aware arbitration mechanism prioritizes gesture input for rapid intervention.
Nasiru Aboki +2 more
wiley +1 more source
Pallet recognition and localization using an RGB-D camera
This article reports our research on an autonomous forklift, focusing on pallet recognition and localization using an RGB-D camera. This is a fundamental capability for unmanned storehouses, since it enables the forklift to insert the forks within ...
Junhao Xiao +3 more
doaj +1 more source
Miniaturized Magnetic Tip Design for Endoluminal Vine Robot Navigation
A magnetic tip mount is designed for a miniaturized 7 mm soft‐growing vine robot to enable wireless magnetic steering and onboard imaging, while preserving a 3 mm working channel. The internal–external ring magnets design balances magnetic attachment with low eversion pressure. Experiments demonstrate ±90° steering, 34 mm bending radius, and successful
Andrea Yanez Trujillo +4 more
wiley +1 more source
Analyzing interference between RGB-D cameras for human motion tracking [PDF]
Multi-camera RGB-D systems are becoming popular sensor setups in computer vision applications, but the cameras are prone to interfering with one another, which compromises their accuracy. This paper extends previous work on the analysis of the noise introduced
Fernandez, Mailys +4 more
core
Long‐Tea‐CLIP (Contrastive Language‐Image Pre‐training) presents a multimodal AI framework that integrates visual, metabolomic, and sensory knowledge to grade green tea across appearance, soup color, aroma, taste, and infused leaf. By combining expert‐guided modeling with CLIP‐supervised learning, the system delivers fine‐grained quality evaluation and
Yanqun Xu +9 more
wiley +1 more source
Shadow‐Calibrated Stereo Vision for Colorimetric Sweat Analysis
By establishing a mathematical model that reconstructs 3D structures through geometric features of object shadows under controlled illumination, and combining it with Convolutional Neural Network‐based 2D image analysis for volumetric calibration, this work enables highly accurate 3D morphological reconstruction.
Ting Xiao +7 more
wiley +1 more source
A Correlative SICM‐OPM Platform for Surface and Volumetric Imaging in Live Cells
A multifunctional correlative imaging platform integrating Scanning Ion Conductance Microscopy (SICM) with Oblique Plane Microscopy (OPM) enables simultaneous surface topography, mechanical mapping, and 3D volumetric fluorescence imaging in live cells.
Wenzhi Hong +13 more
wiley +1 more source
Volume-based Human Re-identification with RGB-D Cameras
This paper presents an RGB-D-based human re-identification approach using novel biometric features derived from the body's volume. Existing work based on RGB images or skeleton features has limitations for real-world robotic applications, most notably in dealing with occlusions and the orientation of the user.
Cosar S., Coppola C., Bellotto N.
openaire +2 more sources
In this work, low‐resolution infrared imaging is combined with a 28 nm FeFET IMC architecture to enable compact, energy‐efficient edge inference. MLC FeFET devices are experimentally characterized, and controlled multi‐level current accumulation is validated at crossbar array level.
Alptekin Vardar +9 more
wiley +1 more source
Indoor 3D Reconstruction of Buildings via Azure Kinect RGB-D Camera. [PDF]
Delasse C +4 more
europepmc +1 more source

