Results 151 to 160 of about 286,668 (294)
6D object pose estimation is a critical component in computer vision domain. A deep bidirectional fusion network is developed, DBF‐Net, which achieves accurate 6D object pose estimation using a single RGB‐D image. DBF‐Net effectively extracts and fuses geometric and appearance features from input data.
Xuan Fan+5 more
wiley +1 more source
Waveguide‐Based Retinal Projection Near‐Eye Display with Bidirectional Eyebox Expansion
A waveguide‐type retinal projection display system with bidirectional extended eyebox is proposed. The two‐layer holographic optical elements (HOEs), with two eye reliefs of 11 and 12 mm in the axial direction, serve as the image combiner, and each layer of HOEs generates two viewpoints with a horizontal spacing of 5 mm. Based on the mechanism of human
Yujing Fu+4 more
wiley +1 more source
A thorough analysis of event‐based vision for flapping‐wing robots is presented, with emphasis on hardware integration, suitability under challenging flight conditions, and real‐time performance. Theoretical and experimental studies demonstrate how these sensors enable robust and efficient onboard perception for ornithopters, thereby advancing autonomy
Raul Tapia+5 more
wiley +1 more source
This study presents a compact freeform projection optics using dual wedge prisms for waveguide displays. Unlike previous designs that separate illumination and imaging paths, a freeform optical path integrates both. The system achieves a 32º field of view, 37.5 pixels per degree, and a total volume of less than 2 cm3.
Ximeng Wang+5 more
wiley +1 more source
Adapting Image‐Based Models for 1D Data via Spider Plot Transformation and Transfer Learning
A novel method enables the use of pretrained image‐based neural networks for complex 1D data, including Raman and mid‐infrared spectra, electrocardiograms, and mass spectrometry. 2D spider plots with false‐color fill enable transfer lerning, therefore enhancing data augmentation and model explainability across diverse spectral and time series datasets.
Azadeh Mokari+2 more
wiley +1 more source
This study presents Hi‐LabSpermTracking, a long‐duration, expert‐annotated sperm motility dataset for detection and tracking. It evaluates You Only Look Once version 8, Real‐Time Detection Transformer, and Simple Online and Realtime Tracking with a Deep Association Metric in three scenarios. The mean ensemble method improves tracking performance. Sperm
Abdulsamet Aktas+5 more
wiley +1 more source
This study investigates how gaze mode and stimulus brightness affect augmented reality‐based steady‐state visual evoked potentials (AR‐SSVEP) brain‐computer interface (BCI) performance. Experiments with 20 participants show that binocular gaze improves recognition accuracy and stimulus luminance impacts performance.
Yiting Geng+6 more
wiley +1 more source
A large language model (LLM)‐powered multimodal robotic scrub nurse control framework is proposed for enhancing human–robot interaction. Inside, the vision module gives out instrument class, location, and occlusion relationship. Speech module captures and converts user's speech commands.
Wing Yin Ng+4 more
wiley +1 more source
Haptic Perception via the Dynamics of a Flexible Body Inspired by an Ostrich's Neck
Inspired by avian anatomy, this study uses a flexible robotic neck to investigate haptic perception driven by musculoskeletal dynamics. By applying physical reservoir computing, the robot encodes external force interactions into its body dynamics, allowing effective object classification.
Kazashi Nakano+3 more
wiley +1 more source
This article proposes a lightweight YOLOv4‐based detection model using MobileNetV3 or CSPDarknet53_tiny, achieving 30+ FPS and higher mAP. It also presents a ShuffleNet‐based classification model with transfer learning and GAN‐augmented images, improving generalization and accuracy.
Qingyang Liu, Yanrong Hu, Hongjiu Liu
wiley +1 more source