Results 71 to 80 of about 21,157 (187)

Recurrent Scene Parsing with Perspective Understanding in the Loop

open access: yes, 2017
Objects may appear at arbitrary scales in perspective images of a scene, posing a challenge for recognition systems that process images at a fixed resolution.
Fowlkes, Charless, Kong, Shu
core   +1 more source

DEVIANT: Depth EquiVarIAnt NeTwork for Monocular 3D Object Detection

open access: yes, 2022
Modern neural networks use building blocks such as convolutions that are equivariant to arbitrary 2D translations. However, these vanilla blocks are not equivariant to arbitrary 3D translations in the projective manifold. Even then, all monocular 3D detectors use vanilla blocks to obtain the 3D coordinates, a task for which the vanilla blocks are not ...
Kumar, Abhinav   +4 more
openaire   +2 more sources

Why Autonomous Vehicles Are Not Ready Yet: A Multi‐Disciplinary Review of Problems, Attempted Solutions, and Future Directions

open access: yesJournal of Field Robotics, EarlyView.
ABSTRACT Personal autonomous vehicles can sense their surrounding environment, plan their route, and drive with little or no involvement of human drivers. Despite the latest technological advancements and the hopeful announcements made by leading entrepreneurs, to date no personal vehicle is approved for road circulation in a “fully” or “semi ...
Xingshuai Dong   +13 more
wiley   +1 more source

MonoDFM: Density Field Modeling-Based End-to-End Monocular 3D Object Detection

open access: yesIEEE Access
Monocular 3D object detection aims to infer the 3D properties of objects from a single RGB image. Existing methods primarily rely on planar features to estimate 3D information directly.
Gang Liu, Xinrui Huang, Xiaoxiao Xie
doaj   +1 more source

Capturing Hands in Action using Discriminative Salient Points and Physics Simulation

open access: yes, 2016
Hand motion capture is a popular research field, recently gaining more attention due to the ubiquity of RGB-D sensors. However, even most recent approaches focus on the case of a single isolated hand.
Aponte, Pablo   +5 more
core   +1 more source

Survey on AI‐Enabled Computer Vision Technologies and Applications for Space Robotic Missions

open access: yesJournal of Field Robotics, EarlyView.
ABSTRACT This survey provides a comprehensive overview of recent advancements and challenges in Artificial Intelligence (AI)‐enabled computer vision (CV) techniques for space robotic missions, spanning critical phases such as Entry, Descent, and Landing (EDL), orbital operations, and planetary surface exploration.
Maciej Quoos   +6 more
wiley   +1 more source

MonoDI:Monocular 3D Object Detection Based on Fusing Depth Instances

open access: yesShuju Caiji Yu Chuli
Monocular 3D object detection aims to locate the 3D bounding boxes of objects in a single 2D input image, which is an extremely challenging task in the absence of image depth information.
ZHAO Ke, DONG Haoran, YE Ning
doaj   +1 more source

Rotatable Lens‐Based 3D Reconstruction Method for Monocular Vision Systems

open access: yesSmartBot, EarlyView.
Accurate 3D reconstruction in confined environments remains challenging because conventional vision‐based measurement systems are difficult to maintain compactness without sacrificing accuracy. Here, the authors develop a rotatable lens‐based monocular vision system, which is the first time using the refraction of the light induced by a rotating wedge ...
Puchen Zhu   +11 more
wiley   +1 more source

SCPM: monocular 3D object detection with spatiotemporal consistent pseudo-labels module

open access: yesComplex & Intelligent Systems
Monocular 3D object detection models have become increasingly popular due to its low cost and ease of deployment. It remains challenging because of limited depth estimation and dataset imbalance.
Yujing Wang   +2 more
doaj   +1 more source

Geometry-Based Next Frame Prediction from Monocular Video

open access: yes, 2017
We consider the problem of next frame prediction from video input. A recurrent convolutional neural network is trained to predict depth from monocular video input, which, along with the current video image and the camera trajectory, can then be used to ...
Angelova, Anelia   +2 more
core   +1 more source

Home - About - Disclaimer - Privacy