Center-based 3D Object Detection and Tracking [PDF]
Three-dimensional objects are commonly represented as 3D boxes in a point-cloud. This representation mimics the well-studied image-based 2D bounding-box detection but comes with additional challenges.
Tianwei Yin+2 more
semanticscholar +1 more source
Exploring Plain Vision Transformer Backbones for Object Detection [PDF]
We explore the plain, non-hierarchical Vision Transformer (ViT) as a backbone network for object detection. This design enables the original ViT architecture to be fine-tuned for object detection without needing to redesign a hierarchical backbone for ...
Yanghao Li+3 more
semanticscholar +1 more source
RecFRCN: Few-Shot Object Detection With Recalibrated Faster R-CNN
Currently, Faster R-CNN serves as the fundamental detection framework in the majority of few-shot object detection algorithms. However, due to limited samples per class, the Faster R-CNN’s classification branch faces limitations in capturing ...
Youyou Zhang, Tongwei Lu
doaj +1 more source
TransFusion: Robust LiDAR-Camera Fusion for 3D Object Detection with Transformers [PDF]
LiDAR and camera are two important sensors for 3D object detection in autonomous driving. Despite the increasing popularity of sensor fusion in this field, the robustness against inferior image conditions, e.g., bad illumination and sensor misalignment ...
Xuyang Bai+6 more
semanticscholar +1 more source
Focus-and-Detect: A Small Object Detection Framework for Aerial Images [PDF]
Despite recent advances, object detection in aerial images is still a challenging task. Specific problems in aerial images makes the detection problem harder, such as small objects, densely packed objects, objects in different sizes and with different orientations.
arxiv +1 more source
3D Object Class Detection in the Wild [PDF]
Object class detection has been a synonym for 2D bounding box localization for the longest time, fueled by the success of powerful statistical learning techniques, combined with robust image representations.
Gehler, Peter+4 more
core +1 more source
Target-aware Dual Adversarial Learning and a Multi-scenario Multi-Modality Benchmark to Fuse Infrared and Visible for Object Detection [PDF]
This study addresses the issue of fusing infrared and visible images that appear differently for object detection. Aiming at generating an image of high visual quality, previous approaches discover commons underlying the two modalities and fuse upon the ...
Jinyuan Liu+6 more
semanticscholar +1 more source
Efficient and Scalable Object Localization in 3D on Mobile Device
Two-Dimensional (2D) object detection has been an intensely discussed and researched field of computer vision. With numerous advancements made in the field over the years, we still need to identify a robust approach to efficiently conduct classification ...
Neetika Gupta, Naimul Mefraz Khan
doaj +1 more source
Review of Deep Learning Applied to Occluded Object Detection [PDF]
Occluded object detection has long been a difficulty and hot topic in the field of computer vision. Based on convolutional neural network, the deep learning takes the object detection task as a classification and regression task to handle, and obtains ...
SUN Fangwei, LI Chengyang, XIE Yongqiang, LI Zhongbo, YANG Caidong, QI Jin
doaj +1 more source
VoxelNet: End-to-End Learning for Point Cloud Based 3D Object Detection [PDF]
Accurate detection of objects in 3D point clouds is a central problem in many applications, such as autonomous navigation, housekeeping robots, and augmented/virtual reality.
Yin Zhou, Oncel Tuzel
semanticscholar +1 more source