DR-CLIP: A Deformable Vision-Language Model for Scale-Invariant Object Counting in Remote Sensing Images.
Nie J, Liu Q, Li T, Lu X, Zhang L.
europepmc
Overview of the proposed Gate‐Align‐SED, which is trained in two stages: (1) Mean‐Teacher SSL training and (2) enhancer model training. In complex real‐world environments such as disaster monitoring, effective sound event detection (SED) is often hindered by noise and limited labeled data.
Jieli Chen +4 more
wiley
River extraction from high-resolution remote sensing images based on non-uniform sampling and semi-supervised learning.
Wang K, Han L, Li L.
europepmc
Visual features, numerical descriptors, and controlled textual attributes extracted from smartphone images of Chenpi are integrated by VALIANT, a tailored multimodal framework for simultaneous storage‐age classification and authenticity verification. The workflow distinguishes genuine products from suspicious standard operating procedure mimics while ...
Simon C. K. Chan +5 more
wiley
Lightweight model LMW-YOLO for small object detection in remote sensing images.
Qiu Y, Lin Z.
europepmc
Off-Road Autonomous Vehicle Semantic Segmentation and Spatial Overlay Video Assembly.
Dror I, Aviv O, Hadar O.
europepmc
Adaptive graph signal processing for robust multimodal fusion with dynamic semantic alignment.
Karthikeya KV +4 more
europepmc
Vision-Controlled autonomous navigation in unstructured environments: Integrating image processing, path planning, and trajectory control in robotic systems.
Wang P, Yu H, Wang S.
europepmc
Diamond-DETR: lightweight real-time quality evaluation algorithm for synthetic diamonds.
Yan X, Yang S, Zhang S, Li X, Li A.
europepmc