Results 11 to 20 of about 6,892,691 (254)
Joint Alignment of Image Faces [PDF]
Researches on face alignment have made great progress, which benefits from the use of prior information and auxiliary models. However, that information lacks in a single face image has always affected the further development of these researches.
Gang Zhang +4 more
doaj +2 more sources
What You See is What You Read? Improving Text-Image Alignment Evaluation [PDF]
Automatically determining whether a text and a corresponding image are semantically aligned is a significant challenge for vision-language models, with applications in generative text-to-image and image-to-text tasks.
Michal Yarom +7 more
semanticscholar +1 more source
Text-Image Alignment for Diffusion-Based Perception [PDF]
Diffusion models are generative models with impressive text-to-image synthesis capabilities and have spurred a new wave of creative methods for classical machine learning tasks.
Neehar Kondapaneni +4 more
semanticscholar +1 more source
Divide, Evaluate, and Refine: Evaluating and Improving Text-to-Image Alignment with Iterative VQA Feedback [PDF]
The field of text-conditioned image generation has made unparalleled progress with the recent advent of latent diffusion models. While remarkable, as the complexity of given text input increases, the state-of-the-art diffusion models may still fail in ...
Jaskirat Singh, Liang Zheng
semanticscholar +1 more source
Deep Lucas-Kanade Homography for Multimodal Image Alignment [PDF]
Estimating homography to align image pairs captured by different sensors or image pairs with large appearance changes is an important and general challenge for many computer vision applications.
Yiming Zhao, Xinming Huang, Ziming Zhang
semanticscholar +1 more source
Deep Learning Strategy for Braille Character Recognition
People with vision impairment use Braille language for reading, writing, and communication. The basic structure of the Braille language consists of six dots arranged in three rows and two column cells, which are identified by visually impaired people ...
Tasleem Kausar +5 more
doaj +1 more source
An Improved SIFT Underwater Image Stitching Method
Underwater image stitching is a technique employed to seamlessly merge images with overlapping regions, creating a coherent underwater panorama. In recent years, extensive research efforts have been devoted to advancing image stitching methodologies for ...
Haosu Zhang +4 more
doaj +1 more source
Accurate Matching of Invariant Features Derived from Irregular Curves
High-quality feature matching is a critical prerequisite in a wide range of applications. Most contemporary methods concentrate on detecting keypoints or line features for matching, which have achieved adequate results.
Huajun Liu +5 more
doaj +1 more source
A Generic, Multimodal Geospatial Data Alignment System for Aerial Navigation
We present a template matching algorithm based on local descriptors for aligning two geospatial products of different modalities with a large area asymmetry.
Victor Martin-Lac +2 more
doaj +1 more source
2D/3D Multimode Medical Image Alignment Based on Spatial Histograms
The key to image-guided surgery (IGS) technology is to find the transformation relationship between preoperative 3D images and intraoperative 2D images, namely, 2D/3D image registration.
Yuxi Ban +6 more
semanticscholar +1 more source

