Results 11 to 20 of about 6,892,691 (254)

Joint Alignment of Image Faces [PDF]

open access: yesIEEE Access, 2020
Researches on face alignment have made great progress, which benefits from the use of prior information and auxiliary models. However, that information lacks in a single face image has always affected the further development of these researches.
Gang Zhang   +4 more
doaj   +2 more sources

What You See is What You Read? Improving Text-Image Alignment Evaluation [PDF]

open access: yesNeural Information Processing Systems, 2023
Automatically determining whether a text and a corresponding image are semantically aligned is a significant challenge for vision-language models, with applications in generative text-to-image and image-to-text tasks.
Michal Yarom   +7 more
semanticscholar   +1 more source

Text-Image Alignment for Diffusion-Based Perception [PDF]

open access: yesComputer Vision and Pattern Recognition, 2023
Diffusion models are generative models with impressive text-to-image synthesis capabilities and have spurred a new wave of creative methods for classical machine learning tasks.
Neehar Kondapaneni   +4 more
semanticscholar   +1 more source

Divide, Evaluate, and Refine: Evaluating and Improving Text-to-Image Alignment with Iterative VQA Feedback [PDF]

open access: yesNeural Information Processing Systems, 2023
The field of text-conditioned image generation has made unparalleled progress with the recent advent of latent diffusion models. While remarkable, as the complexity of given text input increases, the state-of-the-art diffusion models may still fail in ...
Jaskirat Singh, Liang Zheng
semanticscholar   +1 more source

Deep Lucas-Kanade Homography for Multimodal Image Alignment [PDF]

open access: yesComputer Vision and Pattern Recognition, 2021
Estimating homography to align image pairs captured by different sensors or image pairs with large appearance changes is an important and general challenge for many computer vision applications.
Yiming Zhao, Xinming Huang, Ziming Zhang
semanticscholar   +1 more source

Deep Learning Strategy for Braille Character Recognition

open access: yesIEEE Access, 2021
People with vision impairment use Braille language for reading, writing, and communication. The basic structure of the Braille language consists of six dots arranged in three rows and two column cells, which are identified by visually impaired people ...
Tasleem Kausar   +5 more
doaj   +1 more source

An Improved SIFT Underwater Image Stitching Method

open access: yesApplied Sciences, 2023
Underwater image stitching is a technique employed to seamlessly merge images with overlapping regions, creating a coherent underwater panorama. In recent years, extensive research efforts have been devoted to advancing image stitching methodologies for ...
Haosu Zhang   +4 more
doaj   +1 more source

Accurate Matching of Invariant Features Derived from Irregular Curves

open access: yesRemote Sensing, 2022
High-quality feature matching is a critical prerequisite in a wide range of applications. Most contemporary methods concentrate on detecting keypoints or line features for matching, which have achieved adequate results.
Huajun Liu   +5 more
doaj   +1 more source

A Generic, Multimodal Geospatial Data Alignment System for Aerial Navigation

open access: yesRemote Sensing, 2023
We present a template matching algorithm based on local descriptors for aligning two geospatial products of different modalities with a large area asymmetry.
Victor Martin-Lac   +2 more
doaj   +1 more source

2D/3D Multimode Medical Image Alignment Based on Spatial Histograms

open access: yesApplied Sciences, 2022
The key to image-guided surgery (IGS) technology is to find the transformation relationship between preoperative 3D images and intraoperative 2D images, namely, 2D/3D image registration.
Yuxi Ban   +6 more
semanticscholar   +1 more source

Home - About - Disclaimer - Privacy