Results 1 to 10 of about 1,043,346 (376)
Seeing What You Said: Talking Face Generation Guided by a Lip Reading Expert [PDF]
Talking face generation, also known as speech-to-lip generation, reconstructs facial motions concerning lips given coherent speech input. The previous studies revealed the importance of lip-speech synchronization and visual quality. Despite much progress,
Jiadong Wang +4 more
semanticscholar +1 more source
SyncTalkFace: Talking Face Generation with Precise Lip-Syncing via Audio-Lip Memory [PDF]
The challenge of talking face generation from speech lies in aligning two different modal information, audio and video, such that the mouth region corresponds to input audio.
Se Jin Park +4 more
semanticscholar +1 more source
A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild [PDF]
In this work, we investigate the problem of lip-syncing a talking face video of an arbitrary identity to match a target speech segment. Current works excel at producing accurate lip movements on a static image or videos of specific people seen during the
Prajwal K R +3 more
semanticscholar +1 more source
Distinguishing Homophenes Using Multi-Head Visual-Audio Memory for Lip Reading [PDF]
Recognizing speech from silent lip movement, which is called lip reading, is a challenging task due to 1) the inherent information insufficiency of lip movement to fully represent the speech, and 2) the existence of homophenes that have similar lip ...
Minsu Kim, Jeong Hun Yeo, Yong Man Ro
semanticscholar +1 more source
Sub-word Level Lip Reading With Visual Attention [PDF]
The goal of this paper is to learn strong lip reading models that can recognise speech in silent videos. Most prior works deal with the open-set visual speech recognition problem by adapting existing automatic speech recognition techniques on top of ...
Prajwal K R +2 more
semanticscholar +1 more source
Big and little Lipschitz one sets [PDF]
Given a continuous function $f: {{\mathbb R}}\to {{\mathbb R}}$ we denote the so-called "big Lip" and "little lip" functions by $ {{\mathrm {Lip}}} f$ and $ {{\mathrm {lip}}} f$ respectively}.
Buczolich, Zoltán +3 more
core +2 more sources
Decoding lip language using triboelectric sensors with deep learning
Lip language is an effective method of voice-off communication in daily life for people with vocal cord lesions and laryngeal and lingual injuries without occupying the hands. Collection and interpretation of lip language is challenging. Here, we propose
Yijia Lu +7 more
semanticscholar +1 more source
3-D Underactuated Bipedal Walking via H-LIP Based Gait Synthesis and Stepping Stabilization [PDF]
In this article, we holistically present a hybrid-linear inverted pendulum (H-LIP) based approach for synthesizing and stabilizing 3-D foot-underactuated bipedal walking, with an emphasis on thorough hardware realization. The H-LIP is proposed to capture
Xiaobin Xiong, A. Ames
semanticscholar +1 more source
Use of a generalized energy Mover’s distance in the search for rare phenomena at colliders
In this paper, we expand on the previously proposed concept of energy Mover’s distance. The resulting observables are shown to provide a way of identifying rare processes in proton–proton collider experiments.
M. Crispim Romão +4 more
doaj +1 more source
Novel methods to reconstruct the slant depth of the maximum of the longitudinal profile ( $$X_{\mathrm{max}}$$ X max ) of high-energy showers initiated by gamma-rays as well as their energy ( $$E_0$$ E 0 ) are presented.
R. Conceição +3 more
doaj +1 more source

