Results 11 to 20 of about 952,817 (347)

Seeing What You Said: Talking Face Generation Guided by a Lip Reading Expert [PDF]

open access: yesarXiv, 2023
Talking face generation, also known as speech-to-lip generation, reconstructs facial motions concerning lips given coherent speech input. The previous studies revealed the importance of lip-speech synchronization and visual quality. Despite much progress,
Jiadong Wang   +4 more
semanticscholar   +2 more sources

Lipschitz spaces and M-ideals [PDF]

open access: yesExtracta Math. 18, no.1, 33-56 (2003), 2002
For a metric space $(K,d)$ the Banach space $\Lip(K)$ consists of all scalar-valued bounded Lipschitz functions on $K$ with the norm $\|f\|_{L}=\max(\|f\|_{\infty},L(f))$, where $L(f)$ is the Lipschitz constant of $f$.
Berninger, Heiko, Werner, Dirk
core   +3 more sources

A Cascade Sequence-to-Sequence Model for Chinese Mandarin Lip Reading [PDF]

open access: yesarXiv, 2019
Lip reading aims at decoding texts from the movement of a speaker's mouth. In recent years, lip reading methods have made great progress for English, at both word-level and sentence-level.
Ya Zhao, Rui Xu, Mingli Song
semanticscholar   +2 more sources

Lip2AudSpec: Speech reconstruction from silent lip movements video [PDF]

open access: yesIEEE International Conference on Acoustics, Speech, and Signal Processing, 2017
In this study, we propose a deep neural network for reconstructing intelligible speech from silent lip movement videos. We use auditory spectrogram as spectral representation of speech and its corresponding sound generation method resulting in a more ...
Akbari, Hassan   +3 more
core   +2 more sources

SyncTalkFace: Talking Face Generation with Precise Lip-Syncing via Audio-Lip Memory [PDF]

open access: yesAAAI Conference on Artificial Intelligence, 2022
The challenge of talking face generation from speech lies in aligning two different modal information, audio and video, such that the mouth region corresponds to input audio.
Se Jin Park   +4 more
semanticscholar   +1 more source

Distinguishing Homophenes Using Multi-Head Visual-Audio Memory for Lip Reading [PDF]

open access: yesAAAI Conference on Artificial Intelligence, 2022
Recognizing speech from silent lip movement, which is called lip reading, is a challenging task due to 1) the inherent information insufficiency of lip movement to fully represent the speech, and 2) the existence of homophenes that have similar lip ...
Minsu Kim, Jeong Hun Yeo, Yong Man Ro
semanticscholar   +1 more source

A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild [PDF]

open access: yesACM Multimedia, 2020
In this work, we investigate the problem of lip-syncing a talking face video of an arbitrary identity to match a target speech segment. Current works excel at producing accurate lip movements on a static image or videos of specific people seen during the
Prajwal K R   +3 more
semanticscholar   +1 more source

Sub-word Level Lip Reading With Visual Attention [PDF]

open access: yesComputer Vision and Pattern Recognition, 2021
The goal of this paper is to learn strong lip reading models that can recognise speech in silent videos. Most prior works deal with the open-set visual speech recognition problem by adapting existing automatic speech recognition techniques on top of ...
Prajwal K R   +2 more
semanticscholar   +1 more source

A Case of Pyoderma Gangrenosum on the Lip [PDF]

open access: yesClinical Case Reports
Pyoderma gangrenosum should be considered in the differential diagnosis of ulcerative lip lesions in children. Long‐term management may require low‐dose oral steroids.
Mari Nakanishi   +2 more
doaj   +2 more sources

3-D Underactuated Bipedal Walking via H-LIP Based Gait Synthesis and Stepping Stabilization [PDF]

open access: yesIEEE Transactions on robotics, 2021
In this article, we holistically present a hybrid-linear inverted pendulum (H-LIP) based approach for synthesizing and stabilizing 3-D foot-underactuated bipedal walking, with an emphasis on thorough hardware realization. The H-LIP is proposed to capture
Xiaobin Xiong, A. Ames
semanticscholar   +1 more source

Home - About - Disclaimer - Privacy