GeneFace: Generalized and High-Fidelity Audio-Driven 3D Talking Face Synthesis [PDF]
Generating photo-realistic video portrait with arbitrary speech audio is a crucial problem in film-making and virtual reality. Recently, several works explore the usage of neural radiance field in this task to improve 3D realness and image fidelity ...
Zhenhui Ye +5 more
semanticscholar +1 more source
Make-It-3D: High-Fidelity 3D Creation from A Single Image with Diffusion Prior [PDF]
In this work, we investigate the problem of creating high-fidelity 3D content from only a single image. This is inherently challenging: it essentially involves estimating the underlying 3D geometry while simultaneously hallucinating unseen textures.
Junshu Tang +6 more
semanticscholar +1 more source
FastNeRF: High-Fidelity Neural Rendering at 200FPS [PDF]
Recent work on Neural Radiance Fields (NeRF) showed how neural networks can be used to encode complex 3D environments that can be rendered photorealistically from novel viewpoints.
Stephan J. Garbin +4 more
semanticscholar +1 more source
Speak, Read and Prompt: High-Fidelity Text-to-Speech with Minimal Supervision [PDF]
We introduce SPEAR-TTS, a multi-speaker text-to-speech (TTS) system that can be trained with minimal supervision. By combining two types of discrete speech representations, we cast TTS as a composition of two sequence-to-sequence tasks: from text to high-
E. Kharitonov +8 more
semanticscholar +1 more source
HumanRF: High-Fidelity Neural Radiance Fields for Humans in Motion [PDF]
Representing human performance at high-fidelity is an essential building block in diverse applications, such as film production, computer games or videoconferencing. To close the gap to production-level quality, we introduce HumanRF1, a 4D dynamic neural
Mustafa Işık +6 more
semanticscholar +1 more source
HyperReel: High-Fidelity 6-DoF Video with Ray-Conditioned Sampling [PDF]
Volumetric scene representations enable photorealistic view synthesis for static scenes and form the basis of several existing 6-DoF video techniques.
Benjamin Attal +6 more
semanticscholar +1 more source
Erasure conversion in a high-fidelity Rydberg quantum simulator [PDF]
Minimizing and understanding errors is critical for quantum science, both in noisy intermediate scale quantum (NISQ) devices^ 1 and for the quest towards fault-tolerant quantum computation^ 2 , 3 .
P. Scholl +5 more
semanticscholar +1 more source
HiFi-Codec: Group-residual Vector quantization for High Fidelity Audio Codec [PDF]
Audio codec models are widely used in audio communication as a crucial technique for compressing audio into discrete representations. Nowadays, audio codec models are increasingly utilized in generation fields as intermediate representations.
Dongchao Yang +5 more
semanticscholar +1 more source
StyleSync: High-Fidelity Generalized and Personalized Lip Sync in Style-Based Generator [PDF]
Despite recent advances in syncing lip movements with any audio waves, current methods still struggle to balance generation quality and the model's generalization ability.
Jiazhi Guan +10 more
semanticscholar +1 more source
VR-NeRF: High-Fidelity Virtualized Walkable Spaces [PDF]
We present an end-to-end system for the high-fidelity capture, model reconstruction, and real-time rendering of walkable spaces in virtual reality using neural radiance fields.
Linning Xu +12 more
semanticscholar +1 more source

