Some of the following articles may not be open access.

VD3D: Taming Large Video Diffusion Transformers for 3D Camera Control

International Conference on Learning Representations
Modern text-to-video synthesis models demonstrate coherent, photorealistic generation of complex videos from a text description. However, most existing models lack fine-grained control over camera movement, which is critical for downstream applications ...
Sherwin Bahmani   +11 more
semanticscholar   +1 more source

DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion

arXiv.org
In this paper, we introduce DimensionX, a framework designed to generate photorealistic 3D and 4D scenes from just a single image with video diffusion.
Wenqiang Sun   +6 more
semanticscholar   +1 more source

CamCo: Camera-Controllable 3D-Consistent Image-to-Video Generation

arXiv.org
Recently, video diffusion models have emerged as expressive generative tools for high-quality video content creation, readily available to general users.
Dejia Xu   +6 more
semanticscholar   +1 more source

V3D: Video Diffusion Models are Effective 3D Generators

arXiv.org
Automatic 3D generation has recently attracted widespread attention. Recent methods have greatly accelerated the generation speed, but usually produce less-detailed objects due to limited model capacity or 3D data.
Zilong Chen   +4 more
semanticscholar   +1 more source

3D-color video camera

2009 IEEE 12th International Conference on Computer Vision Workshops, ICCV Workshops, 2009
We introduce a design for a coded-light-based 3D color video camera optimized for build cost as well as for depth-reconstruction accuracy and acquisition speed. The components of the system include a monochromatic camera and an off-the-shelf LED projector synchronized by a miniature circuit.
O. Rubinstein   +4 more
openaire   +1 more source

AC3D: Analyzing and Improving 3D Camera Control in Video Diffusion Transformers

Computer Vision and Pattern Recognition
Numerous works have recently integrated 3D camera control into foundational text-to-video models, but the resulting camera control is often imprecise, and video generation quality suffers.
Sherwin Bahmani   +7 more
semanticscholar   +1 more source

Capture and 3D Video Processing of Volumetric Video

International Conference on Information Photonics, 2019
Volumetric video is regarded worldwide as the next important development step in the field of media production. Especially in the context of the extremely rapid development of the Virtual Reality (VR) and Augmented Reality (AR) markets, volumetric video ...
O. Schreer   +6 more
semanticscholar   +1 more source

Video-3D LLM: Learning Position-Aware Video Representation for 3D Scene Understanding

Computer Vision and Pattern Recognition
The rapid advancement of Multimodal Large Language Models (MLLMs) has significantly impacted various multimodal tasks. However, these models face challenges in tasks that require spatial understanding within 3D environments.
Duo Zheng, Shijia Huang, Liwei Wang
semanticscholar   +1 more source

Jointly learning perceptually heterogeneous features for blind 3D video quality assessment

Neurocomputing, 2019
3D video quality assessment (3D-VQA) is essential to various 3D video processing applications. However, it has not been well investigated how to make use of perceptual multi-channel video information to improve 3D-VQA under different distortion ...
Yongfang Wang   +4 more
semanticscholar   +1 more source

3D Video Encoding

2012
Over the past decade, progress in computing and telecommunication technologies has made the storage and transmission of visual information media even more ubiquitous. Nowadays it is usual to stream huge amounts of data online in real time, e.g. over a LAN or the Internet.
Takashi Matsuyama   +3 more
openaire   +1 more source
