Results 271 to 280 of about 3,407,261 (330)
Some of the next articles are maybe not open access.
VD3D: Taming Large Video Diffusion Transformers for 3D Camera Control
International Conference on Learning RepresentationsModern text-to-video synthesis models demonstrate coherent, photorealistic generation of complex videos from a text description. However, most existing models lack fine-grained control over camera movement, which is critical for downstream applications ...
Sherwin Bahmani +11 more
semanticscholar +1 more source
DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion
arXiv.orgIn this paper, we introduce \textbf{DimensionX}, a framework designed to generate photorealistic 3D and 4D scenes from just a single image with video diffusion.
Wenqiang Sun +6 more
semanticscholar +1 more source
CamCo: Camera-Controllable 3D-Consistent Image-to-Video Generation
arXiv.orgRecently video diffusion models have emerged as expressive generative tools for high-quality video content creation readily available to general users.
Dejia Xu +6 more
semanticscholar +1 more source
V3D: Video Diffusion Models are Effective 3D Generators
arXiv.orgAutomatic 3D generation has recently attracted widespread attention. Recent methods have greatly accelerated the generation speed, but usually produce less-detailed objects due to limited model capacity or 3D data.
Zilong Chen +4 more
semanticscholar +1 more source
2009 IEEE 12th International Conference on Computer Vision Workshops, ICCV Workshops, 2009
We introduce a design of a coded light-based 3D color video camera optimized for build up cost as well as accuracy in depth reconstruction and acquisition speed. The components of the system include a monochromatic camera and an off-the-shelf LED projector synchronized by a miniature circuit.
O. Rubinstein +4 more
openaire +1 more source
We introduce a design of a coded light-based 3D color video camera optimized for build up cost as well as accuracy in depth reconstruction and acquisition speed. The components of the system include a monochromatic camera and an off-the-shelf LED projector synchronized by a miniature circuit.
O. Rubinstein +4 more
openaire +1 more source
AC3D: Analyzing and Improving 3D Camera Control in Video Diffusion Transformers
Computer Vision and Pattern RecognitionNumerous works have recently integrated 3D camera control into foundational text-to-video models, but the resulting camera control is often imprecise, and video generation quality suffers.
Sherwin Bahmani +7 more
semanticscholar +1 more source
Capture and 3D Video Processing of Volumetric Video
International Conference on Information Photonics, 2019Volumetric video is regarded worldwide as the next important development step in the field of media production. Especially in the context of the extremely rapid development of the Virtual Reality (VR) and Augmented Reality (AR) markets, volumetric video ...
O. Schreer +6 more
semanticscholar +1 more source
Video-3D LLM: Learning Position-Aware Video Representation for 3D Scene Understanding
Computer Vision and Pattern RecognitionThe rapid advancement of Multimodal Large Language Models (MLLMs) has significantly impacted various multimodal tasks. However, these models face challenges in tasks that require spatial understanding within 3D environments.
Duo Zheng, Shijia Huang, Liwei Wang
semanticscholar +1 more source
Jointly learning perceptually heterogeneous features for blind 3D video quality assessment
Neurocomputing, 20193D videos quality assessment (3D-VQA) is essential to various 3D video processing applications. However, it has not been well investigated on how to make use of perceptual multi-channel video information to improve 3D-VQA under different distortion ...
Yongfang Wang +4 more
semanticscholar +1 more source
2012
Over the past decade the progress in computing and telecommunication technologies have made storage and transmission of visual information media even more ubiquitous. Nowadays it is usual to stream in real-time huge amount of data on-line, e.g. over a LAN or the Internet.
Takashi Matsuyama +3 more
openaire +1 more source
Over the past decade the progress in computing and telecommunication technologies have made storage and transmission of visual information media even more ubiquitous. Nowadays it is usual to stream in real-time huge amount of data on-line, e.g. over a LAN or the Internet.
Takashi Matsuyama +3 more
openaire +1 more source

