Results 291 to 300 of about 3,407,261 (330)
Some of the next articles are maybe not open access.
Continuous 3D Perception Model with Persistent State
Computer Vision and Pattern RecognitionWe present a unified framework capable of solving a broad range of 3D tasks. Our approach features a stateful recurrent model that continuously updates its state representation with each new observation.
Qianqian Wang +4 more
semanticscholar +1 more source
CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer
International Conference on Learning RepresentationsWe present CogVideoX, a large-scale text-to-video generation model based on diffusion transformer, which can generate 10-second continuous videos aligned with text prompt, with a frame rate of 16 fps and resolution of 768 * 1360 pixels.
Zhuoyi Yang +18 more
semanticscholar +1 more source
2011 International Conference on 3D Imaging (IC3D), 2011
Device for presenting 3D animated images, obtained by taking pictures through a diffraction grating, from all size objects, and a projection onto a holographic screen. The diffraction, orientated horizontally, gets by each ray of light a way done which depends on its wavelength and allows to obtain as many angles of vision as wavelengths.
openaire +1 more source
Device for presenting 3D animated images, obtained by taking pictures through a diffraction grating, from all size objects, and a projection onto a holographic screen. The diffraction, orientated horizontally, gets by each ray of light a way done which depends on its wavelength and allows to obtain as many angles of vision as wavelengths.
openaire +1 more source
Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model
arXiv.orgWe present Step-Video-T2V, a state-of-the-art text-to-video pre-trained model with 30B parameters and the ability to generate videos up to 204 frames in length. A deep compression Variational Autoencoder, Video-VAE, is designed for video generation tasks,
Guoqing Ma +99 more
semanticscholar +1 more source
1996
The invention concerns a 3D video endoscope with two optical inputs and an electrical output for the video signal. The entire optical system and the single camera head are situated at the endoscope's forward end. The electrical signals are transmitted by cables on the shaft channel to the signal-processing video electronic system.
KARLSRUHE FORSCHZENT, BECKER HEINZ
openaire +1 more source
The invention concerns a 3D video endoscope with two optical inputs and an electrical output for the video signal. The entire optical system and the single camera head are situated at the endoscope's forward end. The electrical signals are transmitted by cables on the shaft channel to the signal-processing video electronic system.
KARLSRUHE FORSCHZENT, BECKER HEINZ
openaire +1 more source
Real-time 3D video-based MR remote collaboration using gesture cues and virtual replicas
The International Journal of Advanced Manufacturing Technology, 2022X. Zhang +7 more
semanticscholar +1 more source
2011
This paper proposes an improved algorithm based on Active Appearance Models (AAM) and applies on a real-time 3D video conference system with a novel 3D display device which can pop out an avatar out of the display in the air. The proposed algorithm utilizes an improved Adaboost algorithm [1] for face detection based on skin color information.
Kun-Lung Tseng +3 more
openaire +1 more source
This paper proposes an improved algorithm based on Active Appearance Models (AAM) and applies on a real-time 3D video conference system with a novel 3D display device which can pop out an avatar out of the display in the air. The proposed algorithm utilizes an improved Adaboost algorithm [1] for face detection based on skin color information.
Kun-Lung Tseng +3 more
openaire +1 more source
2012
Visualization is one of the most standard applications of 3D video. Its essential functionality includes interactive free-viewpoint and 3D (pop-up) visualization of the captured scene as is. Following an ordinary 3D video visualization system, this chapter presents a novel free-viewpoint visualization method for 3D video stream of a single human in ...
Takashi Matsuyama +3 more
openaire +1 more source
Visualization is one of the most standard applications of 3D video. Its essential functionality includes interactive free-viewpoint and 3D (pop-up) visualization of the captured scene as is. Following an ordinary 3D video visualization system, this chapter presents a novel free-viewpoint visualization method for 3D video stream of a single human in ...
Takashi Matsuyama +3 more
openaire +1 more source
2010
In this chapter, we describe a method to render realistic 3D Videos by applying a clever dynamic 3D texturing scheme to the moving geometry representation captured by the methods proposed in the previous chapters. By displaying high-quality renderings of the recorded actor from any viewpoint, our system enables new interesting applications for 3D ...
openaire +1 more source
In this chapter, we describe a method to render realistic 3D Videos by applying a clever dynamic 3D texturing scheme to the moving geometry representation captured by the methods proposed in the previous chapters. By displaying high-quality renderings of the recorded actor from any viewpoint, our system enables new interesting applications for 3D ...
openaire +1 more source

