Some of the following articles may not be open access.

Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation

International Journal of Computer Vision, 2023
Significant advancements have been achieved in the realm of large-scale pre-trained text-to-video Diffusion Models (VDMs). However, previous methods either rely solely on pixel-based VDMs, which come with high computational costs, or on latent-based VDMs,
David Junhao Zhang   +7 more
semanticscholar   +1 more source

Pixel Reasoner: Incentivizing Pixel-Space Reasoning with Curiosity-Driven Reinforcement Learning

arXiv.org
Chain-of-thought reasoning has significantly improved the performance of Large Language Models (LLMs) across various domains. However, this reasoning process has been confined exclusively to textual space, limiting its effectiveness in visually intensive
Alex Su   +4 more
semanticscholar   +1 more source

Pixel-GS: Density Control with Pixel-aware Gradient for 3D Gaussian Splatting

European Conference on Computer Vision
3D Gaussian Splatting (3DGS) has demonstrated impressive novel view synthesis results while advancing real-time rendering performance. However, it relies heavily on the quality of the initial point cloud, resulting in blurring and needle-like artifacts ...
Zheng Zhang   +4 more
semanticscholar   +1 more source

Cross-Image Pixel Contrasting for Semantic Segmentation

IEEE Transactions on Pattern Analysis and Machine Intelligence
This work studies the problem of image semantic segmentation. Current approaches focus mainly on mining “local” context, i.e., dependencies between pixels within individual images, by specifically-designed, context aggregation modules (e.g., dilated ...
Tianfei Zhou, Wenguan Wang
semanticscholar   +1 more source

Hierarchical Cross-Modal Talking Face Generation With Dynamic Pixel-Wise Loss

Computer Vision and Pattern Recognition, 2019
We devise a cascade GAN approach to generate talking face video, which is robust to different face shapes, view angles, facial characteristics, and noisy audio conditions.
Lele Chen   +3 more
semanticscholar   +1 more source

Automatic pixel‐level multiple damage detection of concrete structure using fully convolutional network

Computer-Aided Civil and Infrastructure Engineering, 2019
Deep learning‐based structural damage detection methods overcome the limitation of inferior adaptability caused by extensively varying real‐world situations (e.g., lighting and shadow changes).
Shengyuan Li, Xuefeng Zhao, Guangyi Zhou
semanticscholar   +1 more source

Active pixel sensor matrix based on monolayer MoS2 phototransistor array

Nature Materials, 2022
Akhil Dodda   +13 more
semanticscholar   +1 more source

Automated Pixel‐Level Pavement Crack Detection on 3D Asphalt Surfaces Using a Deep‐Learning Network

Computer-Aided Civil and Infrastructure Engineering, 2017
Allen A. Zhang   +9 more
semanticscholar   +1 more source

Principles and prospects for single-pixel imaging

Nature Photonics, 2018
M. Edgar, G. Gibson, M. Padgett
semanticscholar   +1 more source
