Results 111 to 120 of about 404,957 (120)
Some of the next articles are maybe not open access.
Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation
International Journal of Computer Vision, 2023Significant advancements have been achieved in the realm of large-scale pre-trained text-to-video Diffusion Models (VDMs). However, previous methods either rely solely on pixel-based VDMs, which come with high computational costs, or on latent-based VDMs,
David Junhao Zhang +7 more
semanticscholar +1 more source
Pixel Reasoner: Incentivizing Pixel-Space Reasoning with Curiosity-Driven Reinforcement Learning
arXiv.orgChain-of-thought reasoning has significantly improved the performance of Large Language Models (LLMs) across various domains. However, this reasoning process has been confined exclusively to textual space, limiting its effectiveness in visually intensive
Alex Su +4 more
semanticscholar +1 more source
Pixel-GS: Density Control with Pixel-aware Gradient for 3D Gaussian Splatting
European Conference on Computer Vision3D Gaussian Splatting (3DGS) has demonstrated impressive novel view synthesis results while advancing real-time rendering performance. However, it relies heavily on the quality of the initial point cloud, resulting in blurring and needle-like artifacts ...
Zheng Zhang +4 more
semanticscholar +1 more source
Cross-Image Pixel Contrasting for Semantic Segmentation
IEEE Transactions on Pattern Analysis and Machine IntelligenceThis work studies the problem of image semantic segmentation. Current approaches focus mainly on mining “local” context, i.e., dependencies between pixels within individual images, by specifically-designed, context aggregation modules (e.g., dilated ...
Tianfei Zhou, Wenguan Wang
semanticscholar +1 more source
Hierarchical Cross-Modal Talking Face Generation With Dynamic Pixel-Wise Loss
Computer Vision and Pattern Recognition, 2019We devise a cascade GAN approach to generate talking face video, which is robust to different face shapes, view angles, facial characteristics, and noisy audio conditions.
Lele Chen +3 more
semanticscholar +1 more source
Presentación de la edición Semilla Científica 6
Revista Semilla Científica*
Amelia Sarco
semanticscholar +1 more source
Comput. Aided Civ. Infrastructure Eng., 2019
Deep learning‐based structural damage detection methods overcome the limitation of inferior adaptability caused by extensively varying real‐world situations (e.g., lighting and shadow changes).
Shengyuan Li, Xuefeng Zhao, Guangyi Zhou
semanticscholar +1 more source
Deep learning‐based structural damage detection methods overcome the limitation of inferior adaptability caused by extensively varying real‐world situations (e.g., lighting and shadow changes).
Shengyuan Li, Xuefeng Zhao, Guangyi Zhou
semanticscholar +1 more source
Active pixel sensor matrix based on monolayer MoS2 phototransistor array
Nature Materials, 2022Akhil Dodda +13 more
semanticscholar +1 more source
Automated Pixel‐Level Pavement Crack Detection on 3D Asphalt Surfaces Using a Deep‐Learning Network
Comput. Aided Civ. Infrastructure Eng., 2017Allen A. Zhang +9 more
semanticscholar +1 more source
Principles and prospects for single-pixel imaging
Nature Photonics, 2018M. Edgar, G. Gibson, M. Padgett
semanticscholar +1 more source

