Decoder-side depth estimation - Open Access .click


Hand gesture 3D pose estimation method based on Swin Transformer and CNN.

Sci Rep
Dang R, Feng G.

Computer Graphics Forum, EarlyView.
Spatially augmented reality (SAR) transforms large, surround, collaborative experiences out of VR/AR headsets to the real world by merging content from projectors with the physical environment. This detailed state‐of‐the‐art survey reports on the advancements in multi‐projector aggregation and hardware technologies used to achieve SAR and build ...
Aditi Majumder, Muhammad Twaha Ibrahim

Encoding Occupancy in Memory Location for Efficient and Compact High‐Resolution Voxel Structures

Computer Graphics Forum, EarlyView.
We encode information about geometric structure into the pointers of a sparse voxel directed acyclic graph (SVDAG). Each pointer carries information about the structure of the node it points to. Our encoding improves ray tracing performance and reduces model size in memory.
Jaina Modisett, Markus Billeter

HASwinNet: A Swin Transformer-Based Denoising Framework with Hybrid Attention for mmWave MIMO Systems.

Entropy (Basel)
Han X, Tu H, Ying J, Chen J, Xing Z.

3D Character Reconstruction from Hand‐drawn Model Sheets

Computer Graphics Forum, EarlyView.
Hand‐drawn model sheets are widely used in character design to define 3D shape and appearance through sparse multi‐view drawings. Reconstructing 3D characters from such sparse inputs has traditionally been challenging due to insufficient visual information.
Hyejeong Yoon +3 more

Intracortical brain-computer interface for navigation in virtual reality in macaque monkeys.

Sci Adv
Saussus O +4 more

PBR‐Inspired Controllable Diffusion for Image Generation

Computer Graphics Forum, EarlyView.
Despite recent advances in text‐to‐image generation, controlling geometric layout and PBR material properties in synthesized scenes remains challenging. We present a pipeline that first produces a G‐buffer (albedo, normals, depth, roughness, shading, and metallic) from a text prompt and then renders a final image through a PBR‐inspired branch ...
Bowen Xue +3 more

Enhancing Fluorescence Lifetime Imaging With Differential Transformer.

J Biophotonics
Erbas I +8 more

See4D: Pose‐Free 4D Generation via Auto‐Regressive Video Inpainting

Computer Graphics Forum, EarlyView.
Immersive applications call for synthesizing spatiotemporal 4D content from casual videos without costly 3D supervision. Existing video‐to‐4D methods typically rely on manually annotated camera poses, which are labor‐intensive and brittle for in‐the‐wild footage.
Dongyue Lu +10 more

VLM-Nav: Mapless UAV navigation using monocular vision driven by vision-language models.

PLoS One
Sarker GC, Azad A, Rahman S, Hasan MM.
