Hand gesture 3D pose estimation method based on swin transformer and CNN. [PDF]
Dang R, Feng G.
europepmc +1 more source
Democratising Multi‐Projector Displays
Spatially augmented reality (SAR) transforms large, surround, collaborative experiences out of VR/AR headsets to the real world by merging content from projectors with the physical environment. This detailed state‐of‐the‐art survey reports on the advancements in multi‐projector aggregation and hardware technologies used to achieve SAR and build ...
Aditi Majumder, Muhammad Twaha Ibrahim
wiley +1 more source
Encoding Occupancy in Memory Location for Efficient and Compact High‐Resolution Voxel Structures
We encode information about geometric structure into the pointers of a sparse voxel directed acyclic graph (SVDAG). Each pointer carries information about the structure of the node it points to. Our encoding improves ray tracing performance and reduces model size in memory.
Jaina Modisett, Markus Billeter
wiley +1 more source
HASwinNet: A Swin Transformer-Based Denoising Framework with Hybrid Attention for mmWave MIMO Systems. [PDF]
Han X, Tu H, Ying J, Chen J, Xing Z.
europepmc +1 more source
3D Character Reconstruction from Hand‐drawn Model Sheets
Abstract Hand‐drawn model sheets are widely used in character design to define 3D shape and appearance through sparse multi‐view drawings. Reconstructing 3D characters from such sparse inputs has traditionally been challenging due to insufficient visual information.
Hyejeong Yoon +3 more
wiley +1 more source
Intracortical brain-computer interface for navigation in virtual reality in macaque monkeys. [PDF]
Saussus O +4 more
europepmc +1 more source
PBR‐Inspired Controllable Diffusion for Image Generation
Abstract Despite recent advances in text‐to‐image generation, controlling geometric layout and PBR material properties in synthesized scenes remains challenging. We present a pipeline that first produces a G‐buffer (albedo, normals, depth, roughness, shading, and metallic) from a text prompt and then renders a final image through a PBR‐inspired branch ...
Bowen Xue +3 more
wiley +1 more source
Enhancing Fluorescence Lifetime Imaging With Differential Transformer. [PDF]
Erbas I +8 more
europepmc +1 more source
See4D: Pose‐Free 4D Generation via Auto‐Regressive Video Inpainting
Abstract Immersive applications call for synthesizing spatiotemporal 4D content from casual videos without costly 3D supervision. Existing video‐to‐4D methods typically rely on manually annotated camera poses, which are labor‐intensive and brittle for in‐the‐wild footage.
Dongyue Lu +10 more
wiley +1 more source
VLM-Nav: Mapless UAV navigation using monocular vision driven by vision-language models. [PDF]
Sarker GC, Azad A, Rahman S, Hasan MM.
europepmc +1 more source

