Results 261 to 270 of about 4,996,743 (329)
Some of the next articles are maybe not open access.

Related searches:

Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

Conference on Empirical Methods in Natural Language Processing, 2023
Large Vision-Language Model (LVLM) has enhanced the performance of various downstream tasks in visual-language understanding. Most existing approaches encode images and videos into separate feature spaces, which are then fed as inputs to large language ...
Bin Lin   +5 more
semanticscholar   +1 more source

Visual imagery and visual representation

Trends in Neurosciences, 1994
Among many controversies in visual neuroscience is whether visual imagery of objects, scenes and living beings is based upon contributions of the early visual areas or depends on hierarchical higher visual areas only, and whether the cortical areas subserving visual imagery are identical to those underlying visual perception.
P E, Roland, B, Gulyás
openaire   +2 more sources

Big Transfer (BiT): General Visual Representation Learning

European Conference on Computer Vision, 2019
Transfer of pre-trained representations improves sample efficiency and simplifies hyperparameter tuning when training deep neural networks for vision.
Alexander Kolesnikov   +6 more
semanticscholar   +1 more source

Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

International Conference on Machine Learning
Recently the state space models (SSMs) with efficient hardware-aware designs, i.e., the Mamba deep learning model, have shown great potential for long sequence modeling.
Lianghui Zhu   +5 more
semanticscholar   +1 more source

A Novel Visual Representation on Text Using Diverse Conditional GAN for Visual Recognition

IEEE Transactions on Image Processing, 2021
Automatic image visual recognition can make full use of largely available images with text descriptions on social media platforms to build large-scale image labeled datasets.
Tao Hu, Chengjiang Long, Chunxia Xiao
semanticscholar   +1 more source

Visual Basic Representations

International Journal of Algebra and Computation, 1998
We depict the weight diagrams (alias, crystal graphs) of basic and adjoint representations of complex simple Lie algebras/algebraic groups and describe some of their uses.
Plotkin, Eugene   +2 more
openaire   +1 more source

Scaling Language-Free Visual Representation Learning

arXiv.org
Visual Self-Supervised Learning (SSL) currently underperforms Contrastive Language-Image Pretraining (CLIP) in multimodal settings such as Visual Question Answering (VQA).
David Fan   +10 more
semanticscholar   +1 more source

Home - About - Disclaimer - Privacy