Results 301 to 310 of about 2,484,152 (338)

A multi-modal parcellation of human cerebral cortex

open access: yesNature, 2016
Understanding the amazingly complex human cerebral cortex requires a map (or parcellation) of its major subdivisions, known as cortical areas. Making an accurate areal map has been a century-old objective in neuroscience.
Matthew F Glasser   +2 more
exaly   +2 more sources

ShareGPT4V: Improving Large Multi-Modal Models with Better Captions

European Conference on Computer Vision, 2023
In the realm of large multi-modal models (LMMs), efficient modality alignment is crucial yet often constrained by the scarcity of high-quality image-text data.
Lin Chen   +7 more
semanticscholar   +1 more source

Otter: A Multi-Modal Model With In-Context Instruction Tuning

IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023
Recent advances in Large Multimodal Models (LMMs) have unveiled great potential as visual assistants. However, most existing works focus on responding to individual instructions or using previous dialogues for contextual understanding.
Bo Li   +5 more
semanticscholar   +1 more source

mPLUG-OwI2: Revolutionizing Multi-modal Large Language Model with Modality Collaboration

Computer Vision and Pattern Recognition, 2023
Multi-modal Large Language Models (MLLMs) have demonstrated impressive instruction abilities across various open-ended tasks. However, previous methods primarily fo-cus on enhancing multi-modal capabilities.
Qinghao Ye   +9 more
semanticscholar   +1 more source

PointAugmenting: Cross-Modal Augmentation for 3D Object Detection

Computer Vision and Pattern Recognition, 2021
Camera and LiDAR are two complementary sensors for 3D object detection in the autonomous driving context. Camera provides rich texture and color cues while LiDAR specializes in relative distance sensing.
Chunwei Wang   +3 more
semanticscholar   +1 more source

Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis

Computer Vision and Pattern Recognition
In the quest for artificial general intelligence, Multi-modal Large Language Models (MLLMs) have emerged as a focal point in recent advancements. However, the predominant focus remains on developing their capabilities in static image understanding.
Chaoyou Fu   +19 more
semanticscholar   +1 more source

Home - About - Disclaimer - Privacy