Results 311 to 320 of about 2,484,152 (338)
Some of the next articles are maybe not open access.

MolCA: Molecular Graph-Language Modeling with Cross-Modal Projector and Uni-Modal Adapter

Conference on Empirical Methods in Natural Language Processing, 2023
Language Models (LMs) have demonstrated impressive molecule understanding ability on various 1D text-related tasks. However, they inherently lack 2D graph perception - a critical ability of human professionals in comprehending molecules' topological ...
Zhiyuan Liu   +7 more
semanticscholar   +1 more source

Chameleon: Mixed-Modal Early-Fusion Foundation Models

arXiv.org
We present Chameleon, a family of early-fusion token-based mixed-modal models capable of understanding and generating images and text in any arbitrary sequence.
Chameleon Team   +3 more
semanticscholar   +1 more source

Cross-modal Ambiguity Learning for Multimodal Fake News Detection

The Web Conference, 2022
Cross-modal learning is essential to enable accurate fake news detection due to the fast-growing multimodal contents in online social communities. A fundamental challenge of multimodal fake news detection lies in the inherent ambiguity across different ...
Yixuan Chen   +6 more
semanticscholar   +1 more source

Modal Analysis of Fluid Flows: An Overview [PDF]

open access: yesAIAA Journal, 2017
Simple aerodynamic configurations under even modest conditions can exhibit complex flows with a wide range of temporal and spatial features. It has become common practice in the analysis of these flows to look for and extract physically important ...
Kunihiko Taira   +2 more
exaly   +2 more sources

Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model

arXiv.org
We introduce Transfusion, a recipe for training a multi-modal model over discrete and continuous data. Transfusion combines the language modeling loss function (next token prediction) with diffusion to train a single transformer over mixed-modality ...
Chunting Zhou   +9 more
semanticscholar   +1 more source

Multi-Modal 3D Object Detection in Autonomous Driving: A Survey and Taxonomy

IEEE Transactions on Intelligent Vehicles, 2023
Autonomous vehicles require constant environmental perception to obtain the distribution of obstacles to achieve safe driving. Specifically, 3D object detection is a vital functional module as it can simultaneously predict surrounding objects' categories,
L. xilinx Wang   +10 more
semanticscholar   +1 more source

Bi-directional Adapter for Multi-modal Tracking

AAAI Conference on Artificial Intelligence, 2023
Due to the rapid development of computer vision, single-modal (RGB) object tracking has made significant progress in recent years. Considering the limitation of single imaging sensor, multi-modal images (RGB, infrared, etc.) are introduced to compensate ...
Bing Cao   +3 more
semanticscholar   +1 more source

Multi-modal Transformer for Video Retrieval

European Conference on Computer Vision, 2020
The task of retrieving video content relevant to natural language queries plays a critical role in effectively handling internet-scale datasets. Most of the existing methods for this caption-to-video retrieval problem do not fully exploit cross-modal ...
Valentin Gabeur   +3 more
semanticscholar   +1 more source

Theoretical and Experimental Modal Analysis of a 6 PUS PKM

International Conference on Informatics in Control, Automation and Robotics, 2019
Have free times? Read theoretical and experimental modal analysis writer by Why? A best seller book worldwide with fantastic worth as well as content is incorporated with interesting words. Where? Merely right here, in this site you can read online. Want
N. Maia, J. M. N. Silva, Wai Ming To
semanticscholar   +1 more source

MMGCN: Multi-modal Graph Convolution Network for Personalized Recommendation of Micro-video

ACM Multimedia, 2019
Personalized recommendation plays a central role in many online content sharing platforms. To provide quality micro-video recommendation service, it is of crucial importance to consider the interactions between users and items (i.e. micro-videos) as well
Yin-wei Wei   +5 more
semanticscholar   +1 more source

Home - About - Disclaimer - Privacy