MM-Vet: Evaluating Large Multimodal Models for Integrated Capabilities [PDF]
We propose MM-Vet, an evaluation benchmark that examines large multimodal models (LMMs) on complicated multimodal tasks. Recent LMMs have shown various intriguing abilities, such as solving math problems written on the blackboard, reasoning about events ...
Weihao Yu +7 more
semanticscholar +1 more source
MM-REACT: Prompting ChatGPT for Multimodal Reasoning and Action [PDF]
We propose MM-REACT, a system paradigm that integrates ChatGPT with a pool of vision experts to achieve multimodal reasoning and action. In this paper, we define and explore a comprehensive list of advanced vision tasks that are intriguing to solve, but ...
Zhengyuan Yang +9 more
semanticscholar +1 more source
MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation [PDF]
We propose the first joint audio-video generation framework that brings engaging watching and listening experiences simultaneously, towards high-quality realistic videos. To generate joint audio-video pairs, we propose a novel Multi-Modal Diffusion model
Ludan Ruan +8 more
semanticscholar +1 more source
MM-DFN: Multimodal Dynamic Fusion Network for Emotion Recognition in Conversations [PDF]
Emotion Recognition in Conversations (ERC) has considerable prospects for developing empathetic machines. For multimodal ERC, it is vital to understand context and fuse modality information in conversations.
Dou Hu +4 more
semanticscholar +1 more source
BigEarthNet-MM: A Large-Scale, Multimodal, Multilabel Benchmark Archive for Remote Sensing Image Classification and Retrieval [Software and Data Sets] [PDF]
This article presents the multimodal BigEarthNet (BigEarthNet-MM) benchmark archive consisting of 590,326 pairs of Sentinel-1 and Sentinel-2 image patches to support deep learning (DL) studies in multimodal, multilabel remote sensing (RS) image retrieval
Gencer Sumbul +8 more
semanticscholar +1 more source
Basic assumptions and definitions in the analysis of financial leverage [PDF]
The financial leverage literature has been in a state of terminological chaos for decades as evidenced, forexample, by the Nobel Prize Lecture mistake on the one hand, and the global financial crisis on the other.A meaningful analysis of the leverage ...
Tomasz Berent
doaj +1 more source
Environmental problems such as air pollution, global warming and water scarcity pose a threat to environmental sustainability. One of the causes is the administration of public services.
Bayu Nugroho +2 more
doaj +1 more source
This study examine the determinants that may predict the consumer’s intention to accept equity-based product, which is mushārakah mutanāqisah (MM) Islamic mortgage. Survey was conducted using multi-stage and purposive sampling.
Imran Mehboob Shaikh +1 more
doaj +1 more source
FRICTIONAL MODELING AND OPTIMIZATION FOR A VIBRATION MODAL ANALYSIS SIMULATOR DEVICE USING GENETIC ALGORITHM [PDF]
The aim of this study is friction modeling and optimization of a âvibration modal analysis simulatorâ. This device has been used for observation and measurement of natural frequencies and mode shapes ofvibrating components and parts under the free or
m Elhami, s Razavian, H. Teimuri
doaj +1 more source
INVESTIGATION NUMERICAL BEHAVIOR STATIC AND DYNAMIC EMBANKMENTS OF RAILWAY CONSTRUCTED BY GEOFOAM [PDF]
One of the most important problems related to performing railway on soft subgrade, is the problem of bearing capacity and embankment controlling settlement relied on these subgrades.
M. Esmaeili, V. Khalilian
doaj +1 more source

