Enhancing multimodal analogical reasoning with Logic Augmented Generation [PDF]
Recent advances in Large Language Models have demonstrated their capabilities across a variety of tasks. However, automatically extracting implicit knowledge from natural language remains a significant challenge, as machines lack active experience with ...
Anna Sofia Lippolis +2 more
semanticscholar +4 more sources
Interpretable Multimodal Out-of-Context Detection with Soft Logic Regularization [PDF]
The rapid spread of information through mobile devices and media has led to the widespread of false or deceptive news, causing significant concerns in society.
Huanhuan Ma +4 more
openalex +2 more sources
Code-Vision: Evaluating Multimodal LLMs Logic Understanding and Code Generation Capabilities [PDF]
This paper introduces Code-Vision, a benchmark designed to evaluate the logical understanding and code generation capabilities of Multimodal Large Language Models (MLLMs).
Hanbin Wang +9 more
openalex +2 more sources
Probability Logic for Harsanyi Type Spaces [PDF]
Probability logic has contributed to significant developments in belief types for game-theoretical economics. We present a new probability logic for Harsanyi Type spaces, show its completeness, and prove both a de-nesting property and a unique extension ...
Chunlai Zhou
doaj +3 more sources
FlowVQA: Mapping Multimodal Logic in Visual Question Answering with Flowcharts [PDF]
Existing benchmarks for visual question answering lack in visual grounding and complexity, particularly in evaluating spatial reasoning skills. We introduce FlowVQA, a novel benchmark aimed at assessing the capabilities of visual question-answering ...
Shubhankar Singh +6 more
openalex +2 more sources
Towards Multimodal Co-Construction of Explanations for Robots: Combining Inductive Logic Programming and Large Language Models to Explain Robot Faults [PDF]
This paper explores a hybrid approach to the multimodal co-const-ruction of explanations for robot faults, integrating Inductive Logic Programming (ILP) and Large Language Models (LLMs).
Youssef Mahmoud Youssef, Teena Hassan
openalex +2 more sources
Mine-DW-Fusion: BEV Multiscale-Enhanced Fusion Object-Detection Model for Underground Coal Mine Based on Dynamic Weight Adjustment [PDF]
Environmental perception is crucial for achieving autonomous driving of auxiliary haulage vehicles in underground coal mines. The complex underground environment and working conditions, such as dust pollution, uneven lighting, and sensor data ...
Wanzi Yan +7 more
doaj +2 more sources
Implementing Multimodal Hardware Security with 2D α‐In2Se3 Ferroelectric Transistor [PDF]
Security is a critical challenge in the integrated circuit (IC) industry, yet device‐level hardware security remains largely underexplored. Most existing solutions necessitate modifications to current technology nodes and typically address only a single ...
Xinwei Zhang +11 more
doaj +2 more sources
A Multimodal Retrieval-Augmented Generation System with ReAct Agent Logic for Multi-Hop Reasoning
The rapid advancement of generative artificial intelligence models significantly influences modern methods of information processing and user interactions with information systems.
Denys Yuvzhenko +5 more
openalex +3 more sources
Digital transformation in higher education: logical framework, practical dilemmas, and implementation approaches [PDF]
The digital age comes with new demands and challenges for talent cultivation within the higher education system. The digital transformation in higher education has emerged as a critical element in addressing these challenges.
Juan Tang, Pin Huang, Shuangsheng Yan
doaj +2 more sources

