Results 251 to 260 of about 1,394,777 (330)
Collaborative positional attention for image to English question answering. [PDF]
Li Y, Teng H.
europepmc +1 more source
Physics-Aware Spatiotemporal Consistency for Transferable Defense of Autonomous Driving Perception. [PDF]
Liu Y +7 more
europepmc +1 more source
Some of the next articles are maybe not open access.
Related searches:
Related searches:
Modal Semantics without Worlds
Philosophy Compass, 2016Abstract Over the last half century, possible worlds have bled into almost every area of philosophy. In the metaphysics of modality, for example, philosophers have used possible worlds almost exclusively to illuminate discourse about metaphysical necessity and possibility.
Craig Warmke
openaire +2 more sources
Semantics Disentangling for Cross-Modal Retrieval
IEEE Transactions on Image ProcessingCross-modal retrieval (e.g., query a given image to obtain a semantically similar sentence, and vice versa) is an important but challenging task, as the heterogeneous gap and inconsistent distributions exist between different modalities. The dominant approaches struggle to bridge the heterogeneity by capturing the common representations among ...
Zheng Wang +5 more
openaire +3 more sources
Journal of Philosophical Logic, 1996
Believing that modal semantics should be done without ontological assumptions made by possible-worlds semantics the author reproduces Christopher Menzel's model-theoretical semantics for modal languages which works without assuming ``possibilia'', and builds up a more extensionalist ``ontology-free'' modification of it.
G. Ray
openaire +2 more sources
Believing that modal semantics should be done without ontological assumptions made by possible-worlds semantics the author reproduces Christopher Menzel's model-theoretical semantics for modal languages which works without assuming ``possibilia'', and builds up a more extensionalist ``ontology-free'' modification of it.
G. Ray
openaire +2 more sources
Semantics-Aware Spatial-Temporal Binaries for Cross-Modal Video Retrieval
IEEE Transactions on Image Processing, 2021With the current exponential growth of video-based social networks, video retrieval using natural language is receiving ever-increasing attention.
Mengshi Qi +4 more
semanticscholar +1 more source
arXiv.org, 2023
We present SPHINX, a versatile multi-modal large language model (MLLM) with a joint mixing of model weights, tuning tasks, and visual embeddings. First, for stronger vision-language alignment, we unfreeze the large language model (LLM) during pre ...
Ziyi Lin +15 more
semanticscholar +1 more source
We present SPHINX, a versatile multi-modal large language model (MLLM) with a joint mixing of model weights, tuning tasks, and visual embeddings. First, for stronger vision-language alignment, we unfreeze the large language model (LLM) during pre ...
Ziyi Lin +15 more
semanticscholar +1 more source

