Results 31 to 40 of about 1,322,987 (378)
Shikra: Unleashing Multimodal LLM's Referential Dialogue Magic [PDF]
In human conversations, individuals can indicate relevant regions within a scene while addressing others. In turn, the other person can then respond by referring to specific regions if necessary.
Ke Chen +5 more
semanticscholar +1 more source
Acknowledgement to Reviewers of Multimodal Technologies and Interaction in 2017
Peer review is an essential part in the publication process, ensuring that Multimodal Technologies and Interaction maintains high quality standards for its published papers [...]
Multimodal Technologies and Interaction Editorial Office
doaj +1 more source
A haptic-enabled multimodal interface for the planning of hip arthroplasty [PDF]
Multimodal environments help fuse a diverse range of sensory modalities, which is particularly important when integrating the complex data involved in surgical preoperative planning.
Caldwell, DG +6 more
core +1 more source
A large number of farms in developing countries are smallholder farms. In China, this trend is expected to persist for the foreseeable future. However, smallholder farmers in China face several challenges, including excessive fertilizer use for higher ...
Jing Hua +6 more
doaj +1 more source
Automatic prediction of presentation style and student engagement from videos
Presentation style is an important dimension to be considered for delivering lectures or presentations. It affects the quality of the content delivery as well as the engagement of the students who consume the lectures, which is a key aspect of a learning
Chinchu Thomas +3 more
doaj +1 more source
Adaptive multimodal continuous ant colony optimization [PDF]
Seeking multiple optima simultaneously, which multimodal optimization aims at, has attracted increasing attention but remains challenging. Taking advantage of ant colony optimization algorithms in preserving high diversity, this paper intends to extend ...
Chen, Wei-Neng +6 more
core +2 more sources
Cryo-EM structures from sub-nl volumes using pin-printing and jet vitrification
There is a need to further improve the automation of cryo-EM sample preparation to make it more easily accessible for non-specialists, reduce sample waste and increase reproducibility.
Raimond B. G. Ravelli +8 more
doaj +1 more source
3D Printing—A Cutting Edge Technology for Treating Post-Infarction Patients
The increasing complexity of cardiovascular interventions requires advanced peri-procedural imaging and tailored treatment. Three-dimensional printing technology represents one of the most significant advances in the field of cardiac imaging ...
Daniel Cernica +4 more
doaj +1 more source
SEED-Bench: Benchmarking Multimodal LLMs with Generative Comprehension [PDF]
Based on powerful Large Language Models (LLMs), recent generative Multimodal Large Language Models (MLLMs) have gained prominence as a pivotal research area, exhibiting remarkable capability for both comprehension and generation. In this work, we address
Bohao Li +5 more
semanticscholar +1 more source
Aligning Large Multimodal Models with Factually Augmented RLHF [PDF]
Large Multimodal Models (LMM) are built across modalities and the misalignment between two modalities can result in"hallucination", generating textual outputs that are not grounded by the multimodal information in context.
Zhiqing Sun +11 more
semanticscholar +1 more source

