Results 301 to 310 of about 1,152,162 (330)
Some of the next articles are maybe not open access.
ACM Conference on Recommender Systems, 2022
For a long time, different recommendation tasks require designing task-specific architectures and training objectives. As a result, it is hard to transfer the knowledge and representations from one task to another, thus restricting the generalization ...
Shijie Geng +4 more
semanticscholar +1 more source
For a long time, different recommendation tasks require designing task-specific architectures and training objectives. As a result, it is hard to transfer the knowledge and representations from one task to another, thus restricting the generalization ...
Shijie Geng +4 more
semanticscholar +1 more source
Toward Unified Token Learning for Vision-Language Tracking
IEEE transactions on circuits and systems for video technology (Print), 2023In this paper, we present a simple, flexible and effective vision-language (VL) tracking pipeline, termed MMTrack, which casts VL tracking as a token generation task.
Yaozong Zheng +5 more
semanticscholar +1 more source
Unified Vision-Language-Action Model
arXiv.orgVision-language-action models (VLAs) have garnered significant attention for their potential in advancing robotic manipulation. However, previous approaches predominantly rely on the general comprehension capabilities of vision-language models (VLMs) to ...
Yu-Quan Wang +7 more
semanticscholar +1 more source
AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling
Annual Meeting of the Association for Computational LinguisticsWe introduce AnyGPT, an any-to-any multimodal language model that utilizes discrete representations for the unified processing of various modalities, including speech, text, images, and music.
Jun Zhan +15 more
semanticscholar +1 more source
Show-o2: Improved Native Unified Multimodal Models
arXiv.orgThis paper presents improved native unified multimodal models, \emph{i.e.,} Show-o2, that leverage autoregressive modeling and flow matching. Built upon a 3D causal variational autoencoder space, unified visual representations are constructed through a ...
Jinheng Xie +2 more
semanticscholar +1 more source
Automatic code generation using unified modeling language activity and sequence models
IET Software, 2016S. Viswanathan, P. Samuel
semanticscholar +1 more source
2016 7th IEEE International Conference on Software Engineering and Service Science (ICSESS), 2016
Jessada Tomyim, A. Pohthong
semanticscholar +1 more source
Jessada Tomyim, A. Pohthong
semanticscholar +1 more source
Transforming heat transfer with thermal metamaterials and devices
Nature Reviews Materials, 2021Ying Li, Wei Li, Tiancheng Han
exaly
A unified description of non-radiative voltage losses in organic solar cells
Nature Energy, 2021Xian-Kai Chen, Deping Qian, Yuming Wang
exaly

