Results 31 to 40 of about 277,143 (97)
High Capacity Reversible Data Hiding for Encrypted 3D Mesh Models Based on Topology
Reversible data hiding in encrypted domain(RDH-ED) can not only protect the privacy of 3D mesh models and embed additional data, but also recover original models and extract additional data losslessly.
Cheng, Lulu+3 more
core
Improving Social Media Popularity Prediction with Multiple Post Dependencies
Social Media Popularity Prediction has drawn a lot of attention because of its profound impact on many different applications, such as recommendation systems and multimedia advertising. Despite recent efforts to leverage the content of social media posts
Cui, Yong+5 more
core
RecipeMeta: Metapath-enhanced Recipe Recommendation on Heterogeneous Recipe Network
Recipe is a set of instructions that describes how to make food. It can help people from the preparation of ingredients, food cooking process, etc. to prepare the food, and increasingly in demand on the Web.
Doman, Keisuke+4 more
core
TACOformer:Token-channel compounded Cross Attention for Multimodal Emotion Recognition
Recently, emotion recognition based on physiological signals has emerged as a field with intensive research. The utilization of multi-modal, multi-channel physiological signals has significantly improved the performance of emotion recognition systems ...
Li, Xinda
core
Social media popularity (SMP) prediction is a complex task involving multi-modal data integration. While pre-trained vision-language models (VLMs) like CLIP have been widely adopted for this task, their effectiveness in capturing the unique ...
Chou, Yi-Shiuan+5 more
core
Deep Mamba Multi-modal Learning
Inspired by the excellent performance of Mamba networks, we propose a novel Deep Mamba Multi-modal Learning (DMML). It can be used to achieve the fusion of multi-modal features. We apply DMML to the field of multimedia retrieval and propose an innovative
Cui, Yu+5 more
core
Automatic evaluating systems are fundamental issues in sports technologies. In many sports, such as figure skating, automated evaluating methods based on pose estimation have been proposed.
Fujii, Keisuke+3 more
core
Generative AI-enabled Mobile Tactical Multimedia Networks: Distribution, Generation, and Perception
Mobile multimedia networks (MMNs) demonstrate great potential in delivering low-latency and high-quality entertainment and tactical applications, such as short-video sharing, online conferencing, and battlefield surveillance.
Fang, Yuguang+6 more
core
Deep3DSketch+: Obtaining Customized 3D Model by Single Free-Hand Sketch through Deep Learning
As 3D models become critical in today's manufacturing and product design, conventional 3D modeling approaches based on Computer-Aided Design (CAD) are labor-intensive, time-consuming, and have high demands on the creators.
Chen, Tianrun+5 more
core
User Digital Twin-Driven Video Streaming for Customized Preferences and Adaptive Transcoding
In the rapidly evolving field of multimedia services, video streaming has become increasingly prevalent, demanding innovative solutions to enhance user experience and system efficiency.
Berhane, Kalkidan+2 more
core