Results 31 to 40 of about 276,223 (97)
Using Set Covering to Generate Databases for Holistic Steganalysis
Within an operational framework, covers used by a steganographer are likely to come from different sensors and different processing pipelines than the ones used by researchers for training their steganalysis models. Thus, a performance gap is unavoidable
Abecidan, Rony+4 more
core
Experience of live video streaming can be improved if the video uploader has more accurate knowledge about the future available bandwidth. Because with such knowledge, one is able to know what sizes should he encode the frames to be in an ever-changing ...
Zheng, Weijia
core
High Capacity Reversible Data Hiding for Encrypted 3D Mesh Models Based on Topology
Reversible data hiding in encrypted domain(RDH-ED) can not only protect the privacy of 3D mesh models and embed additional data, but also recover original models and extract additional data losslessly.
Cheng, Lulu+3 more
core
TACOformer:Token-channel compounded Cross Attention for Multimodal Emotion Recognition
Recently, emotion recognition based on physiological signals has emerged as a field with intensive research. The utilization of multi-modal, multi-channel physiological signals has significantly improved the performance of emotion recognition systems ...
Li, Xinda
core
Improving Social Media Popularity Prediction with Multiple Post Dependencies
Social Media Popularity Prediction has drawn a lot of attention because of its profound impact on many different applications, such as recommendation systems and multimedia advertising. Despite recent efforts to leverage the content of social media posts
Cui, Yong+5 more
core
Social media popularity (SMP) prediction is a complex task involving multi-modal data integration. While pre-trained vision-language models (VLMs) like CLIP have been widely adopted for this task, their effectiveness in capturing the unique ...
Chou, Yi-Shiuan+5 more
core
Deep Mamba Multi-modal Learning
Inspired by the excellent performance of Mamba networks, we propose a novel Deep Mamba Multi-modal Learning (DMML). It can be used to achieve the fusion of multi-modal features. We apply DMML to the field of multimedia retrieval and propose an innovative
Cui, Yu+5 more
core
RecipeMeta: Metapath-enhanced Recipe Recommendation on Heterogeneous Recipe Network
Recipe is a set of instructions that describes how to make food. It can help people from the preparation of ingredients, food cooking process, etc. to prepare the food, and increasingly in demand on the Web.
Doman, Keisuke+4 more
core
Automatic evaluating systems are fundamental issues in sports technologies. In many sports, such as figure skating, automated evaluating methods based on pose estimation have been proposed.
Fujii, Keisuke+3 more
core
AI-Based Sports Highlight Generation for Social Media [PDF]
Social media plays a significant role for sports organizations with millions of active fans, but publishing highlights is often a tedious manual operation.
Dorcheh, Sayed Mohammad Majidi+7 more
core +1 more source