Results 51 to 60 of about 276,223 (97)
Universal Organizer of SAM for Unsupervised Semantic Segmentation
Unsupervised semantic segmentation (USS) aims to achieve high-quality segmentation without manual pixel-level annotations. Existing USS models provide coarse category classification for regions, but the results often have blurry and imprecise edges ...
Cai, Xinhao+5 more
core
DKiS: Decay weight invertible image steganography with private key
Image steganography, defined as the practice of concealing information within another image, traditionally encounters security challenges when its methods become publicly known or are under attack.
Liu, Xuhua, Xu, Yitian, Yang, Hang
core
MVBIND: Self-Supervised Music Recommendation For Videos Via Embedding Space Binding
Recent years have witnessed the rapid development of short videos, which usually contain both visual and audio modalities. Background music is important to the short videos, which can significantly influence the emotions of the viewers.
Duan, Huiyu+4 more
core
Deep Learning-based Text-in-Image Watermarking
In this work, we introduce a novel deep learning-based approach to text-in-image watermarking, a method that embeds and extracts textual information within images to enhance data security and integrity.
Huang, Pei-Chi+3 more
core
Efficient 3D medical image segmentation algorithm over a secured multimedia network
Shadi Alzu'bi+3 more
semanticscholar +1 more source
Enhancing Expressiveness in Dance Generation via Integrating Frequency and Music Style Information
Dance generation, as a branch of human motion generation, has attracted increasing attention. Recently, a few works attempt to enhance dance expressiveness, which includes genre matching, beat alignment, and dance dynamics, from certain aspects. However,
Chen, Liyang+8 more
core
FashionReGen: LLM-Empowered Fashion Report Generation
Fashion analysis refers to the process of examining and evaluating trends, styles, and elements within the fashion industry to understand and interpret its current state, generating fashion reports.
Chua, Tat-Seng+5 more
core
Multi-source Knowledge Enhanced Graph Attention Networks for Multimodal Fact Verification
Multimodal fact verification is an under-explored and emerging field that has gained increasing attention in recent years. The goal is to assess the veracity of claims that involve multiple modalities by analyzing the retrieved evidence.
Cao, Han+3 more
core
Some of the next articles are maybe not open access.
Related searches:
Related searches:
Self-Supervised Learning for Multimedia Recommendation
IEEE transactions on multimedia, 2023Learning representations for multimedia content is critical for multimedia recommendation. Current representation learning methods roughly fall into two groups: (1) using the historical interactions to create ID embeddings of users and items, and (2 ...
Zhulin Tao+6 more
semanticscholar +1 more source
DualGNN: Dual Graph Neural Network for Multimedia Recommendation
IEEE transactions on multimedia, 2023One of the important factors affecting micro-video recommender systems is to model the multi-modal user preference on the micro-video. Despite the remarkable performance of prior arts, they are still limited by fusing the user preference derived from ...
Qifan Wang+6 more
semanticscholar +1 more source