Universal Organizer of SAM for Unsupervised Semantic Segmentation

open access: yes
Unsupervised semantic segmentation (USS) aims to achieve high-quality segmentation without manual pixel-level annotations. Existing USS models provide coarse category classification for regions, but the results often have blurry and imprecise edges ...
Cai, Xinhao   +5 more
core  

DKiS: Decay weight invertible image steganography with private key

open access: yes
Image steganography, defined as the practice of concealing information within another image, traditionally encounters security challenges when its methods become publicly known or are under attack.
Liu, Xuhua, Xu, Yitian, Yang, Hang
core  

MVBIND: Self-Supervised Music Recommendation For Videos Via Embedding Space Binding

open access: yes
Recent years have witnessed the rapid development of short videos, which usually contain both visual and audio modalities. Background music is important to the short videos, which can significantly influence the emotions of the viewers.
Duan, Huiyu   +4 more
core  

Deep Learning-based Text-in-Image Watermarking

open access: yes
In this work, we introduce a novel deep learning-based approach to text-in-image watermarking, a method that embeds and extracts textual information within images to enhance data security and integrity.
Huang, Pei-Chi   +3 more
core  

Efficient 3D medical image segmentation algorithm over a secured multimedia network

open access: yes
Multimedia tools and applications, 2020
Shadi Alzu'bi   +3 more
semanticscholar   +1 more source

Enhancing Expressiveness in Dance Generation via Integrating Frequency and Music Style Information

open access: yes
Dance generation, as a branch of human motion generation, has attracted increasing attention. Recently, a few works attempt to enhance dance expressiveness, which includes genre matching, beat alignment, and dance dynamics, from certain aspects. However,
Chen, Liyang   +8 more
core  

FashionReGen: LLM-Empowered Fashion Report Generation

open access: yes
Fashion analysis refers to the process of examining and evaluating trends, styles, and elements within the fashion industry to understand and interpret its current state, generating fashion reports.
Chua, Tat-Seng   +5 more
core  

Multi-source Knowledge Enhanced Graph Attention Networks for Multimodal Fact Verification

open access: yes
Multimodal fact verification is an under-explored and emerging field that has gained increasing attention in recent years. The goal is to assess the veracity of claims that involve multiple modalities by analyzing the retrieved evidence.
Cao, Han   +3 more
core  
Some of the following articles may not be open access.

Self-Supervised Learning for Multimedia Recommendation

IEEE transactions on multimedia, 2023
Learning representations for multimedia content is critical for multimedia recommendation. Current representation learning methods roughly fall into two groups: (1) using the historical interactions to create ID embeddings of users and items, and (2 ...
Zhulin Tao   +6 more
semanticscholar   +1 more source

DualGNN: Dual Graph Neural Network for Multimedia Recommendation

IEEE transactions on multimedia, 2023
One of the important factors affecting micro-video recommender systems is to model the multi-modal user preference on the micro-video. Despite the remarkable performance of prior arts, they are still limited by fusing the user preference derived from ...
Qifan Wang   +6 more
semanticscholar   +1 more source
