Multimedia cs.mm - Open Access .click

Results 11 to 20 of about 281 (47)

Explaining Hierarchical Features in Dynamic Point Cloud Processing [PDF]

, 2022
This paper aims at bringing some light and understanding to the field of deep learning for dynamic point cloud processing. Specifically, we focus on the hierarchical features learning aspect, with the ultimate goal of understanding which features are ...
Gomes, Pedro, Rossi, Silvia, Toni, Laura
core +2 more sources

Encoder-Decoder-Based Intra-Frame Block Partitioning Decision

, 2023
The recursive intra-frame block partitioning decision process, a crucial component of the next-generation video coding standards, exerts significant influence over the encoding time.
Jiang, Yucheng +5 more
core

Using Set Covering to Generate Databases for Holistic Steganalysis

, 2022
Within an operational framework, covers used by a steganographer are likely to come from different sensors and different processing pipelines than the ones used by researchers for training their steganalysis models. Thus, a performance gap is unavoidable
Abecidan, Rony +4 more
core +2 more sources

Evaluation of Sampling Algorithms for a Pairwise Subjective Assessment Methodology

, 2023
Subjective assessment tests are often employed to evaluate image processing systems, notably image and video compression, super-resolution among others and have been used as an indisputable way to provide evidence of the performance of an algorithm or ...
Ascenso, Joao, Mohammadi, Shima
core

Unified Pretraining Target Based Video-music Retrieval With Music Rhythm And Video Optical Flow Information

, 2023
Background music (BGM) can enhance the video's emotion. However, selecting an appropriate BGM often requires domain knowledge. This has led to the development of video-music retrieval techniques.
Li, Dian +4 more
core

Encoding and Decoding Narratives: Datafication and Alternative Access Models for Audiovisual Archives

, 2023
Situated in the intersection of audiovisual archives, computational methods, and immersive interactions, this work probes the increasingly important accessibility issues from a two-fold approach.
Yang, Yuchen
core

Invertible Mosaic Image Hiding Network for Very Large Capacity Image Steganography

, 2023
The existing image steganography methods either sequentially conceal secret images or conceal a concatenation of multiple images. In such ways, the interference of information among multiple images will become increasingly severe when the number of ...
Bi, Xing +5 more
core

Dance2MIDI: Dance-driven multi-instruments music generation

, 2023
Dance-driven music generation aims to generate musical pieces conditioned on dance videos. Previous works focus on monophonic or raw audio generation, while the multiinstruments scenario is under-explored.
Han, Bo, Ren, Yi
core

An Implementation of Multimodal Fusion System for Intelligent Digital Human Generation

, 2023
With the rapid development of artificial intelligence (AI), digital humans have attracted more and more attention and are expected to achieve a wide range of applications in several industries.
Bi, Kaiyue +4 more
core

Text-oriented Modality Reinforcement Network for Multimodal Sentiment Analysis from Unaligned Multimodal Sequences

, 2023
Multimodal Sentiment Analysis (MSA) aims to mine sentiment information from text, visual, and acoustic modalities. Previous works have focused on representation learning and feature fusion strategies.
Chen, Jiawei +5 more
core