Results 11 to 20 of about 281 (47)
Explaining Hierarchical Features in Dynamic Point Cloud Processing [PDF]
This paper aims at bringing some light and understanding to the field of deep learning for dynamic point cloud processing. Specifically, we focus on the hierarchical features learning aspect, with the ultimate goal of understanding which features are ...
Gomes, Pedro, Rossi, Silvia, Toni, Laura
core +2 more sources
Encoder-Decoder-Based Intra-Frame Block Partitioning Decision
The recursive intra-frame block partitioning decision process, a crucial component of the next-generation video coding standards, exerts significant influence over the encoding time.
Jiang, Yucheng +5 more
core
Using Set Covering to Generate Databases for Holistic Steganalysis
Within an operational framework, covers used by a steganographer are likely to come from different sensors and different processing pipelines than the ones used by researchers for training their steganalysis models. Thus, a performance gap is unavoidable
Abecidan, Rony +4 more
core +2 more sources
Evaluation of Sampling Algorithms for a Pairwise Subjective Assessment Methodology
Subjective assessment tests are often employed to evaluate image processing systems, notably image and video compression, super-resolution among others and have been used as an indisputable way to provide evidence of the performance of an algorithm or ...
Ascenso, Joao, Mohammadi, Shima
core
Background music (BGM) can enhance the video's emotion. However, selecting an appropriate BGM often requires domain knowledge. This has led to the development of video-music retrieval techniques.
Li, Dian +4 more
core
Situated in the intersection of audiovisual archives, computational methods, and immersive interactions, this work probes the increasingly important accessibility issues from a two-fold approach.
Yang, Yuchen
core
Invertible Mosaic Image Hiding Network for Very Large Capacity Image Steganography
The existing image steganography methods either sequentially conceal secret images or conceal a concatenation of multiple images. In such ways, the interference of information among multiple images will become increasingly severe when the number of ...
Bi, Xing +5 more
core
Dance2MIDI: Dance-driven multi-instruments music generation
Dance-driven music generation aims to generate musical pieces conditioned on dance videos. Previous works focus on monophonic or raw audio generation, while the multiinstruments scenario is under-explored.
Han, Bo, Ren, Yi
core
An Implementation of Multimodal Fusion System for Intelligent Digital Human Generation
With the rapid development of artificial intelligence (AI), digital humans have attracted more and more attention and are expected to achieve a wide range of applications in several industries.
Bi, Kaiyue +4 more
core
Multimodal Sentiment Analysis (MSA) aims to mine sentiment information from text, visual, and acoustic modalities. Previous works have focused on representation learning and feature fusion strategies.
Chen, Jiawei +5 more
core

