Results 21 to 30 of about 276,223 (97)
Latency Target based Analysis of the DASH.js Player
We analyse the low latency performance of the three Adaptive Bitrate (ABR) algorithms in the dash.js Dynamic Adaptive Streaming over HTTP (DASH) player with respect to a range of latency targets and configuration options.
Aslam, Adil, O'Hanlon, Piers
core +1 more source
Evaluation of Sampling Algorithms for a Pairwise Subjective Assessment Methodology
Subjective assessment tests are often employed to evaluate image processing systems, notably image and video compression, super-resolution among others and have been used as an indisputable way to provide evidence of the performance of an algorithm or ...
Ascenso, Joao, Mohammadi, Shima
core
Situated in the intersection of audiovisual archives, computational methods, and immersive interactions, this work probes the increasingly important accessibility issues from a two-fold approach.
Yang, Yuchen
core
Dance2MIDI: Dance-driven multi-instruments music generation
Dance-driven music generation aims to generate musical pieces conditioned on dance videos. Previous works focus on monophonic or raw audio generation, while the multiinstruments scenario is under-explored.
Han, Bo, Ren, Yi
core
Background music (BGM) can enhance the video's emotion. However, selecting an appropriate BGM often requires domain knowledge. This has led to the development of video-music retrieval techniques.
Li, Dian+4 more
core
Invertible Mosaic Image Hiding Network for Very Large Capacity Image Steganography
The existing image steganography methods either sequentially conceal secret images or conceal a concatenation of multiple images. In such ways, the interference of information among multiple images will become increasingly severe when the number of ...
Bi, Xing+5 more
core
Deep Learning Model for Multimedia Quality of Experience Prediction Based on Network Flow Packets [PDF]
[EN] Quality of experience (QoE) is the overall acceptability of an application or service, as perceived subjectively by the end user. In particular, for video quality the QoE is dependent of video transmission parameters.
Carro, Belén+4 more
core +1 more source
An Implementation of Multimodal Fusion System for Intelligent Digital Human Generation
With the rapid development of artificial intelligence (AI), digital humans have attracted more and more attention and are expected to achieve a wide range of applications in several industries.
Bi, Kaiyue+4 more
core
Multimodal Sentiment Analysis (MSA) aims to mine sentiment information from text, visual, and acoustic modalities. Previous works have focused on representation learning and feature fusion strategies.
Chen, Jiawei+5 more
core
Complete Cross-triplet Loss in Label Space for Audio-visual Cross-modal Retrieval
The heterogeneity gap problem is the main challenge in cross-modal retrieval. Because cross-modal data (e.g. audiovisual) have different distributions and representations that cannot be directly compared. To bridge the gap between audiovisual modalities,
Ikeda, Kazushi+3 more
core