Results 21 to 30 of about 276,223 (97)

Latency Target based Analysis of the DASH.js Player

open access: yes, 2023
We analyse the low latency performance of the three Adaptive Bitrate (ABR) algorithms in the dash.js Dynamic Adaptive Streaming over HTTP (DASH) player with respect to a range of latency targets and configuration options.
Aslam, Adil, O'Hanlon, Piers
core   +1 more source

Evaluation of Sampling Algorithms for a Pairwise Subjective Assessment Methodology

open access: yes, 2023
Subjective assessment tests are often employed to evaluate image processing systems, notably image and video compression, super-resolution among others and have been used as an indisputable way to provide evidence of the performance of an algorithm or ...
Ascenso, Joao, Mohammadi, Shima
core  

Encoding and Decoding Narratives: Datafication and Alternative Access Models for Audiovisual Archives

open access: yes, 2023
Situated in the intersection of audiovisual archives, computational methods, and immersive interactions, this work probes the increasingly important accessibility issues from a two-fold approach.
Yang, Yuchen
core  

Dance2MIDI: Dance-driven multi-instruments music generation

open access: yes, 2023
Dance-driven music generation aims to generate musical pieces conditioned on dance videos. Previous works focus on monophonic or raw audio generation, while the multiinstruments scenario is under-explored.
Han, Bo, Ren, Yi
core  

Unified Pretraining Target Based Video-music Retrieval With Music Rhythm And Video Optical Flow Information

open access: yes, 2023
Background music (BGM) can enhance the video's emotion. However, selecting an appropriate BGM often requires domain knowledge. This has led to the development of video-music retrieval techniques.
Li, Dian   +4 more
core  

Invertible Mosaic Image Hiding Network for Very Large Capacity Image Steganography

open access: yes, 2023
The existing image steganography methods either sequentially conceal secret images or conceal a concatenation of multiple images. In such ways, the interference of information among multiple images will become increasingly severe when the number of ...
Bi, Xing   +5 more
core  

Deep Learning Model for Multimedia Quality of Experience Prediction Based on Network Flow Packets [PDF]

open access: yes, 2018
[EN] Quality of experience (QoE) is the overall acceptability of an application or service, as perceived subjectively by the end user. In particular, for video quality the QoE is dependent of video transmission parameters.
Carro, Belén   +4 more
core   +1 more source

An Implementation of Multimodal Fusion System for Intelligent Digital Human Generation

open access: yes, 2023
With the rapid development of artificial intelligence (AI), digital humans have attracted more and more attention and are expected to achieve a wide range of applications in several industries.
Bi, Kaiyue   +4 more
core  

Text-oriented Modality Reinforcement Network for Multimodal Sentiment Analysis from Unaligned Multimodal Sequences

open access: yes, 2023
Multimodal Sentiment Analysis (MSA) aims to mine sentiment information from text, visual, and acoustic modalities. Previous works have focused on representation learning and feature fusion strategies.
Chen, Jiawei   +5 more
core  

Complete Cross-triplet Loss in Label Space for Audio-visual Cross-modal Retrieval

open access: yes, 2022
The heterogeneity gap problem is the main challenge in cross-modal retrieval. Because cross-modal data (e.g. audiovisual) have different distributions and representations that cannot be directly compared. To bridge the gap between audiovisual modalities,
Ikeda, Kazushi   +3 more
core  

Home - About - Disclaimer - Privacy