
MASS: Microphone Array Speech Simulator in Room Acoustic Environment for Multi-Channel Speech Coding and Enhancement

open access: yes, Applied Sciences, 2020
Multi-channel speech coding and enhancement is an indispensable technology in speech communication. In order to verify the effectiveness of multi-channel speech coding and enhancement methods in research and development, a microphone array speech ...
Rui Cheng, Changchun Bao, Zihao Cui
doaj   +1 more source

Audio signal transmission method in network-based audio analytics system

open access: yes, Сучасний стан наукових досліджень та технологій в промисловості, 2023
The subject matter of the article is an audio signal transmission method in a network-based audio analytics system. The creation of a network-based audio analytics system leads to the emergence of new classes of load sources that transmit packetized sound ...
Антон Порошенко   +1 more
doaj   +3 more sources

Feature Augmenting Networks for Improving Depression Severity Estimation From Speech Signals

open access: yes, IEEE Access, 2020
Depression disorder has become one of the major psychological diseases endangering human health. Researchers in the affective computing community are supporting the development of reliable depression severity estimation systems, from multiple modalities ...
Le Yang, Dongmei Jiang, Hichem Sahli
doaj   +1 more source

A zero‐watermarking technique based on i‐vector model for audio copyright protection

open access: yes, Electronics Letters, 2023
Audio zero‐watermarking is a promising technology for audio copyright protection. It does not modify the content of the original carrier and has good imperceptibility.
Longting Xu, Mingrui He, Xing Guo
doaj   +1 more source

Active Noise Control over Space: A Subspace Method for Performance Analysis

open access: yes, Applied Sciences, 2019
In this paper, we investigate the maximum active noise control performance over a three-dimensional (3-D) spatial space, for a given set of secondary sources in a particular environment.
Jihui Zhang   +3 more
doaj   +1 more source

Deep convolutional neural networks for double compressed AMR audio detection

open access: yes, IET Signal Processing, 2021
Detection of double compressed (DC) adaptive multi‐rate (AMR) audio recordings is a challenging audio forensic problem and has received great attention in recent years. Here, the authors propose to use convolutional neural networks (CNN) for DC AMR audio
Aykut Büker, Cemal Hanilçi
doaj   +1 more source

Audio-visual speech recognition with background music using single-channel source separation [PDF]

open access: yes, 2012
In this paper, we consider audio-visual speech recognition with background music. The proposed algorithm is an integration of audio-visual speech recognition and single channel source separation (SCSS). We apply the proposed algorithm to recognize spoken
Erdogan, Hakan   +4 more
core   +1 more source

GestureVLAD: Combining Unsupervised Features Representation and Spatio-Temporal Aggregation for Doppler-Radar Gesture Recognition

open access: yes, IEEE Access, 2019
In this paper we propose a novel framework to process Doppler-radar signals for hand gesture recognition. Doppler-radar sensors provide many advantages over other emerging sensing modalities, including low development costs and high sensitivity to ...
Abel Diaz Berenguer   +5 more
doaj   +1 more source

An Audio-Visual Separation Model Integrating Dual-Channel Attention Mechanism

open access: yes, IEEE Access, 2023
Sound source separation is the separation of targeted sounds from a noisy environment, which plays an important role in signal processing and has been studied extensively.
Yutao Zhang, Kaixing Wu, Mengfan Zhao
doaj   +1 more source

APPLICATION OF PARTIAL LEAST SQUARES REGRESSION FOR AUDIO-VISUAL SPEECH PROCESSING AND MODELING [PDF]

open access: yes, Научно-технический вестник информационных технологий, механики и оптики, 2015
Subject of Research. The paper deals with the problem of reconstructing lip region images from a speech signal by means of Partial Least Squares regression. Such problems arise in connection with the development of audio-visual speech processing methods.
A. L. Oleinik
doaj   +1 more source
