Speech signal - Open Access .click

Results 21 to 30 of about 86,691 (269)

Wideband TDoA Positioning Exploiting RSS-Based Clustering

Sensors, 2023
The accuracy of radio-based positioning is heavily influenced by a dense multipath (DM) channel, leading to poor position accuracy. The DM affects both time of flight (ToF) measurements extracted from wideband (WB) signals—specifically, if the bandwidth ...
Andreas Fuchs +4 more
doaj +1 more source

A Full Loading-Based MVDR Beamforming Method by Backward Correction of the Steering Vector and Reconstruction of the Covariance Matrix

Applied Sciences, 2022
In order to improve the performance of the diagonal loading-based minimum variance distortionless response (MVDR) beamformer, a full loading-based MVDR beamforming method is proposed in this paper. Different from the conventional diagonal loading methods,
Jing Zhou, Changchun Bao
doaj +1 more source

Deep Learning and Bidirectional Optical Flow Based Viewport Predictions for 360° Video Coding

IEEE Access, 2022
The rapid development of virtual reality applications continues to urge better compression of 360° videos owing to the large volume of content.
Jayasingam Adhuran, Gosala Kulupana, Anil Fernando +2 more
doaj +1 more source

IoT-Stream: A Lightweight Ontology for Internet of Things Data Streams and Its Use with Data Analytics and Event Detection Services

Sensors, 2020
With the proliferation of sensors and IoT technologies, stream data are increasingly stored and analysed, but rarely combined, due to the heterogeneity of sources and technologies.
Tarek Elsaleh +5 more
doaj +1 more source

Application of Tensor Train Decomposition in S2VT Model for Sign Language Recognition

IEEE Access, 2021
Sign language recognition is a conversion of sign language into text or speech, bridging the communication between the hearing and society. Recently, sequence-to-sequence video to text (S2VT) models has been employed in the field of sign language ...
Biao Xu, Shiliang Huang, Zhongfu Ye
doaj +1 more source

Off-Grid DOA Estimation Aiding Virtual Extension of Coprime Arrays Exploiting Fourth Order Difference Co-Array With Interpolation

IEEE Access, 2018
In this paper, a novel array structure exploiting coprime arrays is proposed which can be very proficient to determine the number of consecutive lags in proportion with the number of array elements.
Tarek Hasan Al Mahmud +4 more
doaj +1 more source

Automated audio captioning: an overview of recent progress and new challenges

EURASIP Journal on Audio, Speech, and Music Processing, 2022
Automated audio captioning is a cross-modal translation task that aims to generate natural language descriptions for given audio clips. This task has received increasing attention with the release of freely available datasets in recent years. The problem
Xinhao Mei +3 more
doaj +1 more source

A validated finite element model for room acoustic treatments with edge absorbers

Acta Acustica, 2023
Porous acoustic absorbers have excellent properties in the low-frequency range when positioned in room edges, therefore they are a common method for reducing low-frequency reverberation.
Kraxberger Florian +5 more
doaj +1 more source

The pursuit of invariance in speech signals [PDF]

The Journal of the Acoustical Society of America, 1983
The search for the acoustic properties useful to the listener in extracting the linguistic message from a speech signal is often construed as the task of matching invariant physical properties to invariant phonological percepts; the discovery of the former will explain the latter.
openaire +2 more sources

Dataset of directional room impulse responses for realistic speech data

Data in Brief
Obtaining real-world multi-channel speech recordings is expensive and time-consuming. Therefore, multi-channel recordings are often artificially generated by convolving existing monaural speech recordings with simulated Room Impulse Responses (RIRs) from
Stefan Fragner +3 more
doaj +1 more source

humans
speech
deep learning

speech enhancement
fos: computer and information sciences
speech perception

hearing
audio and speech processing eess.as
sound cs.sd