Speaker recognition - Open Access .click

Results 21 to 30 of about 2,263,293 (338)

In defence of metric learning for speaker recognition [PDF]

Interspeech, 2020
The objective of this paper is 'open-set' speaker recognition of unseen speakers, where ideal embeddings should be able to condense information into a compact utterance-level representation that has small intra-speaker and large inter-speaker distance ...
Joon Son Chung +9 more
semanticscholar +1 more source

U-Vectors: Generating Clusterable Speaker Embedding from Unlabeled Data

Applied Sciences, 2021
Speaker recognition deals with recognizing speakers by their speech. Most speaker recognition systems are built upon two stages, the first stage extracts low dimensional correlation embeddings from speech, and the second performs the classification task.
Muhammad Firoz Mridha +5 more
doaj +1 more source

Why does Self-Supervised Learning for Speech Recognition Benefit Speaker Recognition? [PDF]

Interspeech, 2022
Recently, self-supervised learning (SSL) has demonstrated strong performance in speaker recognition, even if the pre-training objective is designed for speech recognition.
Sanyuan Chen +10 more
semanticscholar +1 more source

Survey of Speaker Adaptation Methods in Speech Recognition

Jisuanji kexue yu tansuo, 2021
Speech is one of the ways of human-computer interaction, and speech recognition technology is an important part of artificial intelligence. In recent years, the application of neural network technology in the field of speech recognition has developed ...
ZHU Fangyuan, MA Zhiqiang, CHEN Yan, ZHANG Xiaoxu, WANG Hongbin, BAO Caijilahu
doaj +1 more source

Supervised Attention for Speaker Recognition [PDF]

2021 IEEE Spoken Language Technology Workshop (SLT), 2021
The recently proposed self-attentive pooling (SAP) has shown good performance in several speaker recognition systems. In SAP systems, the context vector is trained end-to-end together with the feature extractor, where the role of context vector is to select the most discriminative frames for speaker recognition.
Seong Min Kye, Joon Son Chung, Hoirin Kim +2 more
openaire +2 more sources

Self-Supervised Speaker Recognition with Loss-Gated Learning [PDF]

IEEE International Conference on Acoustics, Speech, and Signal Processing, 2021
In self-supervised learning for speaker recognition, pseudo labels are useful as the supervision signals. It is a known fact that a speaker recognition model doesn’t always benefit from pseudo labels due to their unreliability.
Ruijie Tao +4 more
semanticscholar +1 more source

Speaker recognition based on deep learning: An overview [PDF]

Neural Networks, 2020
Speaker recognition is a task of identifying persons from their voices. Recently, deep learning has dramatically revolutionized speaker recognition. However, there is lack of comprehensive reviews on the exciting progress.
Zhongxin Bai, Xiao-Lei Zhang
semanticscholar +1 more source

Speaker Recognition in the Wild

CoRR, 2022
This paper was submitted to Interspeech ...
Neeraj Chhimwal +6 more
openaire +2 more sources

Analyzing Noise Robustness of Cochleogram and Mel Spectrogram Features in Deep Learning Based Speaker Recognition

Applied Sciences, 2022
The performance of speaker recognition systems is very well on the datasets without noise and mismatch. However, the performance gets degraded with the environmental noises, channel variation, physical and behavioral changes in speaker.
Wondimu Lambamo, Ramasamy Srinivasagan, Worku Jifara +2 more
doaj +1 more source

Multi-View Self-Attention Based Transformer for Speaker Recognition [PDF]

IEEE International Conference on Acoustics, Speech, and Signal Processing, 2021
Initially developed for natural language processing (NLP), Transformer model is now widely used for speech processing tasks such as speaker recognition, due to its powerful sequence modeling capabilities.
Rui Wang +7 more
semanticscholar +1 more source

computer science
engineering
deep learning

computer science - sound
sound cs.sd
fos: computer and information sciences

audio and speech processing eess.as
speaker identification
speech recognition