Results 21 to 30 of about 2,263,293 (338)
In defence of metric learning for speaker recognition [PDF]
The objective of this paper is 'open-set' speaker recognition of unseen speakers, where ideal embeddings should be able to condense information into a compact utterance-level representation that has small intra-speaker and large inter-speaker distance ...
Joon Son Chung +9 more
semanticscholar +1 more source
U-Vectors: Generating Clusterable Speaker Embedding from Unlabeled Data
Speaker recognition deals with recognizing speakers by their speech. Most speaker recognition systems are built upon two stages, the first stage extracts low dimensional correlation embeddings from speech, and the second performs the classification task.
Muhammad Firoz Mridha +5 more
doaj +1 more source
Why does Self-Supervised Learning for Speech Recognition Benefit Speaker Recognition? [PDF]
Recently, self-supervised learning (SSL) has demonstrated strong performance in speaker recognition, even if the pre-training objective is designed for speech recognition.
Sanyuan Chen +10 more
semanticscholar +1 more source
Survey of Speaker Adaptation Methods in Speech Recognition
Speech is one of the ways of human-computer interaction, and speech recognition technology is an important part of artificial intelligence. In recent years, the application of neural network technology in the field of speech recognition has developed ...
ZHU Fangyuan, MA Zhiqiang, CHEN Yan, ZHANG Xiaoxu, WANG Hongbin, BAO Caijilahu
doaj +1 more source
Supervised Attention for Speaker Recognition [PDF]
The recently proposed self-attentive pooling (SAP) has shown good performance in several speaker recognition systems. In SAP systems, the context vector is trained end-to-end together with the feature extractor, where the role of context vector is to select the most discriminative frames for speaker recognition.
Seong Min Kye +2 more
openaire +2 more sources
Self-Supervised Speaker Recognition with Loss-Gated Learning [PDF]
In self-supervised learning for speaker recognition, pseudo labels are useful as the supervision signals. It is a known fact that a speaker recognition model doesn’t always benefit from pseudo labels due to their unreliability.
Ruijie Tao +4 more
semanticscholar +1 more source
Speaker recognition based on deep learning: An overview [PDF]
Speaker recognition is a task of identifying persons from their voices. Recently, deep learning has dramatically revolutionized speaker recognition. However, there is lack of comprehensive reviews on the exciting progress.
Zhongxin Bai, Xiao-Lei Zhang
semanticscholar +1 more source
Speaker Recognition in the Wild
This paper was submitted to Interspeech ...
Neeraj Chhimwal +6 more
openaire +2 more sources
The performance of speaker recognition systems is very well on the datasets without noise and mismatch. However, the performance gets degraded with the environmental noises, channel variation, physical and behavioral changes in speaker.
Wondimu Lambamo +2 more
doaj +1 more source
Multi-View Self-Attention Based Transformer for Speaker Recognition [PDF]
Initially developed for natural language processing (NLP), Transformer model is now widely used for speech processing tasks such as speaker recognition, due to its powerful sequence modeling capabilities.
Rui Wang +7 more
semanticscholar +1 more source

