Results 21 to 30 of about 2,263,293 (338)

In defence of metric learning for speaker recognition [PDF]

open access: yesInterspeech, 2020
The objective of this paper is 'open-set' speaker recognition of unseen speakers, where ideal embeddings should be able to condense information into a compact utterance-level representation that has small intra-speaker and large inter-speaker distance ...
Joon Son Chung   +9 more
semanticscholar   +1 more source

U-Vectors: Generating Clusterable Speaker Embedding from Unlabeled Data

open access: yesApplied Sciences, 2021
Speaker recognition deals with recognizing speakers by their speech. Most speaker recognition systems are built upon two stages, the first stage extracts low dimensional correlation embeddings from speech, and the second performs the classification task.
Muhammad Firoz Mridha   +5 more
doaj   +1 more source

Why does Self-Supervised Learning for Speech Recognition Benefit Speaker Recognition? [PDF]

open access: yesInterspeech, 2022
Recently, self-supervised learning (SSL) has demonstrated strong performance in speaker recognition, even if the pre-training objective is designed for speech recognition.
Sanyuan Chen   +10 more
semanticscholar   +1 more source

Survey of Speaker Adaptation Methods in Speech Recognition

open access: yesJisuanji kexue yu tansuo, 2021
Speech is one of the ways of human-computer interaction, and speech recognition technology is an important part of artificial intelligence. In recent years, the application of neural network technology in the field of speech recognition has developed ...
ZHU Fangyuan, MA Zhiqiang, CHEN Yan, ZHANG Xiaoxu, WANG Hongbin, BAO Caijilahu
doaj   +1 more source

Supervised Attention for Speaker Recognition [PDF]

open access: yes2021 IEEE Spoken Language Technology Workshop (SLT), 2021
The recently proposed self-attentive pooling (SAP) has shown good performance in several speaker recognition systems. In SAP systems, the context vector is trained end-to-end together with the feature extractor, where the role of context vector is to select the most discriminative frames for speaker recognition.
Seong Min Kye   +2 more
openaire   +2 more sources

Self-Supervised Speaker Recognition with Loss-Gated Learning [PDF]

open access: yesIEEE International Conference on Acoustics, Speech, and Signal Processing, 2021
In self-supervised learning for speaker recognition, pseudo labels are useful as the supervision signals. It is a known fact that a speaker recognition model doesn’t always benefit from pseudo labels due to their unreliability.
Ruijie Tao   +4 more
semanticscholar   +1 more source

Speaker recognition based on deep learning: An overview [PDF]

open access: yesNeural Networks, 2020
Speaker recognition is a task of identifying persons from their voices. Recently, deep learning has dramatically revolutionized speaker recognition. However, there is lack of comprehensive reviews on the exciting progress.
Zhongxin Bai, Xiao-Lei Zhang
semanticscholar   +1 more source

Speaker Recognition in the Wild

open access: yesCoRR, 2022
This paper was submitted to Interspeech ...
Neeraj Chhimwal   +6 more
openaire   +2 more sources

Analyzing Noise Robustness of Cochleogram and Mel Spectrogram Features in Deep Learning Based Speaker Recognition

open access: yesApplied Sciences, 2022
The performance of speaker recognition systems is very well on the datasets without noise and mismatch. However, the performance gets degraded with the environmental noises, channel variation, physical and behavioral changes in speaker.
Wondimu Lambamo   +2 more
doaj   +1 more source

Multi-View Self-Attention Based Transformer for Speaker Recognition [PDF]

open access: yesIEEE International Conference on Acoustics, Speech, and Signal Processing, 2021
Initially developed for natural language processing (NLP), Transformer model is now widely used for speech processing tasks such as speaker recognition, due to its powerful sequence modeling capabilities.
Rui Wang   +7 more
semanticscholar   +1 more source

Home - About - Disclaimer - Privacy