Speaker recognition - Open Access .click

Efficient Invariant Features for Sensor Variability Compensation in Speaker Recognition

Sensors, 2014
In this paper, we investigate the use of invariant features for speaker recognition. Owing to their characteristics, these features are introduced to cope with the difficult and challenging problem of sensor variability and the source of performance ...
Abdennour Alimohad, Ahmed Bouridane, Abderrezak Guessoum +2 more
doaj +3 more sources

Cost-Sensitive Learning for Emotion Robust Speaker Recognition

The Scientific World Journal, 2014
In the field of information security, voice is one of the most important biometric traits. In particular, with the development of voice communication through the Internet or telephone systems, huge voice data resources are accessed. In speaker recognition,
Dongdong Li, Yingchun Yang, Weihui Dai
doaj +2 more sources

A Proposed Speaker Recognition Method Based on Long-Term Voice Features and Fuzzy Logic

Engineering and Technology Journal, 2021
Speaker recognition depends on specific predefined steps. The most important steps are feature extraction and feature matching. In addition, the category of the speaker's voice features has an impact on the recognition process.
Iman H. Hadi, Alia K. Abdul-Hassan
doaj +1 more source

Disentangling Voice and Content with Self-Supervision for Speaker Recognition

Neural Information Processing Systems, 2023
For speaker recognition, it is difficult to extract an accurate speaker representation from speech because of its mixture of speaker traits and content. This paper proposes a disentanglement framework that simultaneously models speaker traits and content
Tianchi Liu +3 more
semanticscholar +1 more source

Pushing the limits of raw waveform speaker recognition

Interspeech, 2022
In recent years, speaker recognition systems based on raw waveform inputs have received increasing attention. However, the performance of such systems is typically inferior to that of state-of-the-art handcrafted feature-based counterparts, which ...
Jee-weon Jung +5 more
semanticscholar +1 more source

Multimodal Emotion Recognition using Transfer Learning from Speaker Recognition and BERT-based models

The Speaker and Language Recognition Workshop, 2022
Automatic emotion recognition plays a key role in computer-human interaction as it has the potential to enrich the next-generation artificial intelligence with emotional intelligence.
Sarala Padi +3 more
semanticscholar +1 more source

The 2021 NIST Speaker Recognition Evaluation

The Speaker and Language Recognition Workshop, 2022
The 2021 Speaker Recognition Evaluation (SRE21) was the latest cycle of the ongoing evaluation series conducted by the U.S. National Institute of Standards and Technology (NIST) since 1996.
S. O. Sadjadi +4 more
semanticscholar +1 more source

Fine-Tuning Wav2Vec2 for Speaker Recognition

IEEE International Conference on Acoustics, Speech, and Signal Processing, 2021
This paper explores applying the wav2vec2 framework to speaker recognition instead of speech recognition. We study the effectiveness of the pre-trained weights on the speaker recognition task, and how to pool the wav2vec2 output sequence into a fixed ...
Nik Vaessen, D. V. Leeuwen
semanticscholar +1 more source

Bias in Automated Speaker Recognition

Conference on Fairness, Accountability and Transparency, 2022
Automated speaker recognition uses data processing to identify speakers by their voice. Today, automated speaker recognition is deployed on billions of smart devices and in services such as call centres.
Wiebke Toussaint, A. Ding
semanticscholar +1 more source

Streaming End-to-End Target-Speaker Automatic Speech Recognition and Activity Detection

IEEE Access, 2023
Automatic speech recognition of a target speaker in the presence of interfering speakers remains a challenging issue. One approach to tackle this problem is target-speaker speech recognition, which conditions the recognition process on an embedding that ...
Takafumi Moriya +4 more
doaj +1 more source
