Results 21 to 30 of about 14,332 (195)
A study on constraining Connectionist Temporal Classification for temporal audio alignment
Connectionist Temporal Classification (CTC) has become a standard for deep learning-based temporal alignment allowing relevant probabilistic distributions to be learned. However, by nature, CTC is a transcription objective that can be minimized without guaranteeing any alignment properties.
Teytaut, Yann +2 more
openaire +2 more sources
Sequence labeling is a common machine-learning task which not only needs the most likely prediction of label for a local input but also seeks the most suitable annotation for the whole input sequence.
Xiaohui Huang +4 more
doaj +1 more source
We solve the problem of how to densely align actions in videos at frame level, with only the order of occurring actions available, in order to save the time-consuming efforts to accurately annotate the temporal boundaries of each action. We propose three
Lin Wang +3 more
doaj +1 more source
Nasal Speech Sounds Detection Using Connectionist Temporal Classification [PDF]
Phone attributes, known also as distinctive or phonological features, belong to important classification of the speech sounds used in automatic speech processing. Training of conventional phone attribute detectors (classifiers), either based on acoustic measurements or deep learning approaches, requires decent phone boundary segmentation.
Milos Cernak, Sibo Tong
openaire +1 more source
Training LDCRF model on unsegmented sequences using connectionist temporal classification [PDF]
Many machine learning problems such as speech recognition, gesture recognition, and handwriting recognition are concerned with simultaneous segmentation and labeling of sequence data. Latent-dynamic conditional random field (LDCRF) is a well-known discriminative method that has been successfully used for this task.
Atashin, Amir Ahooye +2 more
openaire +2 more sources
A Helium Speech Unscrambling Algorithm Based on Deep Learning
Helium speech, the language spoken by divers in the deep sea who breathe a high-pressure helium–oxygen mixture, is almost unintelligible. To accurately unscramble helium speech, a neural network based on deep learning is proposed.
Yonghong Chen, Shibing Zhang
doaj +1 more source
Towards multilingual end‐to‐end speech recognition for air traffic control
In this work, an end‐to‐end framework is proposed to achieve multilingual automatic speech recognition (ASR) in air traffic control (ATC) systems. Considering the standard ATC procedure, a recurrent neural network (RNN) based framework is selected to ...
Yi Lin, Bo Yang, Dongyue Guo, Peng Fan
doaj +1 more source
Word Beam Search: A Connectionist Temporal Classification Decoding Algorithm [PDF]
Recurrent Neural Networks (RNNs) are used for sequence recognition tasks such as Handwritten Text Recognition (HTR) or speech recognition. If trained with the Connectionist Temporal Classification (CTC) loss function, the output of such a RNN is a matrix containing character probabilities for each time-step.
Harald Scheidl +2 more
openaire +1 more source
Decoding Handwriting Trajectories from Intracortical Brain Signals for Brain-to-Text Communication. [PDF]
By developing a novel framework that optimizes both shape and temporal loss during decoder training, the authors successfully reconstruct human‐recognizable handwriting trajectories from intracortical neural signals for both Chinese characters and English letters, effectively resolving the temporal misalignment problem in clinical BCIs, thereby ...
Xu G +6 more
europepmc +2 more sources
Connectionist natural language parsing [PDF]
The key developments of two decades of connectionist parsing are reviewed. Connectionist parsers are assessed according to their ability to learn to represent syntactic structures from examples automatically, without being presented with symbolic grammar
Berg +52 more
core +1 more source

