Results 31 to 40 of about 45,674 (282)
This paper proposes a new collaborative and inclusive model for Knowledge Organization Systems (KOS) for sustaining cultural heritage and language diversity.
Amel Fraisse +7 more
doaj +1 more source
For the past few years, artificial neural networks (ANNs) have been one of the most common solutions relied upon while developing automated speech recognition (ASR) acoustic models. There are several variants of ANNs, such as deep neural networks (DNNs),
Steve Olsen Michael +2 more
doaj +1 more source
Deep neural networks for automatic speech processing: a survey from large corpora to limited data
Most state-of-the-art speech systems use deep neural networks (DNNs). These systems require a large amount of data to be learned. Hence, training state-of-the-art frameworks on under-resourced speech challenges are difficult tasks.
Vincent Roger +2 more
doaj +1 more source
This article presents a systematic review of research on language and multilingualism in mathematics education published in the South African journal Pythagoras from 1994 to 2021.
Kathryn McLachlan, Anthony A. Essien
doaj +1 more source
Automated text simplification as a preprocessing step for machine translation into an under-resourced language [PDF]
In this work, we investigate the possibility of using fully automatic text simplification system on the English source in machine translation (MT) for improving its translation into an under-resourced language.
Popović, Maja, Štajner, Sanja
core +1 more source
Mismatched Crowdsourcing based Language Perception for Under-resourced Languages
AbstractMismatched crowdsourcing is a technique for acquiring automatic speech recognizer training data in under-resourced languages by decoding the transcriptions of workers who don’t know the target language using a noisy-channel model of cross-language speech perception.
Chen, Wenda +2 more
openaire +1 more source
Datasets for South African Languages: Bilingual Aligned and Monolingual Data for Machine Translation
This data paper describes machine translation datasets built for the Autshumato project. The datasets contain both bilingual aligned data between English and all other official written languages of South Africa, namely Afrikaans (ISO 639-3: afr ...
Tanja Gaustad +2 more
doaj +1 more source
The Usefulness of Imperfect Speech Data for ASR Development in Low-Resource Languages
When the National Centre for Human Language Technology (NCHLT) Speech corpus was released, it created various opportunities for speech technology development in the 11 official, but critically under-resourced, languages of South Africa.
Jaco Badenhorst, Febe de Wet
doaj +1 more source
This article reports some of the main achievements of the EU-funded PRINCIPLE project in collecting high-quality language resources (LRs) in the legal domain for four under-resourced European languages, namely Croatian, Irish, Norwegian and Icelandic ...
Federico Gaspari +17 more
doaj +1 more source
The Cardamom workbench for historical and under-resourced languages
This paper describes the creation of a workbench tool designed to make technologies developed throughout the lifespan of the Cardamom project easily accessible to researchers who could most benefit from them, but who may not have the technical expertise to apply bleeding edge technologies to their own datasets.
Doyle, Adrian +5 more
openaire +3 more sources

