Under-resourced languages - Open Access .click

Results 31 to 40 of about 37,865 (281)

MiNgMatch—A Fast N-gram Model for Word Segmentation of the Ainu Language

Information, 2019
Word segmentation is an essential task in automatic language processing for languages where there are no explicit word boundary markers, or where space-delimited orthographic words are too coarse-grained.
Karol Nowakowski, Michal Ptaszynski, Fumito Masui +2 more
doaj +1 more source

Preliminary Evaluation of Convolutional Neural Network Acoustic Model for Iban Language Using NVIDIA NeMo

Journal of Telecommunications and Information Technology, 2022
For the past few years, artificial neural networks (ANNs) have been one of the most common solutions relied upon while developing automated speech recognition (ASR) acoustic models. There are several variants of ANNs, such as deep neural networks (DNNs),
Steve Olsen Michael, Sarah Samson Juan , Edwin Mit +2 more
doaj +1 more source

Mismatched Crowdsourcing based Language Perception for Under-resourced Languages

Procedia Computer Science, 2016
AbstractMismatched crowdsourcing is a technique for acquiring automatic speech recognizer training data in under-resourced languages by decoding the transcriptions of workers who don’t know the target language using a noisy-channel model of cross-language speech perception.
Wenda Chen, Mark Hasegawa-Johnson, Nancy F. Chen +2 more
openaire +1 more source

Hope Speech detection in under-resourced Kannada language

CoRR, 2021
@article{hande-etal-kanhope, title = "Hope Speech detection in under-resourced Kannada language", author = "Hande, Adeep and Priyadharshini, Ruba and Sampath, Anbukkarasi and Thamburaj, Kingston Pal and Chandran, Prabakaran and Chakravarthi, Bharathi Raja ", journal={SN Computer Science}, publisher={Springer} }
Hande, Adeep +5 more
openaire +3 more sources

Semiautomatic Speech Alignment for Under-Resourced Languages [PDF]

, 2022
Cross-language forced alignment is a solution for linguists who create speech corpora for very low-resource languages. However, cross-language is an additional challenge making a complex task, forced alignment, even more difficult. We study how linguists
Virpioja, Sami +3 more
core

A Sustainable and Open Access Knowledge Organization Model to Preserve Cultural Heritage and Language Diversity

Information, 2019
This paper proposes a new collaborative and inclusive model for Knowledge Organization Systems (KOS) for sustaining cultural heritage and language diversity.
Amel Fraisse +7 more
doaj +1 more source

Deep neural networks for automatic speech processing: a survey from large corpora to limited data

EURASIP Journal on Audio, Speech, and Music Processing, 2022
Most state-of-the-art speech systems use deep neural networks (DNNs). These systems require a large amount of data to be learned. Hence, training state-of-the-art frameworks on under-resourced speech challenges are difficult tasks.
Vincent Roger, Jérôme Farinas, Julien Pinquier +2 more
doaj +1 more source

Language and multilingualism in the teaching and learning of mathematics in South Africa: A review of literature in Pythagoras from 1994 to 2021

Pythagoras, 2022
This article presents a systematic review of research on language and multilingualism in mathematics education published in the South African journal Pythagoras from 1994 to 2021.
Kathryn McLachlan, Anthony A. Essien
doaj +1 more source

Investigating the quality of static anchor embeddings from transformers for under-resourced languages

, 2022
This paper reports on experiments for cross-lingual transfer using the anchor-based approach of Schuster et al. (2019) for English and a low-resourced language, namely Hindi.
Lefever, ElsLW220019930142378020002650670000-0002-7755-0591F98A820A-F0ED-11E1-A9DE-61C894A0A6B4 +5 more
core

Datasets for South African Languages: Bilingual Aligned and Monolingual Data for Machine Translation

Journal of Open Humanities Data
This data paper describes machine translation datasets built for the Autshumato project. The datasets contain both bilingual aligned data between English and all other official written languages of South Africa, namely Afrikaans (ISO 639-3: afr ...
Tanja Gaustad +2 more
doaj +1 more source

natural language processing
low-resource languages
deep learning

automatic speech recognition
language corpora
machine translation

sentiment analysis
speech recognition
south african languages