Results 31 to 40 of about 37,865 (281)

MiNgMatch—A Fast N-gram Model for Word Segmentation of the Ainu Language

open access: yesInformation, 2019
Word segmentation is an essential task in automatic language processing for languages where there are no explicit word boundary markers, or where space-delimited orthographic words are too coarse-grained.
Karol Nowakowski   +2 more
doaj   +1 more source

Preliminary Evaluation of Convolutional Neural Network Acoustic Model for Iban Language Using NVIDIA NeMo

open access: yesJournal of Telecommunications and Information Technology, 2022
For the past few years, artificial neural networks (ANNs) have been one of the most common solutions relied upon while developing automated speech recognition (ASR) acoustic models. There are several variants of ANNs, such as deep neural networks (DNNs),
Steve Olsen Michael   +2 more
doaj   +1 more source

Mismatched Crowdsourcing based Language Perception for Under-resourced Languages

open access: yesProcedia Computer Science, 2016
AbstractMismatched crowdsourcing is a technique for acquiring automatic speech recognizer training data in under-resourced languages by decoding the transcriptions of workers who don’t know the target language using a noisy-channel model of cross-language speech perception.
Wenda Chen   +2 more
openaire   +1 more source

Hope Speech detection in under-resourced Kannada language

open access: yesCoRR, 2021
@article{hande-etal-kanhope, title = "Hope Speech detection in under-resourced Kannada language", author = "Hande, Adeep and Priyadharshini, Ruba and Sampath, Anbukkarasi and Thamburaj, Kingston Pal and Chandran, Prabakaran and Chakravarthi, Bharathi Raja ", journal={SN Computer Science}, publisher={Springer} }
Hande, Adeep   +5 more
openaire   +3 more sources

Semiautomatic Speech Alignment for Under-Resourced Languages [PDF]

open access: yes, 2022
Cross-language forced alignment is a solution for linguists who create speech corpora for very low-resource languages. However, cross-language is an additional challenge making a complex task, forced alignment, even more difficult. We study how linguists
Virpioja, Sami   +3 more
core  

A Sustainable and Open Access Knowledge Organization Model to Preserve Cultural Heritage and Language Diversity

open access: yesInformation, 2019
This paper proposes a new collaborative and inclusive model for Knowledge Organization Systems (KOS) for sustaining cultural heritage and language diversity.
Amel Fraisse   +7 more
doaj   +1 more source

Deep neural networks for automatic speech processing: a survey from large corpora to limited data

open access: yesEURASIP Journal on Audio, Speech, and Music Processing, 2022
Most state-of-the-art speech systems use deep neural networks (DNNs). These systems require a large amount of data to be learned. Hence, training state-of-the-art frameworks on under-resourced speech challenges are difficult tasks.
Vincent Roger   +2 more
doaj   +1 more source

Language and multilingualism in the teaching and learning of mathematics in South Africa: A review of literature in Pythagoras from 1994 to 2021

open access: yesPythagoras, 2022
This article presents a systematic review of research on language and multilingualism in mathematics education published in the South African journal Pythagoras from 1994 to 2021.
Kathryn McLachlan, Anthony A. Essien
doaj   +1 more source

Investigating the quality of static anchor embeddings from transformers for under-resourced languages

open access: yes, 2022
This paper reports on experiments for cross-lingual transfer using the anchor-based approach of Schuster et al. (2019) for English and a low-resourced language, namely Hindi.
Lefever, ElsLW220019930142378020002650670000-0002-7755-0591F98A820A-F0ED-11E1-A9DE-61C894A0A6B4   +5 more
core  

Datasets for South African Languages: Bilingual Aligned and Monolingual Data for Machine Translation

open access: yesJournal of Open Humanities Data
This data paper describes machine translation datasets built for the Autshumato project. The datasets contain both bilingual aligned data between English and all other official written languages of South Africa, namely Afrikaans (ISO 639-3: afr ...
Tanja Gaustad   +2 more
doaj   +1 more source

Home - About - Disclaimer - Privacy