Results 21 to 30 of about 45,674 (282)
A Python package for text processing for Serbian: nlpheart [PDF]
Within the past two decades, text processing became an important part of most state-of-the-art advanced automation systems. However, for many under-resourced languages it is still challenging to perform textual data preparation, due to the lack of ...
Ostrogonac Stevan +2 more
doaj +1 more source
Statistical speech and language processing techniques, requiring large amounts of training data, are currently state-of-the-art in automatic speech recognition.
CUCU, H. +3 more
doaj +1 more source
Automatic Speech Recognition Using Limited Vocabulary: A Survey
Automatic Speech Recognition (ASR) is an active field of research due to its large number of applications and the proliferation of interfaces or computing devices that can support speech processing.
Jean Louis K. E Fendji +3 more
doaj +1 more source
Code-Switching in Automatic Speech Recognition: The Issues and Future Directions
Code-switching (CS) in spoken language is where the speech has two or more languages within an utterance. It is an unsolved issue in automatic speech recognition (ASR) research as ASR needs to recognise speech in bilingual and multilingual settings ...
Mumtaz Begum Mustafa +6 more
doaj +1 more source
Extractive summarization of Malayalam documents using latent Dirichlet allocation: An experience
Automatic text summarization (ATS) extracts information from a source text and presents it to the user in a condensed form while preserving its primary content.
Kondath Manju +2 more
doaj +1 more source
Zero-Shot Cross-Lingual Transfer with Meta Learning
Learning what to share between tasks has been a topic of great importance recently, as strategic sharing of knowledge has been shown to improve downstream task performance.
Augenstein, Isabelle +3 more
core +1 more source
Author identification for Under-Resourced language (KadazanDusun)
<span>This paper presents the task of Author Identification for KadazanDusun language by using tweets as the source of data to perform Author Identification task of short text on KadazanDusun, which is considered as one the under-resourced language in Malaysia.
Nursyahirah Tarmizi +2 more
openaire +2 more sources
Domain Generalization for Language-Independent Automatic Speech Recognition
A language-independent automatic speech recognizer (ASR) is one that can be used for phonetic transcription in languages other than the languages in which it was trained.
Heting Gao +6 more
doaj +1 more source
MiNgMatch—A Fast N-gram Model for Word Segmentation of the Ainu Language
Word segmentation is an essential task in automatic language processing for languages where there are no explicit word boundary markers, or where space-delimited orthographic words are too coarse-grained.
Karol Nowakowski +2 more
doaj +1 more source
Bayesian Models for Unit Discovery on a Very Low Resource Language [PDF]
Developing speech technologies for low-resource languages has become a very active research field over the last decade. Among others, Bayesian models have shown some promising results on artificial examples but still lack of in situ experiments. Our work
Besacier, Laurent +9 more
core +3 more sources

