Multilingual Sentiment Analysis for Under-Resourced Languages: A Systematic Review of the Landscape [PDF]
Sentiment analysis automatically evaluates people’s opinions of products or services. It is an emerging research area with promising advancements in high-resource languages such as Indo-European languages (e.g. English).
Koena Ronny Mabokela +2 more
doaj +4 more sources
Word-length algorithm for language identification of under-resourced languages
Language identification is widely used in machine learning, text mining, information retrieval, and speech processing. Available techniques for solving the problem of language identification do require large amount of training text that are not available
Ali Selamat, Nicholas Akosu
doaj +4 more sources
Artificial intelligence translation in healthcare: an urgent call for evidence-informed policy frameworks [PDF]
The deployment of artificial intelligence (AI) translation tools in healthcare is accelerating rapidly, yet regulatory frameworks lag dangerously behind clinical practice.
Jonathan H Chen +8 more
doaj +2 more sources
Speech recognition for under-resourced languages: Data sharing in hidden Markov model systems
For purposes of automated speech recognition in under-resourced environments, techniques used to share acoustic data between closely related or similar languages become important.
Febe de Wet +3 more
doaj +3 more sources
Spoken word corpus and dictionary definition for an African language [PDF]
The preservation of languages is critical to maintaining and strengthening the cultures and identities of communities, and this is especially true for under-resourced languages with a predominantly oral culture.
Wanjiku Nganga, Ikechukwu Achebe
doaj +3 more sources
NCHLT Auxiliary speech data for ASR technology development in South Africa
The aim of the National Centre for Human Language Technology (NCHLT) project was to create speech and text resources that would enable Human Language Technology (HLT) development for the 11 official languages of South Africa. The speech data described in
Jaco Badenhorst, Febe de Wet
doaj +1 more source
Improving the Performance of Low-resourced Speaker Identification with Data Preprocessing
Automatic speaker identification is done to tackle daily security problems. Speech data collection is an essential but very challenging task for under-resourced languages like Burmese.
Win Lai Lai Phyu +2 more
doaj +1 more source
En aquesta introducció es presenta un resum del número especial que la Revista de Llengua i Dret, Journal of Language and Law dedica a la traducció i la interpretació (TI) jurídiques en el món de les tecnologies. Tot i que les tecnologies de la traducció
Christopher D. Mellinger +1 more
doaj +1 more source
Strategies for building wordnets for under-resourced languages: The case of African languages
The African Wordnet Project (AWN) aims at building wordnets for five African languages: Setswana, isiXhosa, isiZulu, Sesotho sa Leboa (also referred to as Sepedi or Northern Sotho) and Tshivenda.
Sonja E. Bosch, Marissa Griesel
doaj +1 more source
A Python package for text processing for Serbian: nlpheart [PDF]
Within the past two decades, text processing became an important part of most state-of-the-art advanced automation systems. However, for many under-resourced languages it is still challenging to perform textual data preparation, due to the lack of ...
Ostrogonac Stevan +2 more
doaj +1 more source

