Results 21 to 30 of about 63,855 (324)

Weakly supervised POS tagging without disambiguation [PDF]

open access: yes, 2018
Weakly supervised part-of-speech (POS) tagging is to learn to predict the POS tag for a given word in context by making use of partial annotated data instead of the fully tagged corpora.
He, Yulan   +3 more
core   +1 more source

A studyforrest extension, an annotation of spoken language in the German dubbed movie “Forrest Gump” and its audio-description [version 1; peer review: 1 approved, 2 approved with reservations]

open access: yesF1000Research, 2021
Here we present an annotation of speech in the audio-visual movie “Forrest Gump” and its audio-description for a visually impaired audience, as an addition to a large public functional brain imaging dataset (studyforrest.org).
Christian Olaf Häusler, Michael Hanke
doaj   +1 more source

Part of Speech Tagging for Ancient Greek

open access: yesOpen Linguistics, 2016
In this article we report the results for five POS taggers, i.e., the Mate tagger, the Hunpos tagger, RFTagger, theOpenNLP tagger, andNLTKUnigramtagger, tested on the data of the Ancient Greek Dependency Treebank.
Celano Giuseppe G. A.   +2 more
doaj   +1 more source

Improving the quality of Gujarati-Hindi Machine Translation through part-of-speech tagging and stemmer-assisted transliteration [PDF]

open access: yes, 2013
Machine Translation for Indian languages is an emerging research area. Transliteration is one such module that we design while designing a translation system. Transliteration means mapping of source language text into the target language.
Ameta, Juhi   +2 more
core   +1 more source

Reducing Confusion in Active Learning for Part-Of-Speech Tagging

open access: yesTransactions of the Association for Computational Linguistics, 2021
Active learning (AL) uses a data selection algorithm to select useful training samples to minimize annotation cost. This is now an essential tool for building low-resource syntactic analyzers such as part-of-speech (POS) taggers.
Aditi Chaudhary   +3 more
doaj   +1 more source

On the development of a tagset for Northern Sotho with special reference to the issue of standardisation

open access: yesLiterator, 2008
Working with corpora in the South African Bantu languages has up till now been limited to the utilisation of raw corpora. Such corpora, however, have limited functionality.
E. Taljard   +3 more
doaj   +1 more source

Text Preprocessing for Speech Synthesis [PDF]

open access: yes, 2006
In this paper we describe our text preprocessing modules for English text-to-speech synthesis. These modules comprise rule-based text normalization subsuming sentence segmentation and normalization of non-standard words, statistical part-of-speech ...
Pfitzinger, Hartmut R., Reichel, Uwe D.
core   +3 more sources

Bayesian Belief Networks to handle NLP problems [PDF]

open access: yesE3S Web of Conferences
In corpus linguistics, part-of-speech tagging (POS tagging or PoS tagging or POST), also called grammatical tagging is the process of marking up a word in a text (corpus) as corresponding to a particular part of speech, based on both its definition and ...
Sak Alexander
doaj   +1 more source

MALAY PART OF SPEECH TAGGING USING RULED-BASED APPROACH

open access: yesAsia-Pacific Journal of Information Technology and Multimedia, 2017
The research on part of speech (POS) tagging has been widely applied and used through a variety of approaches, particularly for European languages. But it is more challenging for Asian languages, especially Malay as it has some element of modification ...
Nur Ashikin Halid, Nazlia Omar
doaj   +1 more source

Detecting proper nouns in indonesian-language translation of the quran using a guided method

open access: yesJournal of King Saud University: Computer and Information Sciences, 2020
Proper nouns (often abbreviated PN or NNP) are a class of words important in labelling and subsequent text processing, especially in natural language processing (NLP). Name entity recognition (NER) is one study that requires PN.
Suwanto Raharjo   +2 more
doaj   +1 more source

Home - About - Disclaimer - Privacy