Old Catalan Morphosyntax: Developing an Annotated Corpus
This paper presents a full procedure for the development of a Part-of-Speech (POS) tagged corpus of Old Catalan. As an extremely low-resource language with rich inflection and frequent homographs, Old Catalan poses non-trivial problems in the development
Marieke Meelen, Afra Pujol i Campeny
doaj +1 more source
Simple Semi-Supervised POS Tagging [PDF]
We tackle the question: how much supervision is needed to achieve state-of-the-art performance in part-of-speech (POS) tagging, if we leverage lexical representations given by the model of Brown et al. (1992)? It has become a standard practice to use automatically induced “Brown clusters” in place of POS tags.
Karl Stratos, Michael Collins
openaire +1 more source
Improving part-of-speech tagging in Amharic language using deep neural network
To date, several POS taggers have been introduced to facilitate the success of semantic analysis for different languages. However, the task of POS tagging becomes a bit intricate in morphologically complex languages, like Amharic.
Sintayehu Hirpassa, G.S. Lehal
doaj +1 more source
Turkish PoS Tagging by Reducing Sparsity with Morpheme Tags in Small Datasets [PDF]
Sparsity is one of the major problems in natural language processing. The problem becomes even more severe in agglutinating languages that are highly prone to be inflected. We deal with sparsity in Turkish by adopting morphological features for part-of-speech tagging.
Burcu Can +2 more
openalex +5 more sources
The infinite HMM for unsupervised PoS tagging [PDF]
We extend previous work on fully unsupervised part-of-speech tagging. Using a non-parametric version of the HMM, called the infinite HMM (iHMM), we address the problem of choosing the number of hidden states in unsupervised Markov models for PoS tagging.
Jurgen Van Gael +2 more
openaire +2 more sources
The challenge of POS tagging and lemmatization in morphologically rich languages is examined by comparing German and Latin. We start by defining an NLP evaluation roadmap to model the combination of tools and resources guiding our experiments.
Rüdiger Gleim +8 more
doaj +1 more source
POS Tagging Bahasa Madura dengan Menggunakan Algoritma Brill Tagger [POS Tagging of Madurese Using the Brill Tagger Algorithm]
Madurese is a regional language that is used not only on Madura Island but also in other areas, such as the cities of Jember, Pasuruan, and Probolinggo.
Nindian Puspa Dewi, Ubaidi Ubaidi
doaj +1 more source
Deep Learning based Tamil Parts of Speech (POS) Tagger [PDF]
This paper addresses the problem of part-of-speech (POS) tagging for the Tamil language, which is low-resourced and agglutinative. POS tagging is the process of assigning syntactic categories to the words in a sentence.
S. Anbukkarasi, S. Varadhaganapathy
doaj +1 more source
POS tagging using relaxation labelling [PDF]
compressed & uuencoded postscript file.
openaire +3 more sources
Implementation of Kadazan Tagger Based on Brill's Method
We present and evaluate the implementation of Part of Speech (POS) Tagging for the Kadazan language by using the Transformation-based approach. The main purpose of this study is to develop an automatic POS tagging for the Kadazan language, which had ...
Marylyn Alex, Lailatul Qadri Zakaria
doaj +1 more source

