Results 31 to 40 of about 63,855 (326)
Morphological Analysis of the Slovak Language
This paper proposes a new statistic-based method of segmenting words by identification of a suffix. Ability to identify suffix can improve morphological analysis by allowing the classifier to assign tags to words previously unseen in the training corpus.
Daniel Hladek, Jan Stas, Josef Juhar
doaj +1 more source
Grammar-Supervised End-to-End Speech Recognition with Part-of-Speech Tagging and Dependency Parsing
For most automatic speech recognition systems, many unacceptable hypothesis errors still make the recognition results absurd and difficult to understand.
Genshun Wan +5 more
doaj +1 more source
Morphological Tagging and Lemmatization in the Albanian Language
An important element of Natural Language Processing is parts of speech tagging. With fine-grained word-class annotations, the word forms in a text can be enhanced and can also be used in downstream processes, such as dependency parsing.
Mati Diellza Nagavci +2 more
doaj +1 more source
Chunking clinical text containing non-canonical language [PDF]
Free text notes typed by primary care physicians during patient consultations typically contain highly non-canonical language. Shallow syntactic analysis of free text notes can help to reveal valuable information for the study of disease and treatment ...
Carroll, John +2 more
core +2 more sources
Implementation of Kadazan Tagger Based on Brill's Method
We present and evaluate the implementation of Part of Speech (POS) Tagging for the Kadazan language by using the Transformation-based approach. The main purpose of this study is to develop an automatic POS tagging for the Kadazan language, which had ...
Marylyn Alex, Lailatul Qadri Zakaria
doaj +1 more source
Part of Speech Tagging of Marathi Text Using Trigram Method [PDF]
In this paper we present a Marathi part of speech tagger. It is a morphologically rich language. It is spoken by the native people of Maharashtra. The general approach used for development of tagger is statistical using trigram Method.
Joshi, Nisheeth +2 more
core +2 more sources
Part of speech tagging in context [PDF]
We present a new HMM tagger that exploits context on both sides of a word to be tagged, and evaluate it in both the unsupervised and supervised case. Along the way, we present the first comprehensive comparison of unsupervised methods for part-of-speech tagging, noting that published results to date have not been comparable across corpora or lexicons ...
Michele Banko, Robert C. Moore
openaire +1 more source
Do Multi-Sense Embeddings Improve Natural Language Understanding? [PDF]
Learning a distinct representation for each sense of an ambiguous word could lead to more powerful and fine-grained models of vector-space representations. Yet while `multi-sense' methods have been proposed and tested on artificial word-similarity tasks,
Jurafsky, Dan, Li, Jiwei
core +1 more source
ReqTagger: A Rule-Based Tagger for Automatic Glossary of Terms Extraction from Ontology Requirements
Glossary of Terms extraction from textual requirements is an important step in ontology engineering methodologies. Although initially it was intended to be performed manually, last years have shown that some degree of automatization is possible. Based on
Wiśniewski Dawid +2 more
doaj +1 more source
The Alcohol Language Corpus (ALC) is the first publicly available speech corpus comprising intoxicated and sober speech of 162 female and male German speakers.
Barfüßer, Sabine +2 more
core +2 more sources

