Morphological Analysis of the Slovak Language

open access: yes, Advances in Electrical and Electronic Engineering, 2015
This paper proposes a new statistical method of segmenting words by identifying a suffix. The ability to identify a suffix can improve morphological analysis by allowing the classifier to assign tags to words previously unseen in the training corpus.
Daniel Hladek, Jan Stas, Josef Juhar
doaj   +1 more source

Grammar-Supervised End-to-End Speech Recognition with Part-of-Speech Tagging and Dependency Parsing

open access: yes, Applied Sciences, 2023
For most automatic speech recognition systems, many unacceptable hypothesis errors still make the recognition results absurd and difficult to understand.
Genshun Wan   +5 more
doaj   +1 more source

Morphological Tagging and Lemmatization in the Albanian Language

open access: yes, SEEU Review, 2021
An important element of Natural Language Processing is part-of-speech tagging. With fine-grained word-class annotations, the word forms in a text can be enhanced and can also be used in downstream processes, such as dependency parsing.
Mati Diellza Nagavci   +2 more
doaj   +1 more source

Chunking clinical text containing non-canonical language [PDF]

open access: yes, 2014
Free text notes typed by primary care physicians during patient consultations typically contain highly non-canonical language. Shallow syntactic analysis of free text notes can help to reveal valuable information for the study of disease and treatment ...
Carroll, John   +2 more
core   +2 more sources

Implementation of Kadazan Tagger Based on Brill's Method

open access: yes, Journal of ICT Research and Applications, 2014
We present and evaluate an implementation of Part of Speech (POS) tagging for the Kadazan language using the transformation-based approach. The main purpose of this study is to develop an automatic POS tagger for the Kadazan language, which had ...
Marylyn Alex, Lailatul Qadri Zakaria
doaj   +1 more source

Part of Speech Tagging of Marathi Text Using Trigram Method [PDF]

open access: yes, 2013
In this paper we present a part-of-speech tagger for Marathi, a morphologically rich language spoken by the native people of Maharashtra. The general approach used to develop the tagger is statistical, using the trigram method.
Joshi, Nisheeth   +2 more
core   +2 more sources

Part of speech tagging in context [PDF]

open access: yes, Proceedings of the 20th international conference on Computational Linguistics - COLING '04, 2004
We present a new HMM tagger that exploits context on both sides of a word to be tagged, and evaluate it in both the unsupervised and supervised case. Along the way, we present the first comprehensive comparison of unsupervised methods for part-of-speech tagging, noting that published results to date have not been comparable across corpora or lexicons ...
Michele Banko, Robert C. Moore
openaire   +1 more source

Do Multi-Sense Embeddings Improve Natural Language Understanding? [PDF]

open access: yes, 2015
Learning a distinct representation for each sense of an ambiguous word could lead to more powerful and fine-grained models of vector-space representations. Yet while 'multi-sense' methods have been proposed and tested on artificial word-similarity tasks,
Jurafsky, Dan; Li, Jiwei
core   +1 more source

ReqTagger: A Rule-Based Tagger for Automatic Glossary of Terms Extraction from Ontology Requirements

open access: yes, Foundations of Computing and Decision Sciences, 2022
Glossary of Terms extraction from textual requirements is an important step in ontology engineering methodologies. Although it was initially intended to be performed manually, recent years have shown that some degree of automation is possible. Based on
Wiśniewski Dawid   +2 more
doaj   +1 more source

Alcohol Language Corpus [PDF]

open access: yes, 2011
The Alcohol Language Corpus (ALC) is the first publicly available speech corpus comprising intoxicated and sober speech of 162 female and male German speakers.
Barfüßer, Sabine   +2 more
core   +2 more sources
