Results 1 to 10 of about 190,852 (136)
Corpora for Computational Linguistics [PDF]
Since the mid 90s corpora has become very important for computational linguistics. This paper offers a survey of how they are currently used in different fields of the discipline, with particular emphasis on anaphora and coreference resolution, automatic
Evans, Richard +4 more
core +3 more sources
Corpora for computational linguistics Corpora for computational linguistics
Since the mid 90s corpora has become very important for computational linguistics. This paper offers a survey of how they are currently used in different fields of the discipline, with particular emphasis on anaphora and coreference resolution, automatic summarisation and term extraction. Their influence on other fields is also briefly discussed ...
Constantin Orasan +4 more
openaire +2 more sources
Schrödinger's tree—On syntax and neural language models
In the last half-decade, the field of natural language processing (NLP) has undergone two major transitions: the switch to neural networks as the primary modeling paradigm and the homogenization of the training regime (pre-train, then fine-tune).
Artur Kulmizev +2 more
doaj +1 more source
Testing the Effectiveness of the Diagnostic Probing Paradigm on Italian Treebanks
The outstanding performance recently reached by neural language models (NLMs) across many natural language processing (NLP) tasks has steered the debate towards understanding whether NLMs implicitly learn linguistic competence.
Alessio Miaschi +4 more
doaj +1 more source
Acoustic compression in Zoom audio does not compromise voice recognition performance
Human voice recognition over telephone channels typically yields lower accuracy when compared to audio recorded in a studio environment with higher quality.
Valeriia Perepelytsia, Volker Dellwo
doaj +1 more source
Dialectology for Computational Linguists
This paper provides an overview of computational work in dialectology. Wehave published similar surveys in the not-too-distant past (Heeringa and Prokic,2018; Wieling and Nerbonne, 2015), but these were aimed at dialectologists andgeneral linguists, respectively.
Nerbonne, J. +3 more
openaire +5 more sources
Inter-Coder Agreement for Computational Linguistics [PDF]
This article is a survey of methods for measuring agreement among corpus annotators. It exposes the mathematics and underlying assumptions of agreement coefficients, covering Krippendorff's alpha as well as Scott's pi and Cohen's kappa; discusses the ...
Atkins Sue +12 more
core +2 more sources
Computational linguistics and linguistic theory [PDF]
The present paper is an attempt to justify and explain the direction of present research in Ottawa (Carleton University and also University of Ottawa) on Computational manipulation of speech. Our actual realizations are not necessarily original; rather, we are trying to make use of the findings of other workers, assembling them, however, in a different
openaire +2 more sources
Semi-Automatic Construction of a Readability Corpus for the Vietnamese Language
Text readability is a measure of how easy or difficult it is to read a text. This readability factor plays a crucial role in the processes of drafting and comprehending the texts, affecting the choice of proper texts for reading.
An-Vinh Luong, Dien Dinh
doaj +1 more source
OGER++: hybrid multi-type entity recognition
Background We present a text-mining tool for recognizing biomedical entities in scientific literature. OGER++ is a hybrid system for named entity recognition and concept recognition (linking), which combines a dictionary-based annotator with a corpus ...
Lenz Furrer +3 more
doaj +1 more source

