Results 21 to 30 of about 182,846 (314)
Computational historical linguistics [PDF]
AbstractComputational approaches to historical linguistics have been proposed for half a century. Within the last decade, this line of research has received a major boost, owing both to the transfer of ideas and software from computational biology and to the release of several large electronic data resources suitable for systematic comparative work. In
openaire +3 more sources
Computational Sociolinguistics: A Survey [PDF]
Language is a social phenomenon and variation is inherent to its social nature. Recently, there has been a surge of interest within the computational linguistics (CL) community in the social dimension of language.
de Jong, Franciska+3 more
core +5 more sources
The Bulgarian National Corpus: Theory and Practice in Corpus Design
The paper discusses several key concepts related to the development of corpora and reconsiders them in light of recent developments in NLP. On the basis of an overview of present-day corpora, we conclude that the dominant practices of corpus design do ...
Svetla Koeva+5 more
doaj +1 more source
Modeling the Paraphrase Detection Task over a Heterogeneous Graph Network with Data Augmentation
Paraphrase detection is a Natural-Language Processing (NLP) task that aims at automatically identifying whether two sentences convey the same meaning (even with different words). For the Portuguese language, most of the works model this task as a machine-
Rafael T. Anchiêta+2 more
doaj +1 more source
Compositionality in Computational Linguistics
Neural models greatly outperform grammar-based models across many tasks in modern computational linguistics. This raises the question of whether linguistic principles, such as the Principle of Compositionality, still have value as modeling tools. We review the recent literature and find that while an overly strict interpretation of compositionality ...
Donatelli, Lucia, Koller, Alexander
openaire +1 more source
Empirical Risk Minimization for Probabilistic Grammars: Sample Complexity and Hardness of Learning [PDF]
Probabilistic grammars are generative statistical models that are useful for compositional and sequential structures. They are used ubiquitously in computational linguistics.
Cohen, S. B., Smith, N. A.
core +3 more sources
Weisfeiler-Leman in the
Several metrics have been proposed for assessing the similarity of (abstract) meaning representations (AMRs), but little is known about how they relate to human similarity ratings. Moreover, the current metrics have complementary strengths and weaknesses:
Juri Opitz, Angel Daza, Anette Frank
doaj +1 more source
Learning Correlations between Linguistic Indicators and Semantic Constraints: Reuse of Context-Dependent Descriptions of Entities [PDF]
This paper presents the results of a study on the semantic constraints imposed on lexical choice by certain contextual indicators. We show how such indicators are computed and how correlations between them and the choice of a noun phrase description of a
Radev, Dragomir R.
core +8 more sources
An amendment to this paper has been published and can be accessed via the original article.
Leonardo Campillos-Llanos+3 more
doaj +1 more source
SUC-CORE: A Balanced Corpus Annotated with Noun Phrase Coreference
This paper describes SUC-CORE, a subset of the Stockholm Ume°a Corpus and the Swedish Treebank annotated with noun phrase coreference. While most coreference annotated corpora consist of exts of similar types within related domains, SUC-CORE consists of
Kristina Nilsson Björkenstam
doaj +1 more source