Results 21 to 30 of about 182,846 (314)

Computational historical linguistics [PDF]

open access: yesTheoretical Linguistics, 2019
AbstractComputational approaches to historical linguistics have been proposed for half a century. Within the last decade, this line of research has received a major boost, owing both to the transfer of ideas and software from computational biology and to the release of several large electronic data resources suitable for systematic comparative work. In
openaire   +3 more sources

Computational Sociolinguistics: A Survey [PDF]

open access: yes, 2016
Language is a social phenomenon and variation is inherent to its social nature. Recently, there has been a surge of interest within the computational linguistics (CL) community in the social dimension of language.
de Jong, Franciska   +3 more
core   +5 more sources

The Bulgarian National Corpus: Theory and Practice in Corpus Design

open access: yesJournal of Language Modelling, 2012
The paper discusses several key concepts related to the development of corpora and reconsiders them in light of recent developments in NLP. On the basis of an overview of present-day corpora, we conclude that the dominant practices of corpus design do ...
Svetla Koeva   +5 more
doaj   +1 more source

Modeling the Paraphrase Detection Task over a Heterogeneous Graph Network with Data Augmentation

open access: yesInformation, 2020
Paraphrase detection is a Natural-Language Processing (NLP) task that aims at automatically identifying whether two sentences convey the same meaning (even with different words). For the Portuguese language, most of the works model this task as a machine-
Rafael T. Anchiêta   +2 more
doaj   +1 more source

Compositionality in Computational Linguistics

open access: yesAnnual Review of Linguistics, 2023
Neural models greatly outperform grammar-based models across many tasks in modern computational linguistics. This raises the question of whether linguistic principles, such as the Principle of Compositionality, still have value as modeling tools. We review the recent literature and find that while an overly strict interpretation of compositionality ...
Donatelli, Lucia, Koller, Alexander
openaire   +1 more source

Empirical Risk Minimization for Probabilistic Grammars: Sample Complexity and Hardness of Learning [PDF]

open access: yes, 2012
Probabilistic grammars are generative statistical models that are useful for compositional and sequential structures. They are used ubiquitously in computational linguistics.
Cohen, S. B., Smith, N. A.
core   +3 more sources

Weisfeiler-Leman in the Bamboo: Novel AMR Graph Metrics and a Benchmark for AMR Graph Similarity

open access: yesTransactions of the Association for Computational Linguistics, 2021
Several metrics have been proposed for assessing the similarity of (abstract) meaning representations (AMRs), but little is known about how they relate to human similarity ratings. Moreover, the current metrics have complementary strengths and weaknesses:
Juri Opitz, Angel Daza, Anette Frank
doaj   +1 more source

Learning Correlations between Linguistic Indicators and Semantic Constraints: Reuse of Context-Dependent Descriptions of Entities [PDF]

open access: yes, 1998
This paper presents the results of a study on the semantic constraints imposed on lexical choice by certain contextual indicators. We show how such indicators are computed and how correlations between them and the choice of a noun phrase description of a
Radev, Dragomir R.
core   +8 more sources

Correction to: A clinical trials corpus annotated with UMLS entities to enhance the access to evidence‑based medicine

open access: yesBMC Medical Informatics and Decision Making, 2021
An amendment to this paper has been published and can be accessed via the original article.
Leonardo Campillos-Llanos   +3 more
doaj   +1 more source

SUC-CORE: A Balanced Corpus Annotated with Noun Phrase Coreference

open access: yesNorthern European Journal of Language Technology, 2013
This paper describes SUC-CORE, a subset of the Stockholm Ume°a Corpus and the Swedish Treebank annotated with noun phrase coreference. While most coreference annotated corpora consist of exts of similar types within related domains, SUC-CORE consists of
Kristina Nilsson Björkenstam
doaj   +1 more source

Home - About - Disclaimer - Privacy