Organizing an in-class hackathon to correct PDF-to-text conversion errors of 1.0 [PDF]
This paper describes a community effort to improve earlier versions of the full-text corpus of Genomics & Informatics by semi-automatically detecting and correcting PDF-to-text conversion errors and optical character recognition errors during the first ...
Sunho Kim +44 more
doaj +1 more source
Introducing a corpus of conversational stories. Construction and annotation of the Narrative Corpus [PDF]
Although widely seen as critical both in terms of its frequency and its social significance as a prime means of encoding and perpetuating moral stance and configuring self and identity, conversational narrative has received little attention in corpus ...
O'Donnell, Matthew Brook +1 more
core +1 more source
Conflicting Tendencies in the Development of Scientific and Technical Language Varieties: Metaphorization vs. Standardization [PDF]
The present paper discusses relations between meaning and context as an interactive process that promotes cognition and communication, both intralingual and interlingual.
Iļjinska, Larisa, Smirnova, Tatjana
core +2 more sources
Etablering af et juridisk tekstkorpus
The project involves the collection of a corpus of English-French-Danish legal texts exemplifying different text types and different themes within the subject area of the law of contract. Each language will be represented by 1 million words.
Gunhild Dyrberg +3 more
doaj +1 more source
Learning and teaching of connectors of contrargumentation in the Spanish language. The use of the student corpus [PDF]
The present article is a part of M.A. thesis completed and defended in the Institute of Linguistics at Adam Mickiewicz University in Poznań (June 2006).
Górska, Weronika
core +2 more sources
A Context-theoretic Framework for Compositionality in Distributional Semantics [PDF]
Techniques in which words are represented as vectors have proved useful in many applications in computational linguistics, however there is currently no general semantic formalism for representing meaning in terms of vectors.
Clarke, Daoud
core +2 more sources
COLLOCATIONS STUDY BASED ON THE TEXT CORPORA
The problem of extracting and studying collocations on the basis of linguistic corpora is considered. Such concepts of corpus linguistics as representativeness and the corpus volume, corpus manager (Concordance) and text corpora peculiarities are studied.
Tatyana Yurevna Paveleva
doaj +1 more source
Corpus-based translation research emerged in the late 1990s as a new area of research in the discipline of translation studies. It is informed by a specific area of linguistics known as corpus linguistics which involves the analysis of large corpora of ...
A. Kruger
doaj +1 more source
Temporal Relations at the Sentence and Text Genre Level: The Role of Linguistic Cueing and Non-linguistic Biases—An Annotation Study of a Bilingual Corpus [PDF]
AbstractThis study investigates the role of non-linguistic biases in the obligatory (verb tenses) and optional (discourse connectives) linguistic marking for inferring temporal relations at the sentence and the text genre levels. Specifically, we formulated and tested several assumptions: (1) thelinguistic cueing assumption(verb tenses inform language ...
Grisot, Cristina, Blochowiak, Joanna
openaire +2 more sources
Lexikos at eighteen: an analysis [PDF]
At eighteen, Lexikos became a major player in the field of linguistics, by being awarded an Impact Factor. This article presents a double analysis of the foundation that led to this success. On the one hand a thorough statistical study is undertaken with
de Schryver, Gilles-Maurice
core +2 more sources

