Parallel and comparable corpora - Open Access .click

Results 11 to 20 of about 3,792 (286)

Bilingual Lexicon Extraction from Comparable Corpora Enhanced with Parallel Corpora

, 2011
In this article, we present a simple and effective approach for extracting bilingual lexicon from comparable corpora enhanced with parallel corpora. We make use of structural characteristics of the documents comprising the comparable corpus to extract parallel sentences with a high degree of quality.
Morin, Emmanuel, Prochasson, Emmanuel
openaire +4 more sources

Parallel Sentence Retrieval From Comparable Corpora for Biomedical Text Simplification [PDF]

Proceedings - Natural Language Processing in a Deep Learning World, 2019
Parallel sentences provide semantically similar information which can vary on a given dimension , such as language or register. Parallel sentences with register variation (like expert and non-expert documents) can be exploited for the automatic text simplification.
Cardon, Rémi, Grabar, Natalia
core +4 more sources

How comparable are parallel corpora? Measuring the distribution of general vocabulary and connectives [PDF]

, 2011
In this paper, we question the homogeneity of a large parallel corpus by measuring the similarity between various sub-parts. We compare results obtained using a general measure of lexical similarity based on χ2 and by counting the number of discourse connectives.
Cartoni Bruno +3 more
openaire +4 more sources

Building of Parallel and Comparable Cybersecurity Corpora for Bilingual Terminology Extraction [PDF]

Linköping Electronic Conference Proceedings, 2022
The paper aims at presenting English-Lithuanian corpora for bilingual term extraction (BiTE) in the cybersecurity domain within the framework of the project DVITAS.
Utka, Andrius +5 more
openaire +3 more sources

Monolingual comparable corpora and parallel corpora in the search for features of translated language [PDF]

, 2011
For almost two decades now, mainstream corpus-based research in descriptive translation studies has focused on the computerized analysis of translated and comparable non-translated texts (so-called monolingual comparable corpora). These have been suggested to better serve a target-oriented, descriptive approach to translation research than parallel ...
BERNARDINI, SILVIA
openaire +3 more sources

TRANSLATION EQUIVALENTS IN BRITISH AND BULGARIAN PRINT MEDIA [PDF]

Studies in Linguistics, Culture, and FLT, 2017
The paper deals with collocations of intensifiers and adjectives existing in original English and Bulgarian texts and aims to pair them as possible translation equivalents, thus helping to map the possible renditions of one and the same concept in two ...
Irina Stoyanova-Georgieva
doaj +1 more source

Η συμβολή των παράλληλων και συγκρίσιμων σωμάτων κειμένων στη μετάφραση: ηλεκτρονικά εργαλεία επεξεργασίας και ανάλυσής τους

Syn-Thèses, 2012
This research has been co-financed by the European Union (European Social Fund – ESF) and Greek national funds through the Operational Program "Education and Lifelong Learning" of the National Strategic Reference Framework (NSRF) - Research Funding ...
Eleni Tziafa
doaj +1 more source

Die Komparativ- und Superlativformen besser/am besten/das Beste als Varianten in modalen Vergleichskonstruktionen

Linguistik Online, 2021
It is well known that linguistic variants play a key role in the acquisition of language skills in the first, second or foreign language as well as in writing and translation processes and in general in communicative interactions.
Doris Höhmann
doaj +1 more source

Multilingual digital resources with Bulgarian language

Cognitive Studies | Études cognitives, 2015
Multilingual digital resources with Bulgarian language The paper presents in brief Bulgarian language resources as a part of multilingual digital resources developed in the frame of some international projects, among them parallel annotated and aligned
Ludmila Dimitrova
doaj +1 more source

BUCC2017: A Hybrid Approach for Identifying Parallel Sentences in Comparable Corpora [PDF]

Proceedings of the 10th Workshop on Building and Using Comparable Corpora, 2017
A Statistical Machine Translation (SMT) system is always trained using large parallel corpus to produce effective translation. Not only is the corpus scarce, it also involves a lot of manual labor and cost. Parallel corpus can be prepared by employing comparable corpora where a pair of corpora is in two different languages pointing to the same domain ...
Sainik Kumar Mahata +2 more
openaire +1 more source

parallel corpora
comparable corpora
natural language processing

corpus linguistics
computational linguistics
corpus

[shs] humanities and social sciences
[info] computer science [cs]
corpus-based translation studies