Results 11 to 20 of about 3,792 (286)
Bilingual Lexicon Extraction from Comparable Corpora Enhanced with Parallel Corpora
In this article, we present a simple and effective approach for extracting bilingual lexicon from comparable corpora enhanced with parallel corpora. We make use of structural characteristics of the documents comprising the comparable corpus to extract parallel sentences with a high degree of quality.
Morin, Emmanuel, Prochasson, Emmanuel
openaire +4 more sources
Parallel Sentence Retrieval From Comparable Corpora for Biomedical Text Simplification [PDF]
Parallel sentences provide semantically similar information which can vary on a given dimension , such as language or register. Parallel sentences with register variation (like expert and non-expert documents) can be exploited for the automatic text simplification.
Cardon, Rémi, Grabar, Natalia
core +4 more sources
How comparable are parallel corpora? Measuring the distribution of general vocabulary and connectives [PDF]
In this paper, we question the homogeneity of a large parallel corpus by measuring the similarity between various sub-parts. We compare results obtained using a general measure of lexical similarity based on χ2 and by counting the number of discourse connectives.
Cartoni Bruno +3 more
openaire +4 more sources
Building of Parallel and Comparable Cybersecurity Corpora for Bilingual Terminology Extraction [PDF]
The paper aims at presenting English-Lithuanian corpora for bilingual term extraction (BiTE) in the cybersecurity domain within the framework of the project DVITAS.
Utka, Andrius +5 more
openaire +3 more sources
Monolingual comparable corpora and parallel corpora in the search for features of translated language [PDF]
For almost two decades now, mainstream corpus-based research in descriptive translation studies has focused on the computerized analysis of translated and comparable non-translated texts (so-called monolingual comparable corpora). These have been suggested to better serve a target-oriented, descriptive approach to translation research than parallel ...
BERNARDINI, SILVIA
openaire +3 more sources
TRANSLATION EQUIVALENTS IN BRITISH AND BULGARIAN PRINT MEDIA [PDF]
The paper deals with collocations of intensifiers and adjectives existing in original English and Bulgarian texts and aims to pair them as possible translation equivalents, thus helping to map the possible renditions of one and the same concept in two ...
Irina Stoyanova-Georgieva
doaj +1 more source
This research has been co-financed by the European Union (European Social Fund – ESF) and Greek national funds through the Operational Program "Education and Lifelong Learning" of the National Strategic Reference Framework (NSRF) - Research Funding ...
Eleni Tziafa
doaj +1 more source
It is well known that linguistic variants play a key role in the acquisition of language skills in the first, second or foreign language as well as in writing and translation processes and in general in communicative interactions.
Doris Höhmann
doaj +1 more source
Multilingual digital resources with Bulgarian language
Multilingual digital resources with Bulgarian language The paper presents in brief Bulgarian language resources as a part of multilingual digital resources developed in the frame of some international projects, among them parallel annotated and aligned
Ludmila Dimitrova
doaj +1 more source
BUCC2017: A Hybrid Approach for Identifying Parallel Sentences in Comparable Corpora [PDF]
A Statistical Machine Translation (SMT) system is always trained using large parallel corpus to produce effective translation. Not only is the corpus scarce, it also involves a lot of manual labor and cost. Parallel corpus can be prepared by employing comparable corpora where a pair of corpora is in two different languages pointing to the same domain ...
Sainik Kumar Mahata +2 more
openaire +1 more source

