Learning languages from parallel corpora
This work describes a blueprint for an application that generates language learning exercises from parallel corpora. Word alignment and parallel structures allow for the automatic assessment of sentence pairs in the source and target languages, while ...
Johannes Graën
doaj +2 more sources
This chapter gives an overview of parallel corpora, i.e. corpora containing source texts in a given language, aligned with their translations in another language. More specifically, it focuses on directional corpora, i.e. parallel corpora where the source and target languages are clearly identified. These types of corpora are widely used in contrastive
Lefer, Marie-Aude
openaire +3 more sources
PARALLEL CORPORA IN TRANSLATOR EDUCATION
Four main kinds of corpora have been used in translator education: comparable corpora, parallel corpora, parallel learner corpora and ‘do it yourself’ or ‘disposable’ corpora. This article focuses on the use of parallel corpora in translator education.
Ruiz Yepes, Guadalupe
openaire +3 more sources
ParaCrawl: Web-Scale Acquisition of Parallel Corpora
We report on methods to create the largest publicly available parallel corpora by crawling the web, using open source software. We empirically compare alternative methods and publish benchmark data sets for sentence alignment and sentence pair filtering.
Marta Bañón +18 more
openaire +6 more sources
From subtitles to parallel corpora
We describe the preparation of parallel corpora based on professional quality subtitles in seven European language pairs. The main focus is the effect of the processing steps on the size and quality of the final corpora.
Fishel, Mark +6 more
openaire +5 more sources
Construction Method of Parallel Corpus for Minority Language Machine Translation
The training performance of neural machine translation depends heavily on the scale and quality of the parallel corpus. Unlike some common languages, the construction of high-quality parallel corpora between Chinese and minority languages has been lagging. The ...
LIU Yan, XIONG De-yi
doaj +1 more source
Bilingual Lexicon Extraction from Comparable Corpora Enhanced with Parallel Corpora
In this article, we present a simple and effective approach for extracting bilingual lexicon from comparable corpora enhanced with parallel corpora. We make use of structural characteristics of the documents comprising the comparable corpus to extract parallel sentences with a high degree of quality.
Morin, Emmanuel, Prochasson, Emmanuel
openaire +4 more sources
Empirical Analysis of Parallel Corpora and In-Depth Analysis Using LIWC
A machine translation (MT) system aims to translate text from a source language into a target language. Recent studies on MT systems mainly focus on neural machine translation.
Chanjun Park +6 more
doaj +1 more source
Parallel corpora are vital components in several applications of Natural Language Processing (NLP), particularly in machine translation. In this paper, we present a novel method for automatically creating parallel sentences from comparable corpora.
Maha Jarallah Althobaiti
doaj +1 more source
The neural machine translation models for the low-resource Kazakh–English language pair
The development of the machine translation field was driven by people’s need to communicate with each other globally by automatically translating words, sentences, and texts from one language into another.
Vladislav Karyukin +4 more
doaj +2 more sources

