
Learning languages from parallel corpora [PDF]

open access: yes, Slovenščina 2.0: Empirične, aplikativne in interdisciplinarne raziskave, 2022
This work describes a blueprint for an application that generates language learning exercises from parallel corpora. Word alignment and parallel structures allow for the automatic assessment of sentence pairs in the source and target languages, while ...
Johannes Graën
doaj   +2 more sources

Parallel Corpora

open access: yes, 2020
This chapter gives an overview of parallel corpora, i.e. corpora containing source texts in a given language, aligned with their translations in another language. More specifically, it focuses on directional corpora, i.e. parallel corpora where the source and target languages are clearly identified. These types of corpora are widely used in contrastive
Lefer, Marie-Aude
openaire   +3 more sources

PARALLEL CORPORA IN TRANSLATOR EDUCATION [PDF]

open access: yes, redit - Revista Electrónica de Didáctica de la Traducción y la Interpretación, 2016
Mainly four kinds of corpora have been used in the field of translator education: comparable corpora, parallel corpora, parallel learner corpora and ‘do it yourself’ or ‘disposable’ corpora. This article focuses on the use of parallel corpora in translator education.
Ruiz Yepes, Guadalupe
openaire   +3 more sources

ParaCrawl: Web-Scale Acquisition of Parallel Corpora [PDF]

open access: yes, Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020
We report on methods to create the largest publicly available parallel corpora by crawling the web, using open source software. We empirically compare alternative methods and publish benchmark data sets for sentence alignment and sentence pair filtering.
Marta Bañón   +18 more
openaire   +6 more sources

From subtitles to parallel corpora [PDF]

open access: yes, 2012
We describe the preparation of parallel corpora based on professional quality subtitles in seven European language pairs. The main focus is the effect of the processing steps on the size and quality of the final corpora.
Fishel, Mark   +6 more
openaire   +5 more sources

Construction Method of Parallel Corpus for Minority Language Machine Translation [PDF]

open access: yes, Jisuanji kexue, 2022
The training performance of neural machine translation depends heavily on the scale and quality of parallel corpus. Unlike some common languages, the construction of high-quality parallel corpora between Chinese and minority languages has been lagging. The ...
LIU Yan, XIONG De-yi
doaj   +1 more source

Bilingual Lexicon Extraction from Comparable Corpora Enhanced with Parallel Corpora

open access: yes, 2011
In this article, we present a simple and effective approach for extracting bilingual lexicon from comparable corpora enhanced with parallel corpora. We make use of structural characteristics of the documents comprising the comparable corpus to extract parallel sentences with a high degree of quality.
Morin, Emmanuel, Prochasson, Emmanuel
openaire   +4 more sources

Empirical Analysis of Parallel Corpora and In-Depth Analysis Using LIWC

open access: yes, Applied Sciences, 2022
The machine translation system aims to translate source language into target language. Recent studies on MT systems mainly focus on neural machine translation.
Chanjun Park   +6 more
doaj   +1 more source

A Simple Yet Robust Algorithm for Automatic Extraction of Parallel Sentences: A Case Study on Arabic-English Wikipedia Articles

open access: yes, IEEE Access, 2022
Parallel corpora are vital components in several applications of Natural Language Processing (NLP), particularly in machine translation. In this paper, we present a novel method for automatically creating parallel sentences from comparable corpora.
Maha Jarallah Althobaiti
doaj   +1 more source

The neural machine translation models for the low-resource Kazakh–English language pair [PDF]

open access: yes, PeerJ Computer Science, 2023
The development of the machine translation field was driven by people’s need to communicate with each other globally by automatically translating words, sentences, and texts from one language into another.
Vladislav Karyukin   +4 more
doaj   +2 more sources
