Results 241 to 250 of about 3,792 (286)
Some of the next articles are maybe not open access.

Comparable parallel corpora

Studies in Corpus Linguistics, 2019
Are papers presented in corpus-based translation studies truly scientific? These are normally done on only one language pair, often on purpose-made parallel corpora, and can normally not be replicated. Therefore their value is limited in a strictly scientific sense.
exaly   +2 more sources

Parallel sentence generation from comparable corpora for improved SMT

Machine Translation, 2011
A parallel corpus is an essential resource for statistical machine translation (SMT) but is often not available in the required amounts for all domains and languages. An approach is presented here which aims at producing parallel corpora from available comparable corpora.
Holger Schwenk
exaly   +2 more sources

Extracting parallel phrases from comparable corpora

2014 International Conference on Asian Language Processing (IALP), 2014
Hailong Cao, Tiejun Zhao
exaly   +2 more sources

Comparable or Parallel Corpora?

International Journal of Lexicography, 1996
exaly   +2 more sources

Building Subject-aligned Comparable Corpora and Mining it for Truly Parallel Sentence Pairs

open access: yesProcedia Technology, 2014
Parallel sentences are a relatively scarce but extremely useful resource for many applications including cross-lingual retrieval and statistical machine translation.
Krzysztof Wołk, Krzysztof Marasek
exaly   +2 more sources

Semi-Automatic Parallel Corpora Extraction from Comparable News Corpora

Polibits, 2010
The parallel corpus is a necessary resource in many multi/cross lingual natural language processing applications that include Machine Translation and Cross Lingual Information Retreival. Preparation of large scale parallel corpus takes time and also demands the linguistics skill.
Thoudam Doren Singh   +1 more
openaire   +1 more source

Fuzzy Influenced Process to Generate Comparable to Parallel Corpora

ACM Transactions on Asian and Low-Resource Language Information Processing, 2023
Data-driven supervised approaches rely on the parallel corpus. Due to lack of data and resources availability, it has become more difficult to achieve accurate outputs. In addition, the efficiency of the machine translation system depends on the quality of the used corpora.
Debajyoty Banik   +2 more
openaire   +1 more source

Studying Anglicisms with Comparable and Parallel Corpora

Belgian Journal of Linguistics, 2007
"Meanings are established in individual languages by contrasts of similar items in semantic fields" and "as a consequence, semantic structures do not match cross-linguistically" (Gorlach 2003: 93). This broad statement about the formation of meaning can be adopted as a theoretical principle underlying the study of lexical borrowing, a phenomenon ...
openaire   +2 more sources

Mining Parallel Data from Comparable Corpora via Triangulation

2011 International Conference on Asian Language Processing, 2011
This paper improves an unsupervised method for extracting parallel sentence pairs from a comparable corpus by using the triangulation through a third language. Before, an unsupervised method for extracting parallel sentence pairs from a comparable corpus has been proposed.
Thi-Ngoc-Diep Do   +2 more
openaire   +1 more source

Home - About - Disclaimer - Privacy