
Exercise attenuates polycystic ovary syndrome development via improved mitochondrial proteostasis.

open access: yes, Am J Physiol Endocrinol Metab
Shin C   +9 more

Working with parallel corpora

Studies in Corpus Linguistics, 2019
Although parallel corpora are vital for cross-linguistic and natural language processing (NLP) research, most have been designed for just one particular purpose, which may unnecessarily restrict their usefulness and usability. My argument is that the usefulness of existing parallel corpora increases exponentially when data so obtained are combined with ...
Rosa Rabadan

Comparable parallel corpora

Studies in Corpus Linguistics, 2019
Are papers presented in corpus-based translation studies truly scientific? These are normally done on only one language pair, often on purpose-made parallel corpora, and can normally not be replicated. Therefore their value is limited in a strictly scientific sense.

A Statistical View on Bilingual Lexicon Extraction: From Parallel Corpora to Non-Parallel Corpora

open access: yes, 1998
We present two problems in statistically extracting a bilingual lexicon: (1) How can noisy parallel corpora be used? (2) How can non-parallel yet comparable corpora be used? We describe our own work and contribution in relaxing the constraint of using only clean parallel corpora.
Pascale Fung

Parallel subtitle corpora and their applications in machine translation and translatology

open access: yes, Perspectives: Studies in Translation Theory and Practice, 2013
SUMAT is a project funded through the EU ICT Policy Support Programme (2011–2014). It involves four subtitling companies (InVision, DDS, Titelbild, VSI) and five technical partners (ALS, ATC, TextShuttle, University of Maribor, Vicomtech). For the SUMAT ...
Martin Volk, Panayota Georgakopoulou

Quantifying the utility of parallel corpora

Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval, 2001
Our English-Chinese cross-language IR system is trained from parallel corpora; we investigate its performance as a function of training corpus size for three different training corpora. We find that the performance of the system as trained on the three parallel corpora can be related by a simple measure, namely the out-of-vocabulary rate of query words.
Martin Franz   +3 more

Mining Patents for Parallel Corpora

2008
Masao Utiyama, Hitoshi Isahara

Dual Subtitles as Parallel Corpora

2018
In this paper, we leverage the existence of dual subtitles as a source of parallel data. Dual subtitles present viewers with two languages simultaneously and are generally aligned at the segment level, which removes the need to perform this alignment automatically.
Shikun Zhang, Wang Ling, Chris Dyer
