Results 21 to 30 of about 3,792 (286)
The use of English, Czech and French punctuation marks in reference, parallel and comparable web corpora: a question of methodology [PDF]
This paper analyses the frequency of six punctuation marks (the comma, period, colon, semicolon, question mark and exclamation mark) in three languages (English, French and Czech) in three different types of corpora — comparable web corpora, large ...
Olga Nádvorníková
doaj
Parallel Sentence Alignment from Biomedical Comparable Corpora
Parallel sentences provide semantically similar information which can vary on a given dimension, such as language or register. Parallel sentences with register variation (like expert and non-expert documents) can be exploited for the automatic text simplification.
Cardon, Rémi, Grabar, Natalia
openaire +3 more sources
zNLP: Identifying Parallel Sentences in Chinese-English Comparable Corpora [PDF]
This paper describes the zNLP system for the BUCC 2017 shared task. Our system identifies parallel sentence pairs in Chinese-English comparable corpora by translating word-by-word Chinese sentences into English, using the search engine Solr to select near-parallel sentences and then by using an SVM classifier to identify true parallel sentences from ...
Zheng Zhang 0019, Pierre Zweigenbaum
openaire +1 more source
In this article, we explain how electronic corpora can be used as translation tools (CAT tools) to complement other, more “traditional” tools like on-line dictionaries, electronic glossaries, translation memories, or machine translation.
Rudy Loock
doaj +1 more source
Pasti dat: srovnatelnost dat jazykových korpusů [PDF]
Despite the apparent unambiguity of data provided by corpora, the data reflect different composition of the corpora, different conceptions of the synchronic period of a given language, different linguistic traditions, different orthography and other ...
Markus Giger, Jana Kocková
doaj +1 more source
Electronic Corpora for Legal English Translator/ Interpreter Training - A Case Study
This article looks into the experience of using parallel and comparable corpora in the training of future legal English translators and interpreters between English and Montenegrin.
Bulatović Vesna
doaj +1 more source
Digital Corpora and their Applications in Semantic Studies and Lexicography
Digital Corpora and their Applications in Semantic Studies and Lexicography The paper describes the first Bulgarian-Polish digital resources — parallel and comparable corpora, and their applications in the semantic studies and lexicography for creation
Ludmila Dimitrova +1 more
doaj +1 more source
Such and sådan – the same but different
English such has a variety of uses, which nearly always involve the expression of similarity, and such can therefore be called a ʻsimilativeʼ. Swedish sådan has very similar uses.
Johan van der Auwera, Evie Coussé
doaj +1 more source
Building bilingual dictionaries from parallel web documents
In this paper we describe a system for automatically constructing a bilingual dictionary for cross-language information retrieval applications. We describe how we automatically target candidate parallel documents, filter the candidate documents and ...
Ounis, I. +8 more
core +1 more source
Wiki-Translator: Multilingual Experiments for In-Domain Translations [PDF]
The benefits of using comparable corpora for improving translation quality for statistical machine translators have been already shown by various researchers.
Dan Tufis +2 more
doaj

