
Named entity transliteration with comparable corpora [PDF]

open access: yes, Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the ACL - ACL '06, 2006
In this paper we investigate Chinese-English name transliteration using comparable corpora: corpora where texts in the two languages deal with some of the same topics, and therefore share references to named entities, but are not translations of each other.
Richard Sproat, Tao Tao, ChengXiang Zhai
openaire   +1 more source

Electronic Corpora for Legal English Translator/ Interpreter Training - A Case Study

open access: yes, Romanian Journal of English Studies, 2018
This article looks into the experience of using parallel and comparable corpora in the training of future legal English translators and interpreters between English and Montenegrin.
Bulatović Vesna
doaj   +1 more source

The compilation of comparable corpora in the business field and its relevance to translation and terminology

open access: yes, Calidoscópio, 2010
This article highlights the importance of Corpus Linguistics, particularly the compilation of specialized comparable corpora, to the field of Translation as well as to the practice of translation itself.
Adriane Orenha
doaj   +1 more source

Feature-based method for document alignment in comparable news corpora [PDF]

open access: yes, 2009
In this paper, we present a feature-based method to align documents with similar content across two sets of bilingual comparable corpora from daily news texts.
Ai Ti Aw, Min Zhang, Thuy Vu
core   +1 more source

Set-Theoretic Alignment for Comparable Corpora [PDF]

open access: yes, Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2016
We describe and evaluate a simple method to extract parallel sentences from comparable corpora. The approach, termed STACC, is based on expanded lexical sets and the Jaccard similarity coefficient. We evaluate our system against state-of-the-art methods on a large range of datasets in different domains, for ten language pairs, showing that it either ...
Thierry Etchegoyhen, Andoni Azpeitia
openaire   +1 more source

L’utilisation des corpus électroniques chez le traducteur professionnel : quand ? comment ? pour quoi faire ?

open access: yes, ILCEA, 2016
In this article, we explain how electronic corpora can be used as translation tools (CAT tools) to complement other, more “traditional” tools like on-line dictionaries, electronic glossaries, translation memories, or machine translation.
Rudy Loock
doaj   +1 more source

Application of multilingual corpus in contrastive studies (on the example of the Bulgarian-Polish-Lithuanian parallel corpus)

open access: yes, Cognitive Studies | Études cognitives, 2015
In this paper we present applications of a trilingual corpus in language research.
Ludmila Dimitrova   +3 more
doaj   +1 more source

The discourse of instructions - between cognitive and anthropological linguistics [PDF]

open access: yes, Južnoslovenski Filolog, 2014
In this paper, the discourse of instructions corpus (DIC), composed of Bunjevac speech and the Kosovo-Resava and Prizren-Timok dialects, spoken across a wide area of Serbia, is analyzed using theoretical and methodological approaches common to anthropological ...
Ćirković Svetlana M.
doaj   +1 more source

Termhood-based Comparability Metrics of Comparable Corpus in Special Domain

open access: yes, 2013
Cross-Language Information Retrieval (CLIR) and machine translation (MT) resources, such as dictionaries and parallel corpora, are scarce and hard to come by for special domains.
C.Y. Kit, T. Talvensaari
core   +1 more source

Exploring Reusability and Reproducibility for a Research Infrastructure for L1 and L2 Learner Corpora

open access: yes, Information, 2021
To date, research in various educational and linguistic domains, such as learner corpus research, writing research, and second language acquisition, has produced a substantial amount of research data in the form of L1 and L2 learner corpora.
Alexander König   +2 more
doaj   +1 more source
