Results 21 to 30 of about 91,129 (293)
Named entity transliteration with comparable corpora [PDF]
In this paper we investigate Chinese-English name transliteration using comparable corpora, corpora where texts in the two languages deal in some of the same topics --- and therefore share references to named entities --- but are not translations of each other.
Richard Sproat, Tao Tao, ChengXiang Zhai
openaire +1 more source
Electronic Corpora for Legal English Translator/ Interpreter Training - A Case Study
This article looks into the experience of using parallel and comparable corpora in the training of future legal English translators and interpreters between English and Montenegrin.
Bulatović Vesna
doaj +1 more source
This article aims at highlighting the importance of Corpus Linguistics particularly to the compiling of specialized comparable corpora to the field of Translation as well as to the practice of translation itself.
Adriane Orenha
doaj +1 more source
Featurebased method for document alignment in comparable news corpora [PDF]
In this paper, we present a feature-based method to align documents with similar content across two sets of bilingual comparable corpora from daily news texts.
Ai Ti Aw, Min Zhang, Thuy Vu
core +1 more source
Set-Theoretic Alignment for Comparable Corpora [PDF]
We describe and evaluate a simple method to extract parallel sentences from comparable corpora. The approach, termed STACC, is based on expanded lexical sets and the Jaccard similarity coefficient. We evaluate our system against state-of-theart methods on a large range of datasets in different domains, for ten language pairs, showing that it either ...
Thierry Etchegoyhen, Andoni Azpeitia
openaire +1 more source
In this article, we explain how electronic corpora can be used as translation tools (CAT tools) to complement other, more “traditional” tools like on-line dictionaries, electronic glossaries, translation memories, or machine translation.
Rudy Loock
doaj +1 more source
Application of multilingual corpus in contrastive studies (on the example of the Bulgarian-Polish-Lithuanian parallel corpus) In this paper we present applications of a trilingual corpus in language research.
Ludmila Dimitrova +3 more
doaj +1 more source
The discourse of instructions - between cognitive and anthropological linguistics [PDF]
In this paper, the discourse of instructions corpus (DIC), composed of Bunjevac speech, Kosovo-Resava and Prizren-Timok dialects, spoken in a wide area in Serbia, was applied theoretical and methodological approaches common to anthropological ...
Ćirković Svetlana M.
doaj +1 more source
Termhood-based Comparability Metrics of Comparable Corpus in Special Domain
Cross-Language Information Retrieval (CLIR) and machine translation (MT) resources, such as dictionaries and parallel corpora, are scarce and hard to come by for special domains.
C.Y. Kit, T. Talvensaari, T. Talvensaari
core +1 more source
Up until today research in various educational and linguistic domains such as learner corpus research, writing research, or second language acquisition has produced a substantial amount of research data in the form of L1 and L2 learner corpora.
Alexander König +2 more
doaj +1 more source

