Results 31 to 40 of about 91,129 (293)
Finding Translation Examples for Under-Resourced Language Pairs or for Narrow Domains; the Case for Machine Translation [PDF]
The cyberspace is populated with valuable information sources, expressed in about 1500 different languages and dialects. Yet, for the vast majority of WEB surfers this wealth of information is practically inaccessible or meaningless.
Dan Tufis
doaj
Multilingual digital resources with Bulgarian language
Multilingual digital resources with Bulgarian language The paper presents in brief Bulgarian language resources as a part of multilingual digital resources developed in the frame of some international projects, among them parallel annotated and aligned
Ludmila Dimitrova
doaj +1 more source
Multidimensional (MD) analysis contributes to a comprehensive linguistic description of second language (L2) writing texts. However, cross-linguistic studies based on MD analysis are still insufficient and need to be increased.
Yujiao Zhang
doaj +1 more source
An Approach for Chinese-Japanese Named Entity Equivalents Extraction Using Inductive Learning and Hanzi-Kanji Mapping Table [PDF]
Named Entity Translation Equivalents extraction plays a critical role in machine translation (MT) and cross language information retrieval (CLIR). Traditional methods are often based on large-scale parallel or comparable corpora.
Araki, Kenji +4 more
core +1 more source
Fachkommunikation, Popularisierung, Übersetzung: Empirische Vergleiche am Beispiel der Nominalphrase im Englischen und Deutschen [PDF]
This article examines the contrasts and commonalities between languages for specific purposes (LSP) and their popularizations on the one hand and the frequency patterns of LSP register features in English and German on the other. For this purpose corpora
Silvia Hansen-Schirra +3 more
doaj +2 more sources
Building and Aligning Comparable Corpora
Comparable corpus is a set of topic aligned documents in multiple languages, which are not necessarily translations of each other. These documents are useful for multilingual natural language processing when there is no parallel text available in some domains or languages.
Motaz Saad +2 more
openaire +2 more sources
Comparing corpora and lexical ambiguity [PDF]
In this paper we compare two types of corpus, focusing on the lexical ambiguity of each of them. The first corpus consists mainly of newspaper articles and literature excerpts, while the second belongs to the medical domain. To conduct the study, we have used two different disambiguation tools.
Patrick Ruch, Arnaud Gaudinat
openaire +2 more sources
Beyond Bilingual: Multi-sense Word Embeddings using Multilingual Context
Word embeddings, which represent a word as a point in a vector space, have become ubiquitous to several NLP tasks. A recent line of work uses bilingual (two languages) corpora to learn a different vector for each sense of a word, by exploiting ...
Chang, Kai-Wei +4 more
core +1 more source
Noisy-parallel and comparable corpora filtering methodology for the extraction of bi-lingual equivalent data at sentence level [PDF]
Text alignment and text quality are critical to the accuracy of Machine Translation (MT) systems, some NLP tools, and any other text processing tasks requiring bilingual data.
Wołk, Krzysztof
core +3 more sources
ABSTRACT Introduction Progressive Supranuclear Palsy (PSP) is a neurodegenerative ‘tauopathy’ with predominating pathology in the basal ganglia and midbrain. Caudal tau spread frequently implicates the cerebellum; however, the pattern of atrophy remains equivocal.
Chloe Spiegel +8 more
wiley +1 more source

