Results 51 to 60 of about 2,430,336 (342)
Automated Construction of Arabic-English Parallel Corpus [PDF]
Large-scale parallel corpus has become a reliable resource to cross the language barriers between the user and the web. These parallel texts provide the primary training material for statistical translation models and testing machine translation systems.
doaj +1 more source
Building the Spanish-Croatian Parallel Corpus [PDF]
This paper describes the building of the first Spanish-Croatian unidirectional parallel corpus, which has been constructed at the Faculty of Humanities and Social Sciences of the University of Zagreb. The corpus is comprised of eleven Spanish novels and their translations to Croatian done by six different professional translators.
Mikelenić, Bojana, Tadić, Marko
openaire +3 more sources
Developing and implementing an English-Spanish literary parallel audio-textual corpus for data-driven ESL learning [PDF]
The purpose of this paper is to present the LITTERA corpus, an English-Spanish literary parallel speech corpus created for the purpose of language learning, and to sketch out a few pedagogical applications for the study of English phonology by Spanish ...
Michael Lang, Xavier Gómez Guinovart
doaj +1 more source
JW300: A Wide-Coverage Parallel Corpus for Low-Resource Languages
Viable cross-lingual transfer critically depends on the availability of parallel texts. Shortage of such resources imposes a development and evaluation bottleneck in multilingual processing.
Zeljko Agic, Ivan Vulic
semanticscholar +1 more source
Leveraging Multilingual News Websites for Building a Kurdish Parallel Corpus [PDF]
Machine translation has been a major motivation of development in natural language processing. Despite the burgeoning achievements in creating more efficient machine translation systems, thanks to deep learning methods, parallel corpora have remained ...
Sina Ahmadi +2 more
semanticscholar +1 more source
Bilingual Corpus – Digital Repository for Preservation of Language Heritage
The article briefly reviews bilingual Slovak-Bulgarian/Bulgarian- Slovak parallel and aligned corpus. The corpus is collected and developed as results of the collaboration in the frameworks of the joint research project between Institute of Mathematics ...
Ludmila Dimitrova, Radovan Garabík
doaj +1 more source
Incrementally and Flexibly Extracting Parallel Corpus from Web [PDF]
Extracting parallel corpus from the web is important for machine translation and other multilingual processing tasks.This paper proposes an incremental web parallel corpus extraction method,which incrementally updates language text length statistics for ...
LIU Xiaofeng, ZHENG Yucheng, LI Dongyang
doaj +1 more source
Modeling Code-Switch Languages Using Bilingual Parallel Corpus
Language modeling is the technique to estimate the probability of a sequence of words. A bilingual language model is expected to model the sequential dependency for words across languages, which is difficult due to the inherent lack of suitable training ...
Grandee Lee, Haizhou Li
semanticscholar +1 more source
Language Resources – a Part of World Cultural Heritage
This article briefly reviews multilingual language resources for Bulgarian, developed in the frame of some international projects: the first-ever annotated Bulgarian MTE digital lexical resources, Bulgarian-Polish corpus, Bulgarian-Slovak parallel and ...
Ludmila Dimitrova
doaj +1 more source
The paper relates about our ongoing work on the creation of a corpus of Bulgarian and Ukrainian parallel texts. We discuss some differences in the approaches and the interpretation of some concepts, as well as various problems associated with the ...
Olena Siruk, Ivan Derzhanski
doaj +1 more source

