Results 51 to 60 of about 2,430,336 (342)

Automated Construction of Arabic-English Parallel Corpus [PDF]

open access: yesJournal of the ACS Advances in Computer Science, 2009
Large-scale parallel corpus has become a reliable resource to cross the language barriers between the user and the web. These parallel texts provide the primary training material for statistical translation models and testing machine translation systems.
doaj   +1 more source

Building the Spanish-Croatian Parallel Corpus [PDF]

open access: yes, 2020
This paper describes the building of the first Spanish-Croatian unidirectional parallel corpus, which has been constructed at the Faculty of Humanities and Social Sciences of the University of Zagreb. The corpus is comprised of eleven Spanish novels and their translations to Croatian done by six different professional translators.
Mikelenić, Bojana, Tadić, Marko
openaire   +3 more sources

Developing and implementing an English-Spanish literary parallel audio-textual corpus for data-driven ESL learning [PDF]

open access: yesDELTA: Documentação de Estudos em Lingüística Teórica e Aplicada, 2021
The purpose of this paper is to present the LITTERA corpus, an English-Spanish literary parallel speech corpus created for the purpose of language learning, and to sketch out a few pedagogical applications for the study of English phonology by Spanish ...
Michael Lang, Xavier Gómez Guinovart
doaj   +1 more source

JW300: A Wide-Coverage Parallel Corpus for Low-Resource Languages

open access: yesAnnual Meeting of the Association for Computational Linguistics, 2019
Viable cross-lingual transfer critically depends on the availability of parallel texts. Shortage of such resources imposes a development and evaluation bottleneck in multilingual processing.
Zeljko Agic, Ivan Vulic
semanticscholar   +1 more source

Leveraging Multilingual News Websites for Building a Kurdish Parallel Corpus [PDF]

open access: yesACM Trans. Asian Low Resour. Lang. Inf. Process., 2020
Machine translation has been a major motivation of development in natural language processing. Despite the burgeoning achievements in creating more efficient machine translation systems, thanks to deep learning methods, parallel corpora have remained ...
Sina Ahmadi   +2 more
semanticscholar   +1 more source

Bilingual Corpus – Digital Repository for Preservation of Language Heritage

open access: yesDigital Presentation and Preservation of Cultural and Scientific Heritage, 2012
The article briefly reviews bilingual Slovak-Bulgarian/Bulgarian- Slovak parallel and aligned corpus. The corpus is collected and developed as results of the collaboration in the frameworks of the joint research project between Institute of Mathematics ...
Ludmila Dimitrova, Radovan Garabík
doaj   +1 more source

Incrementally and Flexibly Extracting Parallel Corpus from Web [PDF]

open access: yesJisuanji kexue
Extracting parallel corpus from the web is important for machine translation and other multilingual processing tasks.This paper proposes an incremental web parallel corpus extraction method,which incrementally updates language text length statistics for ...
LIU Xiaofeng, ZHENG Yucheng, LI Dongyang
doaj   +1 more source

Modeling Code-Switch Languages Using Bilingual Parallel Corpus

open access: yesAnnual Meeting of the Association for Computational Linguistics, 2020
Language modeling is the technique to estimate the probability of a sequence of words. A bilingual language model is expected to model the sequential dependency for words across languages, which is difficult due to the inherent lack of suitable training ...
Grandee Lee, Haizhou Li
semanticscholar   +1 more source

Language Resources – a Part of World Cultural Heritage

open access: yesDigital Presentation and Preservation of Cultural and Scientific Heritage, 2011
This article briefly reviews multilingual language resources for Bulgarian, developed in the frame of some international projects: the first-ever annotated Bulgarian MTE digital lexical resources, Bulgarian-Polish corpus, Bulgarian-Slovak parallel and ...
Ludmila Dimitrova
doaj   +1 more source

Linguistic Corpora as International Cultural Heritage: The Corpus of Bulgarian and Ukrainian Parallel Texts

open access: yesDigital Presentation and Preservation of Cultural and Scientific Heritage, 2013
The paper relates about our ongoing work on the creation of a corpus of Bulgarian and Ukrainian parallel texts. We discuss some differences in the approaches and the interpretation of some concepts, as well as various problems associated with the ...
Olena Siruk, Ivan Derzhanski
doaj   +1 more source

Home - About - Disclaimer - Privacy