Results 21 to 30 of about 2,430,336 (342)
YouTube-ASL: A Large-Scale, Open-Domain American Sign Language-English Parallel Corpus [PDF]
Machine learning for sign languages is bottlenecked by data. In this paper, we present YouTube-ASL, a large-scale, open-domain corpus of American Sign Language (ASL) videos and accompanying English captions drawn from YouTube.
David C. Uthus +2 more
semanticscholar +1 more source
DEplain: A German Parallel Corpus with Intralingual Translations into Plain Language for Sentence and Document Simplification [PDF]
Text simplification is an intralingual translation task in which documents, or sentences of a complex source text are simplified for a target audience.
Regina Stodden +2 more
semanticscholar +1 more source
AVATAR: A Parallel Corpus for Java-Python Program Translation [PDF]
Program translation refers to migrating source code from one programming language to another. It has tremendous practical value in software development, as porting software across languages is time-consuming and costly.
W. Ahmad +3 more
semanticscholar +1 more source
DuRecDial 2.0: A Bilingual Parallel Corpus for Conversational Recommendation [PDF]
In this paper, we provide a bilingual parallel human-to-human recommendation dialog dataset (DuRecDial 2.0) to enable researchers to explore a challenging task of multilingual and cross-lingual conversational recommendation.
Zeming Liu +4 more
semanticscholar +1 more source
ParCourE: A Parallel Corpus Explorer for a Massively Multilingual Corpus [PDF]
The Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language ...
Ayyoob Imani +4 more
openaire +2 more sources
This study adopts a corpus-assisted approach to explore the translation strategies that Netflix subtitlers opted for in rendering 1564 English swear words into Arabic.
Hussein Abu-Rayyash +2 more
semanticscholar +1 more source
Parallel corpora are vital components in several applications of Natural Language Processing (NLP), particularly in machine translation. In this paper, we present a novel method for automatically creating parallel sentences from comparable corpora.
Maha Jarallah Althobaiti
doaj +1 more source
Icelandic Parallel Abstracts Corpus
We present a new Icelandic-English parallel corpus, the Icelandic Parallel Abstracts Corpus (IPAC), composed of abstracts from student theses and dissertations. The texts were collected from the Skemman repository which keeps records of all theses, dissertations and final projects from students at Icelandic universities. The corpus was aligned based on
Haukur Barri Símonarson +1 more
openaire +2 more sources
Research in data-driven methods for Machine Translation has greatly benefited from the increasing availability of parallel corpora. Processing the same text in two different languages yields useful information on how words and phrases are translated from a source language into a target language.
De Pauw, G +2 more
openaire +3 more sources
The rise of the internet has generated a need for fast online translations, which human translators cannot meet. Statistical tools such as Google and Baidu Translate provide automatic translation from one written language to another.
Muhammad Afzaal +3 more
semanticscholar +1 more source

