Results 21 to 30 of about 2,430,336 (342)

YouTube-ASL: A Large-Scale, Open-Domain American Sign Language-English Parallel Corpus [PDF]

open access: yesNeural Information Processing Systems, 2023
Machine learning for sign languages is bottlenecked by data. In this paper, we present YouTube-ASL, a large-scale, open-domain corpus of American Sign Language (ASL) videos and accompanying English captions drawn from YouTube.
David C. Uthus   +2 more
semanticscholar   +1 more source

DEplain: A German Parallel Corpus with Intralingual Translations into Plain Language for Sentence and Document Simplification [PDF]

open access: yesAnnual Meeting of the Association for Computational Linguistics, 2023
Text simplification is an intralingual translation task in which documents, or sentences of a complex source text are simplified for a target audience.
Regina Stodden   +2 more
semanticscholar   +1 more source

AVATAR: A Parallel Corpus for Java-Python Program Translation [PDF]

open access: yesAnnual Meeting of the Association for Computational Linguistics, 2021
Program translation refers to migrating source code from one programming language to another. It has tremendous practical value in software development, as porting software across languages is time-consuming and costly.
W. Ahmad   +3 more
semanticscholar   +1 more source

DuRecDial 2.0: A Bilingual Parallel Corpus for Conversational Recommendation [PDF]

open access: yesConference on Empirical Methods in Natural Language Processing, 2021
In this paper, we provide a bilingual parallel human-to-human recommendation dialog dataset (DuRecDial 2.0) to enable researchers to explore a challenging task of multilingual and cross-lingual conversational recommendation.
Zeming Liu   +4 more
semanticscholar   +1 more source

ParCourE: A Parallel Corpus Explorer for a Massively Multilingual Corpus [PDF]

open access: yesProceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing: System Demonstrations, 2021
The Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language ...
Ayyoob Imani   +4 more
openaire   +2 more sources

Strategies of translating swear words into Arabic: a case study of a parallel corpus of Netflix English-Arabic movie subtitles

open access: yesHumanities and Social Sciences Communications, 2023
This study adopts a corpus-assisted approach to explore the translation strategies that Netflix subtitlers opted for in rendering 1564 English swear words into Arabic.
Hussein Abu-Rayyash   +2 more
semanticscholar   +1 more source

A Simple Yet Robust Algorithm for Automatic Extraction of Parallel Sentences: A Case Study on Arabic-English Wikipedia Articles

open access: yesIEEE Access, 2022
Parallel corpora are vital components in several applications of Natural Language Processing (NLP), particularly in machine translation. In this paper, we present a novel method for automatically creating parallel sentences from comparable corpora.
Maha Jarallah Althobaiti
doaj   +1 more source

Icelandic Parallel Abstracts Corpus

open access: yesCoRR, 2021
We present a new Icelandic-English parallel corpus, the Icelandic Parallel Abstracts Corpus (IPAC), composed of abstracts from student theses and dissertations. The texts were collected from the Skemman repository which keeps records of all theses, dissertations and final projects from students at Icelandic universities. The corpus was aligned based on
Haukur Barri Símonarson   +1 more
openaire   +2 more sources

The SAWA corpus [PDF]

open access: yesProceedings of the First Workshop on Language Technologies for African Languages - AfLaT '09, 2009
Research in data-driven methods for Machine Translation has greatly benefited from the increasing availability of parallel corpora. Processing the same text in two different languages yields useful information on how words and phrases are translated from a source language into a target language.
De Pauw, G   +2 more
openaire   +3 more sources

Automated and Human Interaction in Written Discourse: A Contrastive Parallel Corpus-based Investigation of Metadiscourse Features in Machine-Human Translations

open access: yesSAGE Open, 2022
The rise of the internet has generated a need for fast online translations, which human translators cannot meet. Statistical tools such as Google and Baidu Translate provide automatic translation from one written language to another.
Muhammad Afzaal   +3 more
semanticscholar   +1 more source

Home - About - Disclaimer - Privacy