Bootstrapping parallel corpora [PDF]
We present two methods for the automatic creation of parallel corpora. Whereas previous work into the automatic construction of parallel corpora has focused on harvesting them from the web, we examine the use of existing parallel corpora to bootstrap data for new language pairs.
Chris Callison-Burch, Miles Osborne
openaire +1 more source
This chapter gives an overview of parallel corpora, i.e. corpora containing source texts in a given language, aligned with their translations in another language. More specifically, it focuses on directional corpora, i.e. parallel corpora where the source and target languages are clearly identified. These types of corpora are widely used in contrastive
openaire +2 more sources
Learning languages from parallel corpora
This work describes a blueprint for an application that generates language learning exercises from parallel corpora. Word alignment and parallel structures allow for the automatic assessment of sentence pairs in the source and target languages, while users of the application continuously improve the quality of the data with their interactions, thus ...
openaire +4 more sources
Aligning sentences in parallel corpora [PDF]
In this paper we describe a statistical technique for aligning sentences with their translations in two parallel corpora. In addition to certain anchor points that are available in our data, the only information about the sentences that we use for calculating alignments is the number of tokens that they contain.
Peter F. Brown +2 more
openaire +1 more source
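The idea in the abstract above, aligning sentences using only their token counts, can be illustrated with a small dynamic-programming sketch. This is not the authors' statistical model: the bead penalties, the mismatch weight, and the names `BEADS`, `bead_cost`, and `align` are all assumptions made for illustration, and the anchor points mentioned in the abstract are not modelled.

```python
from typing import Dict, List, Tuple

# Allowed alignment "beads" as (source sentences, target sentences).
# The penalty values are hypothetical, chosen only to prefer 1-1 pairs.
BEADS: Dict[Tuple[int, int], float] = {
    (1, 1): 0.0,
    (1, 0): 5.0,
    (0, 1): 5.0,
    (2, 1): 2.0,
    (1, 2): 2.0,
}


def bead_cost(src_tokens: int, tgt_tokens: int, penalty: float) -> float:
    """Heuristic cost: bead penalty plus a weighted relative length mismatch."""
    total = src_tokens + tgt_tokens
    mismatch = abs(src_tokens - tgt_tokens) / total if total else 0.0
    return penalty + 10.0 * mismatch


def align(src_lens: List[int], tgt_lens: List[int]) -> List[Tuple[List[int], List[int]]]:
    """Return the minimum-cost bead sequence given token counts per sentence."""
    n, m = len(src_lens), len(tgt_lens)
    inf = float("inf")
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    back = [[None] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0

    # Forward pass: extend every reachable prefix pair by one bead.
    for i in range(n + 1):
        for j in range(m + 1):
            if cost[i][j] == inf:
                continue
            for (di, dj), penalty in BEADS.items():
                ni, nj = i + di, j + dj
                if ni > n or nj > m:
                    continue
                step = bead_cost(sum(src_lens[i:ni]), sum(tgt_lens[j:nj]), penalty)
                if cost[i][j] + step < cost[ni][nj]:
                    cost[ni][nj] = cost[i][j] + step
                    back[ni][nj] = (i, j)

    # Trace back from the end to recover the chosen beads.
    beads = []
    i, j = n, m
    while (i, j) != (0, 0):
        pi, pj = back[i][j]
        beads.append((list(range(pi, i)), list(range(pj, j))))
        i, j = pi, pj
    beads.reverse()
    return beads
```

Calling `align([10, 5, 6], [10, 11])` pairs the first sentences one-to-one and merges the two short source sentences against the second target sentence, because their combined token count matches it exactly.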
Automatic alignment in parallel corpora [PDF]
This paper addresses the alignment issue in the framework of exploitation of large bi-/multilingual corpora for translation purposes. A generic alignment scheme is proposed that can meet varying requirements of different applications. Depending on the level at which alignment is sought, appropriate surface linguistic information is invoked coupled with ...
Harris Papageorgiou +2 more
openaire +1 more source
Tumour–host interactions in Drosophila: mechanisms in the tumour micro‐ and macroenvironment
This review examines how tumour–host crosstalk takes place at multiple levels of biological organisation, from local cell competition and immune crosstalk to organism‐wide metabolic and physiological collapse. Here, we integrate findings from Drosophila melanogaster studies that reveal conserved mechanisms through which tumours hijack host systems to ...
José Teles‐Reis, Tor Erik Rusten
wiley +1 more source
Parallel Sentence Alignment from Biomedical Comparable Corpora
Parallel sentences provide semantically similar information that can vary along a given dimension, such as language or register. Parallel sentences with register variation (such as expert and non-expert documents) can be exploited for automatic text simplification.
Cardon, Rémi, Grabar, Natalia
openaire +3 more sources
ABSTRACT Objective Super‐Refractory Status Epilepticus (SRSE) is a rare, life‐threatening neurological emergency with unclear etiology in many cases. Mitochondrial dysfunction, often due to disease‐causing genetic variants, is increasingly recognized as a cause, with each gene producing distinct pathophysiological mechanisms.
Pouria Mohammadi +2 more
wiley +1 more source
Diagnostic Utility of the ATG9A Ratio in AP‐4–Associated Hereditary Spastic Paraplegia
ABSTRACT Adaptor protein complex 4–associated hereditary spastic paraplegia (AP‐4‐HSP), a childhood‐onset neurogenetic disorder and frequent mimic of cerebral palsy, is caused by biallelic variants in the adaptor protein complex 4 (AP‐4) subunit genes (AP4B1 [for SPG47], AP4M1 [for SPG50], AP4E1 [for SPG51], and AP4S1 [for SPG52]).
Habibah A. P. Agianda +12 more
wiley +1 more source
D4.3 Survey specific parallel corpora
This document describes the Multilingual Corpus of Survey Questionnaires (MCSQ), a database of survey questionnaire texts. The report summarizes technical information about Version 1.0 (Ada Lovelace) of the MCSQ, dated June 2020. It links to the repository that holds the code and files used to generate the database.
Zavala-Rojas, Diana, Danielly Sorato
openaire +2 more sources

