Results 1 to 10 of about 971,215 (201)
A learner corpus is born this way: From raw data to processed dataset [PDF]
This data article presents the development of a learner corpus (i.e. a systematic computerized web-based repository of written texts produced by language learners) from the initial phase of the development where written assignments were collected from ...
Chung Hong Danny Leung +2 more
doaj +2 more sources
VALICO-UD: Treebanking an Italian Learner Corpus in Universal Dependencies
This article describes an ongoing project for the development of a novel Italian treebank in Universal Dependencies format: VALICO-UD. It consists of texts written by Italian L2 learners of different mother tongues (German, French, Spanish and English ...
Elisa Di Nuovo +4 more
doaj +2 more sources
The SweLL Language Learner Corpus
The article presents a new language learner corpus for Swedish, SweLL, and the methodology from collection and pesudonymisation to protect personal information of learners to annotation adapted to second language learning.
Elena Volodina +10 more
doaj +2 more sources
The current study responds to the call for increased dialogue among different areas of additionallanguage research. Specifically, we bring together learner corpus research (LCR) and variationist approaches to second language acquisition in order to ...
Aarnes Gudmestad +2 more
exaly +2 more sources
This paper presents a model for combining contrastive analysis and interlanguage analysis. It can be seen as an extension of Granger’s (1996) Integrated Contrastive Model, but it explicitly requires a bidirectional parallel corpus for the contrastive ...
Hilde Hasselgård +1 more
doaj +2 more sources
The MuLeCo project: A learner corpus of L1 German learners of Romance languages
The importance of learner corpora for foreign language acquisition research as well as their role in data-driven learning and other learning contexts is now widely recognised.
Stephan Lücke +3 more
doaj +2 more sources
Saudi Learner Translation Corpus: The design and compilation of an English-Arabic learner translation corpus. [PDF]
This article introduces the Saudi Learner Translation Corpus (SauLTC), an innovative multi-version English–Arabic parallel corpus featuring part-of-speech tagging. We describe the corpus parameters and compilation process and explain how textual processing and sentence alignment are conducted.
Al-Harthi M +4 more
europepmc +3 more sources
Error annotation in a Learner Corpus of Portuguese.
We present the error tagging system of the COPLE2 corpus and the first results of its implementation.. The system takes advantage of the corpus architecture and the possibilities of the TEITOK environment to reduce manual effort and produce a final standoff, multilevel annotation with position-based tags that account for the main error types observed ...
Mendes, Amália, del Río, Iria
openaire +4 more sources
Learner Corpus in German as a Data Source for Education and Science
One example of the digitalization of education is the creation of a linguistic learner corpus of student papers in a foreign language at an educational institution in order to use this corpus for research, teaching and learner analytics.
Irina Kotiurova, Liudmila Shchegoleva
doaj +1 more source
This article describes the design and construction of the Tracking Written Learner Language (TRAWL) Corpus. The corpus combines several features that are all rare for learner corpora: it is longitudinal, following individual pupils over several years; it
Hildegunn Dirdal +4 more
semanticscholar +1 more source

