The Varieties of English for Specific Purposes dAtabase (VESPA first release) is the result of an international corpus compilation project that aims to address the lack of large-scale, open access, multi-L1, multi-discipline and multi-register learner ...
M. Paquot +8 more
semanticscholar +2 more sources
Inter-rater reliability in Learner Corpus Research
In Learner Corpus Research (LCR), a common source of error is the manual coding and annotation of linguistic features. To estimate the amount of error present in a coded dataset, coefficients of inter-rater reliability are used. However, despite
Tove Larsson, M. Paquot, Luke Plonsky
semanticscholar +2 more sources
Lexical simplification in learner translation: A corpus-based approach
The advance of corpus-based methodology in translation studies has greatly enhanced our understanding of the nature of translational language. While most research efforts have focused on identifying the unique features of translations carried out by ...
Ho Ling Kwok, Sara Laviosa, Kanglong Liu
semanticscholar +2 more sources
Building a learner corpus [PDF]
The need for data about the acquisition of Czech by non-native learners prompted the compilation of the first learner corpus of Czech. After introducing its basic design and parameters, including a multi-tier manual annotation scheme and error taxonomy, we focus on the more technical aspects: the transcription of hand-written source texts, process of ...
Jirka Hana +3 more
openaire +1 more source
The CELI corpus: Design and linguistic annotation of a new online learner corpus
This article introduces the CELI corpus, a new learner corpus of written Italian consisting of ca. 600,000 tokens, evenly distributed among CEFR (Common European Framework of Reference for Languages) proficiency levels B1, B2, C1 and C2.
S. Spina +3 more
semanticscholar +1 more source
This article investigates the current state of Chinese Learner Corpus Research (CLCR) and outlines its main features and critical issues. Thirty years have passed since the compilation of the first Chinese learner corpus.
Alessia Iurato
doaj +1 more source
Creating a learner corpus infrastructure: Experiences from making learner corpora available [PDF]
With language resources being collected in many - also small - projects in learner corpus research with considerable amounts of time and effort spent in this activity, making these types of data available in a FAIR way, with standardized and reasoned ...
Jennifer-Carmen Frey +2 more
doaj +1 more source
The Jinan Chinese Learner Corpus [PDF]
We present the Jinan Chinese Learner Corpus, a large collection of L2 Chinese texts produced by learners that can be used for educational tasks. The present work introduces the data and provides a detailed description. Currently, the corpus contains approximately 6 million Chinese characters written by students from over 50 different L1 backgrounds ...
Maolin Wang +2 more
openaire +1 more source
Offering insights into varieties of English language, corpus compilation practices and corpus-based research have great potential to enhance the field of English for Academic Purposes (EAP).
Ayşe Şahin Kızıl
doaj +1 more source
Learner Corpus Research Meets Chinese as a Second Language Acquisition: Achievements and Challenges
The article sheds light on Chinese as a Second Language Learner Corpus Research, emphasising advances and gaps in this field. First, the paper describes the potential of learner corpora in the investigation of learner language. Second, it provides an
Alessia Iurato
doaj +1 more source

