Learner corpus - Open Access .click


The Varieties of English for Specific Purposes dAtabase (VESPA): Towards a multi-L1 and multi-register learner corpus of disciplinary writing

Research in Corpus Linguistics, 2022
The Varieties of English for Specific Purposes dAtabase (VESPA first release) is the result of an international corpus compilation project that aims to address the lack of large-scale, open access, multi-L1, multi-discipline and multi-register learner ...
M. Paquot +8 more
semanticscholar +2 more sources

Inter-rater reliability in Learner Corpus Research

2020
In Learner Corpus Research (LCR), a common source of errors stems from manual coding and annotation of linguistic features. To estimate the amount of error present in a coded dataset, coefficients of inter-rater reliability are used. However, despite ...
Tove Larsson, M. Paquot, Luke Plonsky
semanticscholar +2 more sources

Lexical simplification in learner translation: A corpus-based approach

Corpus-based Translation Studies (CBTS), 2023
The advance of corpus-based methodology in translation studies has greatly enhanced our understanding of the nature of translational language. While most research efforts have focused on identifying the unique features of translations carried out by ...
Ho Ling Kwok, Sara Laviosa, Kanglong Liu
semanticscholar +2 more sources

Building a learner corpus [PDF]

Language Resources and Evaluation, 2014
The need for data about the acquisition of Czech by non-native learners prompted the compilation of the first learner corpus of Czech. After introducing its basic design and parameters, including a multi-tier manual annotation scheme and error taxonomy, we focus on the more technical aspects: the transcription of hand-written source texts, process of ...
Jirka Hana +3 more
openaire +1 more source

The CELI corpus: Design and linguistic annotation of a new online learner corpus

Second Language Research, 2023
This article introduces the CELI corpus, a new learner corpus of written Italian consisting of ca. 600,000 tokens, evenly distributed among CEFR (Common European Framework of Reference for Languages) proficiency levels B1, B2, C1 and C2.
S. Spina +3 more
semanticscholar +1 more source

Investigating Chinese learner corpus research and learner corpora: Main features, critical issues and future pathways

Kervan. International Journal of Afro-Asiatic Studies, 2022
This article investigates the current state of Chinese Learner Corpus Research (CLCR) and outlines its main features and critical issues. Thirty years have passed since the compilation of the first Chinese learner corpus.
Alessia Iurato
doaj +1 more source

Creating a learner corpus infrastructure: Experiences from making learner corpora available [PDF]

ITM Web of Conferences, 2020
With language resources being collected in many - also small - projects in learner corpus research with considerate amounts of time and effort spent in this activity, making these types of data available in a FAIR way, with standardized and reasoned ...
Jennifer-Carmen Frey, Alexander König, Darja Fišer +2 more
doaj +1 more source

The Jinan Chinese Learner Corpus [PDF]

Proceedings of the Tenth Workshop on Innovative Use of NLP for Building Educational Applications, 2015
We present the Jinan Chinese Learner Corpus, a large collection of L2 Chinese texts produced by learners that can be used for educational tasks. The present work introduces the data and provides a detailed description. Currently, the corpus contains approximately 6 million Chinese characters written by students from over 50 different L1 backgrounds ...
Maolin Wang, Shervin Malmasi, Mingxuan Huang +2 more
openaire +1 more source

Corpus of Academic Learner English (CALE): A new corpus at the intersection of corpus linguistics and English for academic purposes

Literacy Trek, 2020
Offering insights into varieties of English language, corpus compilation practices and corpus-based research have great potential to enhance the field of English for Academic Purposes (EAP).
Ayşe Şahin Kızıl
doaj +1 more source

Learner Corpus Research Meets Chinese as a Second Language Acquisition: Achievements and Challenges

Annali di Ca’ Foscari: Serie Orientale, 2022
The article sheds light on Chinese as a Second Language Learner Corpus Research, emphasising advances and lacks in this field. First, the paper describes the potential of learner corpora in the investigation of learner language. Second, it provides an ...
Alessia Iurato
doaj +1 more source
