Results 1 to 10 of about 213,157 (286)
Learning languages from parallel corpora
This work describes a blueprint for an application that generates language learning exercises from parallel corpora. Word alignment and parallel structures allow for the automatic assessment of sentence pairs in the source and target languages, while ...
Johannes Graën
doaj +4 more sources
The acquisition of a second language requires the construction or reconstruction of linguistic knowledge about the new language system. Learners of a second language have to acquire the linguistic structures of the second language by constructing or ...
Daniel Grégoire Grevisse +2 more
doaj +1 more source
Creation of Spanish Language Corpora as One of the Priorities of RAE in the Era of Digitalization
The creation of Spanish language corpora is becoming one of the priorities of the Royal Academy of the Spanish Language (RAE) in the 21st century. The work on the compilation of Spanish language corpora is a response to the challenges of the modern era ...
I. I. Gorelaya, Y. R. Ziganshina
doaj +1 more source
Freely Available Arabic Corpora: A Scoping Review
Background: Corpora play a vital role when training machine learning (ML) models and building systems that use natural language processing (NLP). It can be challenging for researchers to access corpora in a language other than English, and even more so ...
Arfan Ahmed +5 more
doaj +1 more source
This article describes the development of the digital infrastructure at a research data centre for audio-visual linguistic research data, the Hamburg Centre for Language Corpora (HZSK) at the University of Hamburg in Germany, over the past ten years. The
Hanna Hedeland
doaj +1 more source
Language Identification as part of the Text Corpus Creation Pipeline at the Language Bank of Finland
The Language Bank of Finland hosts text corpora originating from Finland. Two of the most used ones are the Newspaper and Periodical Corpus of the National Library of Finland and the Suomi24 Corpus. The Language Bank has received considerable additions
Tommi Jauhiainen +3 more
doaj +1 more source
Review of corpus tools for vocabulary teaching and learning
This review aims to introduce corpora as useful tools for facilitating vocabulary teaching and learning. Corpora have long been applied to improve learner language learning, but their direct implication in classroom teaching is rare.
Ma Qing, Mei Fang
doaj +1 more source
In no uncertain terms : a dataset for monolingual and multilingual automatic term extraction from comparable corpora [PDF]
Automatic term extraction is a productive field of research within natural language processing, but it still faces significant obstacles regarding datasets and evaluation, which require manual term annotation.
Hoste, Veronique +2 more
core +2 more sources
The health and life science domains are well known for their wealth of named entities found in large free text corpora, such as scientific literature and electronic health records.
Nona Naderi +12 more
doaj +1 more source
The company that words keep: comparing the statistical structure of child- versus adult-directed language [PDF]
Does child-directed language differ from adult-directed language in ways that might facilitate word learning? Associative structure (the probability that a word appears with its free associates), contextual diversity, word repetitions and frequency were ...
Block +14 more
core +1 more source

