Multilingual Language Processing From Bytes
We describe an LSTM-based model which we call Byte-to-Span (BTS) that reads text as bytes and outputs span annotations of the form [start, length, label] where start positions, lengths, and labels are separate entries in our vocabulary.
Brunk, Cliff +3 more
core +1 more source
THE IMPACT OF VOCABULARY INSTRUCTION ON VOCABULARY SIZE LEVELS OF STUDENTS
The purpose of this study is to determine the difference between two groups of students as regards to vocabulary size levels after the implementation of vocabulary enhancement activities. While the students in control group followed the regular curriculum including the second one thousand most frequent words in English, the students in experimental ...
Toprakoğlu, Mert, Dilman, Hakan
openaire +2 more sources
This study investigates the predictive value of an academic vocabulary size screening test in Dutch for early academic achievement in higher education, in a context where Dutch is the predominant L1 of instruction.
Pieterjan Bonne, Jordi Casteleyn
doaj +1 more source
Biomedical Terminologies and Ontologies: Enabling Biomedical Semantic Interoperability and Standards in Europe [PDF]
In the management of biomedical data, vocabularies such as ontologies and terminologies (O/Ts) are used for (i) domain knowledge representation and (ii) interoperability. The knowledge representation role supports the automated reasoning on, and analysis
Brochhausen, Mathias +5 more
core
Predicting the out-of-vocabulary rate and the required vocabulary size for speech processing applications [PDF]
The paper describes an approach for predicting both the vocabulary size and the resulting out-of-vocabulary rate (OOV rate) for a hypothetical extension of an existing text corpus. By splitting the original corpus into two different sub corpora, vocabulary and OOV rate can be determined for that special constellation.
Johannes Müller 0004 +2 more
openaire +1 more source
Assessing English Students` Vocabulary Size of Lampung State Islamic University
The aims of this research were to measure the English students’ vocabulary size as well to measure the students’ vocabulary size which was less and more than 1000 words.
Iwan Kurniawan
doaj +3 more sources
Effective Cross-lingual Transfer of Neural Machine Translation Models without Shared Vocabularies
Transfer learning or multilingual model is essential for low-resource neural machine translation (NMT), but the applicability is limited to cognate languages by sharing their vocabularies.
Gao, Yingbo, Kim, Yunsu, Ney, Hermann
core
LEXTALE_CH: A quick, character-based proficiency test for Mandarin Chinese [PDF]
Research in second language acquisition suggests that objective performance-based assessments may provide more reliable and valid measures of second language proficiency than subjective self-ratings. To measure proficiency in English as a second language,
Chan, I. Lei, Chang, Charles B.
core
Slim Embedding Layers for Recurrent Neural Language Models
Recurrent neural language models are the state-of-the-art models for language modeling. When the vocabulary size is large, the space taken to store the model parameters becomes the bottleneck for the use of recurrent neural language models. In this paper,
Kulhanek, Raymond +4 more
core +1 more source
Vocabulary Learning Strategies And Vocabulary Size Of The Indonesian Senior High Students [PDF]
Penelitian ini bertujuan untuk meneliti hubungan antara strategi belajar kosa kata siswa dengan penguasaan kosa kata bahasa Inggris. Subyek penelitian ini sebanyak 120 siswa pada tahun ke dua di MAN 1 Bandar Lampung.
Mahpul, M. (Mahpul) +2 more
core +1 more source

