Large Vocabulary Size Improves Large Language Models [PDF]
This paper empirically investigates the relationship between subword vocabulary size and the performance of large language models (LLMs) to provide insights on how to define the vocabulary size. Experimental results show that larger vocabulary sizes lead to better performance in LLMs.
arxiv
Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies [PDF]
Research on scaling large language models (LLMs) has primarily focused on model parameters and training data size, overlooking the role of vocabulary size. We investigate how vocabulary size impacts LLM scaling laws by training models ranging from 33M to 3B parameters on up to 500B characters with various vocabulary configurations.
arxiv
Bridging Social Innovation with Forest and Landscape Restoration
Mitigating climate change, preventing mass species extinctions, improving rural livelihoods, and disaster risk reduction are among today's most urgent challenges. To meet these challenges, a large number of social actors need to agree to engage and act collectively on Forest and Landscape Restoration (FLR), ensuring its dual goal of restoring ...
Aurélio Padovezi +3 more
wiley +1 more source
El enriquecimiento lexical en inglés desde la lectura libre y voluntaria de textos auténticos en las carreras técnicas universitarias / The enrichment of the English lexicon based on the free voluntary reading of authentic texts in university technical majorings [PDF]
The paper deals with the enrichment of the English lexicon by students in university technical majors. The starting point is the establishment of intertextual relationships through the free voluntary reading of authentic texts related to this area of ...
Pedro Fabricio Molina García
doaj
Communications and discussion. Repetition versus recall in memorizing vocabularies; The effect of continuous exercise and of rest upon difficult mental multiplication. [PDF]
Edward L. Thorndike
openalex +1 more source
Listening to Hong Kong children's perspectives through pretend play
Quality in early childhood education and care (ECEC) has become an increasing concern in recent years. The issue has been regularly discussed by different stakeholders. However, the rising concern regarding quality in ECEC has not seriously taken into account children's perspectives.
Suzannie K. Y. Leung
wiley +1 more source
The Issue of Relationship in Lotman’s Writing and the Russian Language Today
The article is devoted to the memory of Y.M. Lotman, who attended lectures at the Faculty of Philology of the St. Petersburg (Leningrad) University as a schoolboy and was a student of philology during the so-called “Leningrad Affair” (1946-1950).
Verbitskaya L.A.
doaj +1 more source
This paper reports on the findings of a natural experiment based on a sample of 1123 children aged 4–8 from the provinces of Punjab in Pakistan, and Gujarat in India. It looks at the impact of attendance (or not) in early schooling on the cognitive and social–emotional development of young children.
Nadia Siddiqui +7 more
wiley +1 more source