Results 1 to 10 of about 248,393 (200)
FedMed: A Federated Learning Framework for Language Modeling [PDF]
Federated learning (FL) is a privacy-preserving technique for training a vast amount of decentralized data and making inferences on mobile devices. As a typical language modeling problem, mobile keyboard prediction aims at suggesting a probable next word
Xing Wu, Zhaowang Liang, Jianjia Wang
doaj +2 more sources
The sociolinguistic foundations of language modeling [PDF]
In this article, we introduce a sociolinguistic perspective on language modeling. We claim that language models in general are inherently modeling varieties of language, and we consider how this insight can inform the development and deployment of ...
Jack Grieve +9 more
doaj +2 more sources
Multi-task learning (MTL) approaches are actively used for various natural language processing (NLP) tasks. The Multi-Task Deep Neural Network (MT-DNN) has contributed significantly to improving the performance of natural language understanding (NLU ...
Suhyune Son +4 more
doaj +1 more source
Next-Generation Neural Networks: Capsule Networks With Routing-by-Agreement for Text Classification
These days, neural networks constantly prove their high capacity for nearly every application case and are considered as key technology for learning systems.
Nikolai A. K. Steur, Friedhelm Schwenker
doaj +1 more source
Contemporary Approaches in Evolving Language Models
This article provides a comprehensive survey of contemporary language modeling approaches within the realm of natural language processing (NLP) tasks.
Dina Oralbekova +4 more
doaj +1 more source
Plain Template Insertion: Korean-Prompt-Based Engineering for Few-Shot Learners
Prompt-based learning is a method used for language models to interpret natural language by remembering the prior knowledge acquired and the training objective.
Jaehyung Seo +7 more
doaj +1 more source
Automatic story and item generation for reading comprehension assessments with transformers
Reading comprehension is one of the essential skills for students as they make a transition from learning to read to reading to learn. Over the last decade, the increased use of digital learning materials for promoting literacy skills (e.g., oral fluency
Seyma Nur Yildirim-erbasli, Okan Bulut
doaj +1 more source
Red Teaming Language Models with Language Models
Language Models (LMs) often cannot be deployed because of their potential to harm users in hard-to-predict ways. Prior work identifies harmful behaviors before deployment by using human annotators to hand-write test cases. However, human annotation is expensive, limiting the number and diversity of test cases.
Ethan Perez +8 more
openaire +2 more sources
New RNN Activation Technique for Deeper Networks: LSTCM Cells
Long short-term memory (LSTM) has shown good performance when used with sequential data, but gradient vanishing or exploding problem can arise, especially when using deeper layers to solve complex problems. Thus, in this paper, we propose a new LSTM cell
Soo-Han Kang, Ji-Hyeong Han
doaj +1 more source
Language Models are not Models of Language.
Natural Language Processing (NLP) has become one of the leading application areas in the current Artificial Intelligence boom. Transfer learning has enabled large deep learning neural networks trained on the language modeling task to vastly improve performance in almost all downstream language tasks.
openaire +7 more sources

