Results 51 to 60 of about 123,842 (329)
A Survey of Available Corpora for Building Data-Driven Dialogue Systems [PDF]
During the past decade, several areas of speech and language understanding have witnessed substantial breakthroughs from the use of data-driven models. In the area of dialogue systems, the trend is less obvious, and most practical systems are still built
Iulian Serban+4 more
semanticscholar +1 more source
Semantic Analysis of Kiswahili Words Using the Self Organizing Map
Acquisition of semantic knowledge to support natural language processing tasks is a non-trivial task, and more so if manually undertaken. This paper presents an automatic lexical acquisition method that learns semantic properties of Kiswahili words ...
Wanjiku Ng'ang'a
doaj +1 more source
Learning Crosslingual Word Embeddings without Bilingual Corpora [PDF]
Crosslingual word embeddings represent lexical items from different languages in the same vector space, enabling transfer of NLP tools. However, previous attempts had expensive resource requirements, difficulty incorporating monolingual data or were ...
Long Duong+4 more
semanticscholar +1 more source
Estado de embriaguez en la fraseología española-croata: algunos apuntes contrastivos [PDF]
Dentro del campo de la investigación fraseológica contrastiva (en este caso, español-croata) destaca la alta productividad y frecuencia de uso en la lengua hablada de las unidades fraseológicas (UF) referidas al estado de embriaguez.
Mušura, Josipa+1 more
doaj
The IMPACT project Polish Ground-Truth texts as a Djvu corpus
The IMPACT project Polish Ground-Truth texts as a Djvu corpus The purpose of the paper is twofold. First, to describe the already implemented idea of DjVu corpora, i.e.
Janusz S. Bień
doaj +1 more source
The Onsager Equation for Corpora [PDF]
We consider extensions of excluded volume interactions for complex corpora that generalize simple rod-like particles. The Onsager equation can be defined for quite general configuration spaces, and the dimension reduction of the phase space in the limit of highly intense interaction can be shown.
openaire +3 more sources
Landscape of BRAF transcript variants in human cancer
We investigate the annotation of BRAF variants, focusing on protein‐coding BRAF‐220 (formerly BRAF‐reference) and BRAF‐204 (BRAF‐X1). The IsoWorm pipeline allows us to quantify these variants in human cancer, starting from RNA‐sequencing data. BRAF‐204 is more abundant than BRAF‐220 and impacts patient survival.
Maurizio S. Podda+5 more
wiley +1 more source
Keeping up with the digital age: New data sources in research on languages for specific purposes [PDF]
Social media exchanges (for example, via Facebook or Twitter), blogs, and forums, amongst many other electronic genres, have come to be used as relatively bona fide testimonies of language use nowadays.
Amanda Roig-Marín
doaj
Language and the pandemic: The construction of semantic frames in Greek-German comparison [PDF]
This paper aims to provide an insight into the way native speakers of different first languages (L1) who live in the same country and are therefore influenced to the same degree by the current Covid-19 pandemic (e.g.
Nikolaos Katsaounis
doaj +1 more source
This review highlights how foundation models enhance predictive healthcare by integrating advanced digital twin modeling with multiomics and biomedical data. This approach supports disease management, risk assessment, and personalized medicine, with the goal of optimizing health outcomes through adaptive, interpretable digital simulations, accessible ...
Sakhaa Alsaedi+2 more
wiley +1 more source