Exploring the Use of Foundation Models for Named Entity Recognition and Lemmatization Tasks in Slavic Languages [PDF]
This paper describes Adam Mickiewicz University's (AMU) solution for the 4th Shared Task on SlavNER. The task involves the identification, categorization, and lemmatization of named entities in Slavic languages. Our approach involved exploring the use of foundation models for these tasks.
Gabriela Pałka, Artur Nowakowski
arxiv +3 more sources
CLASSLA-Stanza: The Next Step for Linguistic Processing of South Slavic Languages [PDF]
We present CLASSLA-Stanza, a pipeline for automatic linguistic annotation of the South Slavic languages, which is based on the Stanza natural language processing pipeline. We describe the main improvements in CLASSLA-Stanza with respect to Stanza, and give a detailed description of the model training process for the latest 2.1 release of the pipeline ...
Luka Tercon, Nikola Ljubesic
arxiv +3 more sources
Resources and Few-shot Learners for In-context Learning in Slavic Languages [PDF]
Despite the rapid recent progress in creating accurate and compact in-context learners, most recent work focuses on in-context learning (ICL) for tasks in English. However, the ability to interact with users of languages outside English presents a great potential for broadening the applicability of language technologies to non-English speakers.
Michal vStef'anik+3 more
arxiv +3 more sources
Hierarchical Ambiguities in Copula Coordinate Structures in Slovene and Other Slavic Languages [PDF]
The nature of coordination, including derivation and constituency in coordinate structures, was debated by earlier writers, such as Aristotle and Dionysius Thrax, up to the present day by linguists such as Bloomfield and Chomsky.
Donald F. Reindl
openalex +2 more sources
Spoken Corpora of Slavic Languages
Spoken corpora are collections of transcribed and annotated audio and /or video recordings of languages or language varieties. The aim of this paper is to present an overview of 51 spoken corpora currently available for Slavic languages and dialects, in ...
N. Dobrushina, Elena Sokur
semanticscholar +2 more sources
Slavic languages – “SVO” languages without SVO qualities?
Slavic languages are commonly classified as SVO languages, with an exceptional property, though, namely an atypically extensive variability of word order.
H. Haider, Luka Szucsich
semanticscholar +3 more sources
Cross-Domain Adaptation of Spoken Language Identification for Related Languages: The Curious Case of Slavic Languages [PDF]
State-of-the-art spoken language identification (LID) systems, which are based on end-to-end deep neural networks, have shown remarkable success not only in discriminating between distant languages but also between closely-related languages or even different spoken varieties of the same language.
Badr M. Abdullah+3 more
arxiv +3 more sources
Rediscovering the Slavic Continuum in Representations Emerging from Neural Models of Spoken Language Identification [PDF]
Deep neural networks have been employed for various spoken language recognition tasks, including tasks that are multilingual by definition such as spoken language identification. In this paper, we present a neural model for Slavic language identification in speech signals and analyze its emergent representations to investigate whether they reflect ...
Badr M. Abdullah+4 more
arxiv +3 more sources
Exploiting Cross-Dialectal Gold Syntax for Low-Resource Historical Languages: Towards a Generic Parser for Pre-Modern Slavic [PDF]
This paper explores the possibility of improving the performance of specialized parsers for pre-modern Slavic by training them on data from different related varieties. Because of their linguistic heterogeneity, pre-modern Slavic varieties are treated as low-resource historical languages, whereby cross-dialectal treebank data may be exploited to ...
Nilo Pedrazzini
arxiv +3 more sources
Analysis of Features and Classifiers in Emotion Recognition Systems: Case Study of Slavic Languages [PDF]
Today’s human-computer interaction systems have a broad variety of applications in which automatic human emotion recognition is of great interest. Literature contains many different, more or less successful forms of these systems. This work emerged as an
Željko Nedeljković+2 more
openalex +2 more sources