SentiGOLD: A Large Bangla Gold Standard Multi-Domain Sentiment Analysis Dataset and its Evaluation [PDF]
This study introduces SentiGOLD, a Bangla multi-domain sentiment analysis dataset. Comprising 70,000 samples, it was created from diverse sources and annotated by a gender-balanced team of linguists. SentiGOLD adheres to established linguistic conventions agreed upon by the Government of Bangladesh and a Bangla linguistics committee. Unlike English and
arxiv
ENTROPY OF LANGUAGE SYSTEM AS MAIN DEVELOPMENT INDICATOR
This article examines the entropy of the language system, one of the main concepts of synergetics and synergy. Entropy can be applied to the description of language processes and to the detection of functioning and development features of language ...
Ирина Михайловна Некипелова
doaj
Distilling Knowledge from Text-to-Image Generative Models Improves Visio-Linguistic Reasoning in CLIP [PDF]
Image-text contrastive models like CLIP have wide applications in zero-shot classification, image-text retrieval, and transfer learning. However, they often struggle on compositional visio-linguistic tasks (e.g., attribute-binding or object-relationships) where their performance is no better than random chance. To address this, we introduce SDS-CLIP, a
arxiv
ChatGPT is a Potential Zero-Shot Dependency Parser [PDF]
Pre-trained language models have been widely used in the dependency parsing task and have achieved significant improvements in parser performance. However, it remains an understudied question whether pre-trained language models can spontaneously exhibit the ability of dependency parsing without introducing additional parser structure in the zero-shot ...
arxiv
A novel coronavirus was identified in December 2019. The WHO named the disease caused by this virus COVID-19. The respiratory virus has been spreading rapidly, causing a global pandemic. To prevent infection, governments all over the world have compelled their citizens to maintain physical distance and stay at home.
Adhitya, Galant Nanta+2 more
openaire +2 more sources
PHRASEMES IN ANDRIĆ’S SHORT STORIES AND THEIR TRANSLATION EQUIVALENTS IN GERMAN LANGUAGE [PDF]
Phraseology as a linguistic discipline does not have a long tradition, but that does not diminish its importance, either within one language or in contrasting two languages.
Magdalena Ramljak, Marija Musa
doaj
A Massively Multilingual Analysis of Cross-linguality in Shared Embedding Space [PDF]
In cross-lingual language models, representations for many different languages live in the same space. Here, we investigate the linguistic and non-linguistic factors affecting sentence-level alignment in cross-lingual pretrained language models for 101 languages and 5,050 language pairs.
arxiv
MORPHOLOGICAL AND MORPHOPHONEMIC PROCESS (NATURE, TYPES, AND RULES)
Morphology, or morphemics, is defined as the study of morphemes and their arrangement in building larger morphological constructions. A morph is a physical form representing a morpheme in a language. A morpheme is the minimal unit of linguistics in a
Dwi Astuti Wahyu Nurhayati
doaj +1 more source
Pedagogical Efficacy of Planned v.s unplanned focus on form Instruction in Scaffolding transitional Devices used by Iranian EFL Learners in writing Paragraphs [PDF]
The efficacy of metalinguistic awareness in improving EFL learners’ grammatical knowledge and writing quality has long been an area of great interest in applied linguistics (Brown, 2001).
Azizzolah Dabaghi+2 more
doaj
A Critical Analysis of the British Newspapers’ Coverage of the Underweight and Overweight [PDF]
By triangulating Corpus Linguistics with Critical Discourse Analysis, this paper sought to discover how the British popular press and quality press represented the two polar ends of body image with respect to weight: underweight and overweight. The
Zeynep Cihan Koca-Helvacı
doaj +1 more source