REALIZAREA CATEGORIEI COERENŢEI ÎN SCRIEREA ACADEMICĂ A STUDENŢILOR
În baza unui corpus constituit din texte ştiinţifice scrise de studenţi, am analizat mai multe tipuri de greşeli de coerenţă. În abordarea noastră am utilizat idei şi sugestii oferite de lingvistica textului, bazându-ne în special pe şcoala lingvistică ...
Elena VARZARI
doaj
Tendências de tradução de mexicanismos em roteiros e episódios das séries televisivas Chaves e Chapolin: análise com base na linguística de corpus e na tradução audiovisual [PDF]
Tese (doutorado) - Universidade Federal de Santa Catarina, Centro de Comunicação e Expressão, Programa de Pós-Graduação em Estudos da Tradução, Florianópolis, 2013.Esta pesquisa, por um lado, teve o objetivo de identificar tendências de tradução de ...
Santos, Orlanda Miranda
core
Benchmarking Large Language Models for Polymer Property Predictions
Large language models (LLMs) are fine‐tuned on polymer thermal property datasets to directly predict glass transition, melting, and decomposition temperatures from SMILES inputs. Compared to state‐of‐the‐art models such as Polymer Genome, polyGNN, and polyBERT, LLMs achieve competitive yet lower accuracy.
Sonakshi Gupta +3 more
wiley +1 more source
Using collocation analysis to reveal the construction of minority groups: The case of refugees, asylum seekers and immigrants in the UK press. [PDF]
Refugees, asylum seekers, and immigrants (henceforth RASIM) coming into the UK have attracted increased press attention (Greenslade, 2005). As their representation in the press can construct their identity (Duffy and Rowden, 2005: 6, in Greenslade, 2005:
McEnery, Tony +3 more
core
Vagueness and referential ambiguity in a large-scale annotated corpus
In this paper, we argue that difficulties in the definition of coreference itself contribute to lower inter-annotator agreement in certain cases. Data from a large referentially annotated corpus serves to corroborate this point, using a quantitative ...
Versley, Yannick
core +1 more source
Semantic Embeddings of Chemical Elements for Enhanced Materials Inference and Discovery
ElementBERT extracts semantic embeddings of chemical elements from 1.29 million alloy‐related abstracts, providing robust descriptors that improve prediction accuracy by up to 23% across titanium, high‐entropy, and shape memory alloys, with demonstrated generalization on alloy compositions reported in 2025.
Yunze Jia +7 more
wiley +1 more source
DeepSeek‐Lattice‐KG integrates a domain‐adapted 14B LLM with a Neo4j lattice knowledge graph distilled from 50,000 papers. It analyzes queries, retrieves supporting subgraphs, and generates grounded answers; on a 2100‐question, six‐domain benchmark, it achieves 94.8% accuracy.
Zhiyang Shu +6 more
wiley +1 more source
Search for near-duplicate texts in the linguistic corpus VepKar
Fedor Bykov, Andrew Krizhanovsky
openaire +1 more source
Linguistic Strategies in Propagandistic Texts: A Corpus-Based Discourse Analysis
This article examines the linguistic characteristics of propagandistic texts based on the Propaganda and Disinformation corpus annotated according to themes (narratives) and propaganda techniques. The aim of the study is to reveal how linguistic means are employed in constructing manipulative communication and ideological influence.
Vilma Zubaitienė +2 more
openaire +1 more source
Lexical bundles in scientific English: A corpus-based study of native and non-native writing [PDF]
[eng] The present dissertation is a corpus-based investigation of the frequency, structure and functions of lexical bundles in published scientific writing in English, whose main objective is the creation of an inventory of the most frequent and ...
Lorenzo Salazar, Danica Joy
core

