Text and corpus linguistics - Open Access .click

REALIZAREA CATEGORIEI COERENŢEI ÎN SCRIEREA ACADEMICĂ A STUDENŢILOR [Realizing the Category of Coherence in Students' Academic Writing]

Studia Universitatis Moldaviae: Stiinte Umanistice, 2015
Based on a corpus of scientific texts written by students, we analyzed several types of coherence errors. In our approach we drew on ideas and suggestions from text linguistics, relying in particular on the linguistic school ...
Elena VARZARI
doaj

Tendências de tradução de mexicanismos em roteiros e episódios das séries televisivas Chaves e Chapolin: análise com base na linguística de corpus e na tradução audiovisual [Translation tendencies of Mexicanisms in scripts and episodes of the television series Chaves and Chapolin: an analysis based on corpus linguistics and audiovisual translation] [PDF]

2013
Doctoral thesis, Universidade Federal de Santa Catarina, Centro de Comunicação e Expressão, Programa de Pós-Graduação em Estudos da Tradução, Florianópolis, 2013. This research aimed, on the one hand, to identify translation tendencies of ...
Santos, Orlanda Miranda
core

Benchmarking Large Language Models for Polymer Property Predictions

Macromolecular Rapid Communications, EarlyView.
Large language models (LLMs) are fine‐tuned on polymer thermal property datasets to directly predict glass transition, melting, and decomposition temperatures from SMILES inputs. Compared to state‐of‐the‐art models such as Polymer Genome, polyGNN, and polyBERT, LLMs achieve competitive yet lower accuracy.
Sonakshi Gupta +3 more
wiley +1 more source

Using collocation analysis to reveal the construction of minority groups: The case of refugees, asylum seekers and immigrants in the UK press. [PDF]

2007
Refugees, asylum seekers, and immigrants (henceforth RASIM) coming into the UK have attracted increased press attention (Greenslade, 2005). As their representation in the press can construct their identity (Duffy and Rowden, 2005: 6, in Greenslade, 2005:
McEnery, Tony +3 more
core

Vagueness and referential ambiguity in a large-scale annotated corpus

2009
In this paper, we argue that difficulties in the definition of coreference itself contribute to lower inter-annotator agreement in certain cases. Data from a large referentially annotated corpus serves to corroborate this point, using a quantitative ...
Versley, Yannick
core +1 more source

Semantic Embeddings of Chemical Elements for Enhanced Materials Inference and Discovery

Materials Genome Engineering Advances, EarlyView.
ElementBERT extracts semantic embeddings of chemical elements from 1.29 million alloy‐related abstracts, providing robust descriptors that improve prediction accuracy by up to 23% across titanium, high‐entropy, and shape memory alloys, with demonstrated generalization on alloy compositions reported in 2025.
Yunze Jia +7 more
wiley +1 more source

DeepSeek‐Lattice‐KG: A Compact Language Model With Knowledge Graph Augmentation for Lattice Structure Design

Materials Genome Engineering Advances, EarlyView.
DeepSeek‐Lattice‐KG integrates a domain‐adapted 14B LLM with a Neo4j lattice knowledge graph distilled from 50,000 papers. It analyzes queries, retrieves supporting subgraphs, and generates grounded answers; on a 2100‐question, six‐domain benchmark, it achieves 94.8% accuracy.
Zhiyang Shu +6 more
wiley +1 more source

Search for near-duplicate texts in the linguistic corpus VepKar

Proceedings of the Karelian Research Centre of the Russian Academy of Sciences, 2023
Fedor Bykov, Andrew Krizhanovsky
openaire +1 more source

Linguistic Strategies in Propagandistic Texts: A Corpus-Based Discourse Analysis

Lietuvių kalba
This article examines the linguistic characteristics of propagandistic texts based on the Propaganda and Disinformation corpus annotated according to themes (narratives) and propaganda techniques. The aim of the study is to reveal how linguistic means are employed in constructing manipulative communication and ideological influence.
Vilma Zubaitienė +2 more
openaire +1 more source

Lexical bundles in scientific English: A corpus-based study of native and non-native writing [PDF]

2011
The present dissertation is a corpus-based investigation of the frequency, structure and functions of lexical bundles in published scientific writing in English, whose main objective is the creation of an inventory of the most frequent and ...
Lorenzo Salazar, Danica Joy
core