Validating choices: Texts in the Trésor de la Langue Française
Paul Fortier, Suzy Santos
doaj +1 more source
Exploração de corpora para extração e descrição de léxico de especialidade [Exploring corpora to extract and describe specialized lexicon]
Exploring corpora to extract specialized lexicon is a common, widely accepted method for building lexical resources. However, the methodologies employed are not explicitly discussed, which makes it difficult to compare and determine ...
Chiara Barbero, Raquel Amaro
doaj +1 more source
Cloves (Syzygium aromaticum), a tree in the Myrtaceae family, are indigenous to the Maluku Islands in Indonesia and are widely utilized as a spice. Essential oils are commonly extracted from clove leaves, flower buds, and stalks.
Jakty Kusuma +7 more
doaj +1 more source
Usage on the move: Evolution and re-volution
One of the problems involved in using corpora to investigate language change is that many corpora are synchronic, particularly spoken ones. To observe change, a combination of methods is the most fruitful approach.
Michael McCarthy
doaj +1 more source
Semi-Supervised Learning for Neural Machine Translation
While end-to-end neural machine translation (NMT) has made remarkable progress recently, NMT systems only rely on parallel corpora for parameter estimation. Since parallel corpora are usually limited in quantity, quality, and coverage, especially for low-
Cheng, Yong +6 more
core +1 more source
Emotion analysis in socially unacceptable discourse
Texts often express the writer’s emotional state, and emotion information has been shown to have potential for hate speech detection and analysis. In this work, we present a methodology for quantitative analysis of emotion in text.
Jasmin Franza +2 more
doaj +1 more source
Four Datasets Derived from an Archive of Personal Homepages (1995–2009)
While data from social media are easily accessible, understanding how individuals expressed themselves on the Internet in its initial years of public availability (the mid-late 1990s) has proved difficult.
Sean C. Rife
doaj +1 more source
This chapter gives an overview of parallel corpora, i.e. corpora containing source texts in a given language, aligned with their translations in another language. More specifically, it focuses on directional corpora, i.e. parallel corpora where the source and target languages are clearly identified. These types of corpora are widely used in contrastive
openaire +2 more sources
MultiMWE: building a multi-lingual multi-word expression (MWE) parallel corpora
Multi-word expressions (MWEs) are a hot topic in research in natural language processing (NLP), including topics such as MWE detection, MWE decomposition, and research investigating the exploitation of MWEs in other NLP fields such as Machine Translation.
Han, Lifeng +2 more
core +1 more source
On documenting language change as it happens
This study examines the grammaticalization of motion verbs in Italian within the periphrastic construction “motion verb + a + infinitive”. Verbs such as andare ‘to go’, venire ‘to come’ and tornare ‘to return’ develop functional uses and express ...
Emanuela Li Destri
doaj +1 more source

