
Finding predominant word senses in untagged text [PDF]

open access: yes, 2004
In word sense disambiguation (WSD), the heuristic of choosing the most common sense is extremely powerful because the distribution of the senses of a word is often skewed.
Carroll, John   +3 more
core   +3 more sources

Beyond Bilingual: Multi-sense Word Embeddings using Multilingual Context

open access: yes, 2017
Word embeddings, which represent a word as a point in a vector space, have become ubiquitous to several NLP tasks. A recent line of work uses bilingual (two languages) corpora to learn a different vector for each sense of a word, by exploiting ...
Chang, Kai-Wei   +4 more
core   +1 more source

Creating a bilingual dictionary of collocations: A learner-oriented approach

open access: yes, Indonesian Journal of Applied Linguistics, 2021
Considering the lack of specialised dictionaries in certain fields, a creative way of teaching through corpora-based work was proposed in a seminar for master’s students of translation studies (University of Ljubljana, Slovenia).
Sonia Vaupot
doaj   +1 more source

Germline TP53 Mutations Causing Diamond–Blackfan Anemia: A French Report

open access: yes, Pediatric Blood & Cancer, EarlyView.
ABSTRACT Diamond–Blackfan anemia is a rare congenital erythroblastopenia typically caused by mutations in ribosomal protein genes. Recently, gain‐of‐function mutations in TP53 have been identified as a novel cause of Diamond–Blackfan anemia. We report two French patients who both harbored a heterozygous TP53 deletion (NM_000546.5: c.1077delA; p ...
Rafael Moisan   +6 more
wiley   +1 more source

Keeping up with the digital age: New data sources in research on languages for specific purposes [PDF]

open access: yes, Ibérica, 2017
Social media exchanges (for example, via Facebook or Twitter), blogs, and forums, amongst many other electronic genres, have come to be used as relatively bona fide testimonies of language use nowadays.
Amanda Roig-Marín
doaj  

Learner Corpora

open access: yes, 2020
This chapter deals with learner corpora, that is, collections of (spoken and/or written) texts produced by learners of a language. It describes their main characteristics, with particular emphasis on those that are distinctive of learner corpora. Special types of corpora are introduced, such as longitudinal learner corpora or local learner corpora. The ...
openaire   +2 more sources

Performance of the Charniak-Lease parser on biological text using different training corpora [PDF]

open access: yes, 2008
POS tagging is used as the first step in many NLP workflows, although the accuracy of tag assignment frequently goes unchecked. We hypothesize that changing the training corpora for a parser will affect its POS tagging of a target corpus.
Alison V. Callahan, Michel Dumontier
core   +1 more source

Linking neurogenesis, oligodendrogenesis, and myelination defects to neurodevelopmental disruption in primary mitochondrial disorders

open access: yes, FEBS Letters, EarlyView.
Mitochondrial remodeling shapes neural and glial lineage progression by matching metabolic supply with demand. Elevated OXPHOS supports differentiation and myelin formation, while myelin compaction lowers mitochondrial dependence, revealing mitochondria as key drivers of developmental energy adaptation.
Sahitya Ranjan Biswas   +3 more
wiley   +1 more source

Analysis of sentences in textbooks of Estonian as a second language and its application to the automatic selection of dictionary example sentences for different proficiency levels

open access: yes, Eesti Rakenduslingvistika Ühingu Aastaraamat, 2019
The aim of the article is to develop versions of the Estonian module of GDEX (Good Dictionary Example), the good-example-sentence tool of the corpus query system Sketch Engine, that help identify in a corpus sentences corresponding to different language proficiency levels, with different lexical, syntactic and ...
Kristina Koppel
doaj   +1 more source

On creating a large-scale corpus-based academic multi-word unit resource

open access: yes, Vocabulary Learning and Instruction, 2020
This study outlines the steps taken to create an academic multi-word unit list derived from corpus data. It gives details on the procedure used and the rationale behind why certain approaches were utilised.
James Rogers
doaj   +1 more source
