Bibliographic Analysis on Research Publications using Authors, Categorical Labels and the Citation Network [PDF]
Bibliographic analysis considers the author's research areas, the citation network and the paper content among other things. In this paper, we combine these three in a topic model that produces a bibliographic model of authors, topics and documents ...
Buntine, Wray; Lim, Kar Wai
core +1 more source
In the past 25 years, the increasing user-friendliness of concordance software and availability of digital corpora have favoured the development of corpus linguistics.
Joseph Rézeau
doaj +1 more source
An annotated corpus with nanomedicine and pharmacokinetic parameters [PDF]
A vast amount of data on nanomedicines is being generated and published, and natural language processing (NLP) approaches can automate the extraction of unstructured text-based data. Annotated corpora are a key resource for NLP and information extraction
Jimenez, Ivan +2 more
core +2 more sources
Annotating Argument Schemes [PDF]
Argument schemes are abstractions substantiating the inferential connection between premise(s) and conclusion in argumentative communication. Identifying such conventional patterns of reasoning is essential to the interpretation and evaluation of ...
Lawrence, John +4 more
core +5 more sources
CorpVis: An Online Emotional Speech Corpora Visualisation Interface [PDF]
Our research in emotional speech analysis has led to the construction of several dedicated, high-quality online corpora of natural emotional speech assets. The requirements for querying, retrieval and organization of assets based on both their metadata descriptors and their analysis data led to the construction of a suitable interface for data ...
Charlie Cullen +3 more
openaire +3 more sources
Frequency of Multiple Modals with Might on the Web
The paper investigates multiple modals such as might could, might would, might can and would might. These are just a few of the core modal combinations with might that are examined using iWeb and GloWbE, two corpora of online texts ...
Cehan Nadina
doaj +1 more source
The collection of representative corpus samples of both child language and online (CMC) language varieties is crucial for linguistic research that is motivated by applications to the protection of children online. In this paper, we present an extensive survey of corpora available for these two areas.
Baron, Alistair +4 more
openaire +2 more sources
The Discrete Infinite Logistic Normal Distribution [PDF]
We present the discrete infinite logistic normal distribution (DILN), a Bayesian nonparametric prior for mixed membership models. DILN is a generalization of the hierarchical Dirichlet process (HDP) that models correlation structure between the weights ...
Blei, David; Paisley, John; Wang, Chong
core +3 more sources
Bridging the Gap from the Other Side: How Corpora Are Used by English Teachers in Norwegian Schools
Researchers have written of ‘bridging the gap’ between corpus linguistics and teaching practice. This study focuses on in-service English teacher informants from Norwegian schools, to try to address the ‘gap’ from the teaching practice ‘side’, rather ...
Barry Kavanagh
doaj +3 more sources
Creating a bilingual dictionary of collocations: A learner-oriented approach
Considering the lack of specialised dictionaries in certain fields, a creative way of teaching through corpora-based work was proposed in a seminar for master’s students of translation studies (University of Ljubljana, Slovenia).
Sonia Vaupot
doaj +1 more source

