Creating Training Corpora for NLG Micro-Planners
In this paper, we present a novel framework for semi-automatically creating linguistically challenging micro-planning data-to-text corpora from existing Knowledge Bases.
Claire Gardent+3 more
semanticscholar +1 more source
Conceptualization of the Sensory Experience: A Frame-Based Approach
Background. The search for reliable means of establishing relations between sensory qualia and their conceptualization in language has given rise to several approaches, both philosophical and psychophysical.
Georgiy B. Blinnikov
doaj +1 more source
¿Qué hacer con textos que no se pueden publicar? Datos derivados, criterios FAIR y TEI
In certain circumstances, data prepared by projects cannot be published openly. In the last decades different projects have published derived data (also called extracted features) to avoid this problem.
José Calvo Tello, Nanette Rißler-Pipka
doaj +1 more source
Labeled LDA: A supervised topic model for credit attribution in multi-labeled corpora
A significant portion of the world's text is tagged by readers on social bookmarking websites. Credit attribution is an inherent problem in these corpora because most pages have multiple tags, but the tags do not always apply with equal specificity ...
Daniel Ramage+3 more
semanticscholar +1 more source
JSUT and JVS: Free Japanese voice corpora for accelerating speech synthesis research
In this paper, we develop two corpora for speech synthesis research. Thanks to improvements in machine learning techniques, including deep learning, speech synthesis is becoming a machine learning task.
Shinnosuke Takamichi+6 more
semanticscholar +1 more source
The web as a corpus: a resource for translation
[full article, abstract in English; abstract in Lithuanian] Accessing ready-made corpora may not always be easy. This is especially true for less dominant languages such as Persian, for which the number of available corpora is very limited.
Helia Vaezian
doaj +1 more source
The paper "Es ist vieles getan, es bleibt vieles zu tun." ("We have achieved a lot, there is still a lot to achieve.") presents a method for integrating a digital corpus into a German teaching-learning scenario.
Eva Schaeffer-Lacroix
doaj +1 more source
Corpora Annotated with Negation: An Overview
Negation is a universal linguistic phenomenon with a great qualitative impact on natural language processing applications. The availability of corpora annotated with negation is essential to training negation processing systems.
Salud María Jiménez-Zafra+3 more
semanticscholar +1 more source
Contrastive Research in Translation Study: Corpus Approach
This article discusses some issues in applying the corpus approach to the professional activities of a linguist-translator. It describes the collection of English-language corpora created by Mark Davies, Professor of Linguistics at Brigham ...
Elena M. Kovalenko
doaj +1 more source
Emotion analysis in socially unacceptable discourse
Texts often express the writer’s emotional state, and emotion information has been shown to have potential for hate speech detection and analysis. In this work, we present a methodology for quantitative analysis of emotion in text.
Jasmin Franza+2 more
doaj +1 more source