On Language Models for Creoles [PDF]
Creole languages such as Nigerian Pidgin English and Haitian Creole are under-resourced and largely ignored in the NLP literature. Creoles typically result from the fusion of a foreign language with multiple local languages, and what grammatical and lexical features are transferred to the creole is a complex process. While creoles are generally stable,
arxiv
Quasi Error-free Text Classification and Authorship Recognition in a large Corpus of English Literature based on a Novel Feature Set [PDF]
The Gutenberg Literary English Corpus (GLEC) provides a rich source of textual data for research in digital humanities, computational linguistics or neurocognitive poetics. However, so far only a small subcorpus, the Gutenberg English Poetry Corpus, has been submitted to quantitative text analyses providing predictions for scientific studies of ...
arxiv
The current paper deals with the nature of justice in George R. R. Martin’s novel series A Song of Ice and Fire (1996- ) and Tales of Dunk and Egg (1998- ) under the light of Daniel Van Ness’ theory of restorative justice.
Rohani S., Abootalebi H.
doaj +2 more sources
Theory of Transformation Groups, by S. Lie and F. Engel (Vol. I, 1888). Modern Presentation and English Translation [PDF]
The goal of this modern presentation, followed by an English translation from the German, is to make available some parts of Lie's very systematic mathematical thought which deserve to join the contemporary literature, and above all also, to be read.
arxiv
Did Hypatia Know about Negative Numbers? [PDF]
In this Letter we comment on one particular aspect of Hypatia's enigmatic biography by translating into English a short poem that appeared in a recent review of the third revised Polish edition of Maria Dzielska's book about Hypatia. It poses a simple and specifc question: did Hypatia know about the negative numbers?
arxiv
An Annotated Dataset of Coreference in English Literature [PDF]
We present in this work a new dataset of coreference annotations for works of literature in English, covering 29,103 mentions in 210,532 tokens from 100 works of fiction. This dataset differs from previous coreference datasets in containing documents whose average length (2,105.3 words) is four times longer than other benchmark datasets (463.7 for ...
arxiv
English Prompts are Better for NLI-based Zero-Shot Emotion Classification than Target-Language Prompts [PDF]
Emotion classification in text is a challenging task due to the processes involved when interpreting a textual description of a potential emotion stimulus. In addition, the set of emotion categories is highly domain-specific. For instance, literature analysis might require the use of aesthetic emotions (e.g., finding something beautiful), and social ...
arxiv +1 more source
Concentrating on different types of persuasion strategies used by medical experts and political figures, the present study aims to examine persuasion strategies in medical vs. political discourse. Four corpora of speeches and interviews of four different
Mohammad Mohtasham+3 more
doaj +1 more source
Not Low-Resource Anymore: Aligner Ensembling, Batch Filtering, and New Datasets for Bengali-English Machine Translation [PDF]
Despite being the seventh most widely spoken language in the world, Bengali has received much less attention in machine translation literature due to being low in resources. Most publicly available parallel corpora for Bengali are not large enough; and have rather poor quality, mostly because of incorrect sentence alignments resulting from erroneous ...
arxiv
Vesicular Darier's disease: a case report and review of the English literature of a rare disease variant. [PDF]
Mourad A, Kurwa HA, Haber RM.
europepmc +1 more source