Adapting vs. Pre-training Language Models for Historical Languages [PDF]
As large language models such as BERT are becoming increasingly popular in Digital Humanities (DH), the question has arisen as to how such models can be made suitable for application to specific textual domains, including that of 'historical text'. Large
Enrique Manjavacas, Lauren Fonteyn
doaj +1 more source
La traduction littéraire automatique : Adapter la machine à la traduction humaine individualisée [PDF]
La traduction automatique neuronale et son adaptation à des domaines spécifiques par le biais de corpus spécialisés ont permis à cette technologie d’intégrer bien plus largement qu’auparavant le métier et la formation des traducteur·trice·s.
Damien Hansen+3 more
doaj +1 more source
Hate speech, Censorship, and Freedom of Speech: The Changing Policies of Reddit [PDF]
This paper examines the shift in focus on content policies and user attitudes on the social media platform Reddit. We do this by focusing on comments from general Reddit users from five posts made by admins (moderators) on updates to Reddit Content ...
Elissa Nakajima Wickham, Emily Öhman
doaj +1 more source
Fractal Sentiments and Fairy Tales - Fractal scaling of narrative arcs as predictor of the perceived quality of Andersen's fairy tales [PDF]
This article explores the sentiment dynamics present in narratives and their contribution to literary appreciation. Specifically, we investigate whether a certain type of sentiment development in a literary narrative correlates with its quality as ...
Yuri Bizzoni+3 more
doaj +1 more source
The expansion of isms, 1820-1917: Data-driven analysis of political language in digitized newspaper collections [PDF]
Words with the suffix-ism are reductionist terms that help us navigate complex social issues by using a simple one-word label for them. On the one hand they are often associated with political ideologies, but on the other they are present in many other ...
Jani Marjanen+3 more
doaj +3 more sources
Deep Learning for Period Classification of Historical Hebrew Texts [PDF]
In this study, we address the interesting task of classifying historical texts by their assumed period of writ-ing. This task is useful in digital humanity studies where many texts have unidentified publication dates.For years, the typical approach for ...
Chaya Liebeskind, Shmuel Liebeskind
doaj +3 more sources
Some Reflections on the Interface between Professional Machine Translation Literacy and Data Literacy [PDF]
Due to the widespread use of data-driven neural machine translation, both by professional translators and layperson users, an adequate machine translation literacy on the part of the users of this technology is becoming more and more important.
Ralph Krüger
doaj +1 more source
Source or target first? Comparison of two post-editing strategies with translation students [PDF]
We conducted an experiment with translation students to assess the influence of two different post-editing (PE) strategies (reading the source segment or the target segment first) on three aspects: PE time, ratio of corrected errors and number of ...
Lise Volkart+4 more
doaj +1 more source
TraduXio Project: Latest Upgrades and Feedback [PDF]
International audience TraduXio is a digital environment for computer assisted multilingual translation which is web-based, free to use and with an open source code.
Philippe Lacour, Aurélien Bénel
doaj +3 more sources
Spoken word corpus and dictionary definition for an African language [PDF]
The preservation of languages is critical to maintaining and strengthening the cultures and identities of communities, and this is especially true for under-resourced languages with a predominantly oral culture.
Wanjiku Nganga, Ikechukwu Achebe
doaj +3 more sources