Results 1 to 10 of about 4,052 (51)

Adapting vs. Pre-training Language Models for Historical Languages [PDF]

open access: yesJournal of Data Mining and Digital Humanities, 2022
As large language models such as BERT are becoming increasingly popular in Digital Humanities (DH), the question has arisen as to how such models can be made suitable for application to specific textual domains, including that of 'historical text'. Large
Enrique Manjavacas, Lauren Fonteyn
doaj   +1 more source

La traduction littéraire automatique : Adapter la machine à la traduction humaine individualisée [PDF]

open access: yesJournal of Data Mining and Digital Humanities, 2022
La traduction automatique neuronale et son adaptation à des domaines spécifiques par le biais de corpus spécialisés ont permis à cette technologie d’intégrer bien plus largement qu’auparavant le métier et la formation des traducteur·trice·s.
Damien Hansen   +3 more
doaj   +1 more source

Hate speech, Censorship, and Freedom of Speech: The Changing Policies of Reddit [PDF]

open access: yesJournal of Data Mining and Digital Humanities, 2022
This paper examines the shift in focus on content policies and user attitudes on the social media platform Reddit. We do this by focusing on comments from general Reddit users from five posts made by admins (moderators) on updates to Reddit Content ...
Elissa Nakajima Wickham, Emily Öhman
doaj   +1 more source

Fractal Sentiments and Fairy Tales - Fractal scaling of narrative arcs as predictor of the perceived quality of Andersen's fairy tales [PDF]

open access: yesJournal of Data Mining and Digital Humanities, 2022
This article explores the sentiment dynamics present in narratives and their contribution to literary appreciation. Specifically, we investigate whether a certain type of sentiment development in a literary narrative correlates with its quality as ...
Yuri Bizzoni   +3 more
doaj   +1 more source

The expansion of isms, 1820-1917: Data-driven analysis of political language in digitized newspaper collections [PDF]

open access: yesJournal of Data Mining and Digital Humanities, 2020
Words with the suffix-ism are reductionist terms that help us navigate complex social issues by using a simple one-word label for them. On the one hand they are often associated with political ideologies, but on the other they are present in many other ...
Jani Marjanen   +3 more
doaj   +3 more sources

Deep Learning for Period Classification of Historical Hebrew Texts [PDF]

open access: yesJournal of Data Mining and Digital Humanities, 2020
In this study, we address the interesting task of classifying historical texts by their assumed period of writ-ing. This task is useful in digital humanity studies where many texts have unidentified publication dates.For years, the typical approach for ...
Chaya Liebeskind, Shmuel Liebeskind
doaj   +3 more sources

Some Reflections on the Interface between Professional Machine Translation Literacy and Data Literacy [PDF]

open access: yesJournal of Data Mining and Digital Humanities, 2023
Due to the widespread use of data-driven neural machine translation, both by professional translators and layperson users, an adequate machine translation literacy on the part of the users of this technology is becoming more and more important.
Ralph Krüger
doaj   +1 more source

Source or target first? Comparison of two post-editing strategies with translation students [PDF]

open access: yesJournal of Data Mining and Digital Humanities, 2022
We conducted an experiment with translation students to assess the influence of two different post-editing (PE) strategies (reading the source segment or the target segment first) on three aspects: PE time, ratio of corrected errors and number of ...
Lise Volkart   +4 more
doaj   +1 more source

TraduXio Project: Latest Upgrades and Feedback [PDF]

open access: yesJournal of Data Mining and Digital Humanities, 2021
International audience TraduXio is a digital environment for computer assisted multilingual translation which is web-based, free to use and with an open source code.
Philippe Lacour, Aurélien Bénel
doaj   +3 more sources

Spoken word corpus and dictionary definition for an African language [PDF]

open access: yesJournal of Data Mining and Digital Humanities, 2020
The preservation of languages is critical to maintaining and strengthening the cultures and identities of communities, and this is especially true for under-resourced languages with a predominantly oral culture.
Wanjiku Nganga, Ikechukwu Achebe
doaj   +3 more sources

Home - About - Disclaimer - Privacy