Results 1 to 10 of about 173,908 (57)

Word Sense Induction with Attentive Context Clustering [PDF]

open access: yesJournal of Data Mining and Digital Humanities, 2022
This paper presents ACCWSI (Attentive Context Clustering WSI), a method for Word Sense Induction, suitable for languages with limited resources. Pretrained on a small corpus and given an ambiguous word (a query word) and a set of excerpts that contain it,
Moshe Stekel, Amos Azaria, Shai Gordin
doaj   +1 more source

You Actually Look Twice At it (YALTAi): using an object detection approach instead of region segmentation within the Kraken engine [PDF]

open access: yesJournal of Data Mining and Digital Humanities, 2023
Layout Analysis (the identification of zones and their classification) is the first step along line segmentation in Optical Character Recognition and similar tasks.
Thibault Clérice
doaj   +1 more source

Impact of Image Enhancement Methods on Automatic Transcription Trainings with eScriptorium [PDF]

open access: yesJournal of Data Mining and Digital Humanities, 2023
This study stems from the Desenrollando el cordel (Untangling the cordel) project, which focuses on 19th-century Spanish prints editing. It evaluates the impact of image enhancement methods on the automatic transcription of low-quality documents, both in
Pauline Jacsont, Elina Leblanc
doaj   +1 more source

DeepL et Google Translate face à l'ambiguïté phraséologique [PDF]

open access: yesJournal of Data Mining and Digital Humanities, 2022
Malgré les progrès de la traduction automatique neuronale, l'intelligence artificielle ne permet toujours pas à la machine de comprendre pour déjouer tous les pièges de la traduction, notamment ceux de l'ambiguïté lexicale, phraséologique, syntaxique et ...
Françoise Bacquelaine
doaj   +1 more source

Handwritten Text Recognition for Documentary Medieval Manuscripts [PDF]

open access: yesJournal of Data Mining and Digital Humanities, 2023
Handwritten Text Recognition (HTR) techniques aim to accurately recognize sequences of characters in input manuscript images by training artificial intelligence models to capture historical writing features.
Sergio Torres Aguilar, Vincent Jolivet
doaj   +1 more source

French vital records data gathering and analysis through image processing and machine learning algorithms [PDF]

open access: yesJournal of Data Mining and Digital Humanities, 2021
Vital records are rich of meaningful historical data concerning city as well as countryside inhabitants that can be used, among others, to study former populations and then reveal the social, economic and demographic characteristics of those populations.
Cyprien Plateau-Holleville   +3 more
doaj   +1 more source

Generic HTR Models for Medieval Manuscripts. The CREMMALab Project [PDF]

open access: yesJournal of Data Mining and Digital Humanities, 2023
In the Humanities, the emergence of digital methods has opened up research questions to quantitative analysis. This is why HTR technology is increasingly involved in humanities research projects following precursors such as the Himanis project.
Ariane Pinche
doaj   +1 more source

New schemes for simplifying binary constraint satisfaction problems [PDF]

open access: yesDiscrete Mathematics & Theoretical Computer Science, 2020
Finding a solution to a Constraint Satisfaction Problem (CSP) is known to be an NP-hard task. This has motivatedthe multitude of works that have been devoted to developing techniques that simplify CSP instances before or duringtheir resolution.The ...
Wady Naanaa
doaj   +1 more source

How competitors become collaborators—Bridging the gap(s) between machine learning algorithms and clinicians

open access: yesBioethics, Volume 36, Issue 2, Page 134-142, February 2022., 2022
Abstract For some years, we have been witnessing a steady stream of high‐profile studies about machine learning (ML) algorithms achieving high diagnostic accuracy in the analysis of medical images. That said, facilitating successful collaboration between ML algorithms and clinicians proves to be a recalcitrant problem that may exacerbate ethical ...
Thomas Grote, Philipp Berens
wiley   +1 more source

Transcribing Foucault’s handwriting with Transkribus [PDF]

open access: yesJournal of Data Mining and Digital Humanities, 2019
The Foucault Fiches de Lecture (FFL) project aims both to explore and to make available online a large set of Michel Foucault’s reading notes (organized citations, references and comments) held at the BnF since 2013.
Marie-Laure Massot   +2 more
doaj   +1 more source

Home - About - Disclaimer - Privacy