Results 151 to 160 of about 62,074 (317)

HistNERo: Historical Named Entity Recognition for the Romanian Language [PDF]

open access: yesarXiv
This work introduces HistNERo, the first Romanian corpus for Named Entity Recognition (NER) in historical newspapers. The dataset contains 323k tokens of text, covering more than half of the 19th century (i.e., 1817) until the late part of the 20th century (i.e., 1990). Eight native Romanian speakers annotated the dataset with five named entities.
arxiv  

Intensified decadal variability in tropical climate during the late 19th century [PDF]

open access: bronze, 2009
Toby R. Ault   +6 more
openalex   +1 more source

Rethinking Chlorine: Essential Chemical or Replaceable Risk?

open access: yesChemSusChem, Accepted Article.
This review critically examines the dual nature of chlorine as both an indispensable base chemical and a potential risk. Chlorine and its by‐product hydrogen chloride play essential roles in the production of pharmaceuticals, plastics, agrochemicals, and disinfectants.
Johannes Schwan   +6 more
wiley   +1 more source

Reading the unreadable: Creating a dataset of 19th century English newspapers using image-to-text language models [PDF]

open access: yesarXiv
Oscar Wilde said, "The difference between literature and journalism is that journalism is unreadable, and literature is not read." Unfortunately, The digitally archived journalism of Oscar Wilde's 19th century often has no or poor quality Optical Character Recognition (OCR), reducing the accessibility of these archives and making them unreadable both ...
arxiv  

Home - About - Disclaimer - Privacy