Docs2KG: Unified Knowledge Graph Construction from Heterogeneous Documents Assisted by Large Language Models [PDF]
Even for a conservative estimate, 80% of enterprise data reside in unstructured files, stored in data lakes that accommodate heterogeneous formats. Classical search engines can no longer meet information seeking needs, especially when the task is to browse and explore for insight formulation. In other words, there are no obvious search keywords to use.
arxiv
Findings from the Workshop on User-Centered Design of Language Archives [PDF]
This white paper describes findings from the workshop on User-Centered Design of Language Archives organized in February 2016 by Christina Wasson (University of North Texas) and Gary Holton (University of Hawai‘i at Mānoa).
Holton, Gary+2 more
core +1 more source
A Novel Digitalization Approach for Smart Materials – Ontology‐Based Access to Data and Models
In order to access heterogeneous material data and model‐based knowledge, the established ontology‐based data access (OBDA) is extended to include material models. This novel ontology‐based data and model access (OBDMA) enables the computation of new responses beyond stored data.
Jürgen Maas+15 more
wiley +1 more source
Catálogo de los pergaminos del Archivo de la Catedral de Murcia
El artículo ofrece un Catálogo completo de los pergaminos que se conservan a día de hoy en el Archivo de la Catedral de Murcia (ACM). En total se regestan 282 piezas datadas entre 1250 y 1978, de formato y temas diversos y con diferente estado de ...
Isabel García Díaz+1 more
doaj
Verifiable Source Code Documentation in Controlled Natural Language [PDF]
Writing documentation about software internals is rarely considered a rewarding activity. It is highly time-consuming and the resulting documentation is fragile when the software is continuously evolving in a multi-developer setting. Unfortunately, traditional programming environments poorly support the writing and maintenance of documentation ...
arxiv
FAIR and Structured Data: A Domain Ontology Aligned with Standard‐Compliant Tensile Testing
The digitalization in materials science and engineering is discussed, emphasizing the importance of digital workflows and ontologies in managing diverse experimental data. Challenges such as quality assurance and data interoperability are tackled with semantic web technologies, focusing and introducing the tensile test ontology (TTO).
Markus Schilling+6 more
wiley +1 more source
Digital archiving of manuscripts and other heritage items for conservation and information retrieval [PDF]
Expression of cultural heritage looking from the informatics angle falls into text, images, video and sound categories. ICT can be used to conserve all these heritage items like; the text information consisting of palm leaf manuscripts, stone tablets ...
core
An Automatized Simulation Workflow for Powder Pressing Simulations Using SimStack
The implementation of Workflow active Nodes (WaNos) for the convenient execution and automated evaluation of discrete element method calculations of powder pressing is showcased. Purposeful combination of WaNos creates timesaving and resource‐effective computational workflows.
Bjoern Mieller+2 more
wiley +1 more source
More Documents, Same Length: Isolating the Challenge of Multiple Documents in RAG [PDF]
Retrieval-augmented generation (RAG) provides LLMs with relevant documents. Although previous studies noted that retrieving many documents can degrade performance, they did not isolate how the quantity of documents affects performance while controlling for context length.
arxiv
Legal entity recognition in an agglutinating language and document connection network for EU Legislation and EU/Hungarian Case Law [PDF]
We have developed an application aiming at federated search for EU and Hungarian legislation and jurisdiction. It now contains above 1 million documents, with daily updates. The database holds documents downloaded from the EU sources EUR-Lex and Curia Online as well as public jurisdiction documents from the Constitutional Court of Hungary and The ...
arxiv