
Tables to LaTeX: structure and content extraction from scientific tables [PDF]

open access: yes, 2022
Scientific documents contain tables that list important information in a concise fashion. Structure and content extraction from tables embedded within PDF research documents is a very challenging task due to the existence of visual features like spanning cells and content features like mathematical symbols and equations.
arxiv   +1 more source

A large-scale dataset for end-to-end table recognition in the wild [PDF]

open access: yes, 2023
Table recognition (TR) is one of the research hotspots in pattern recognition, which aims to extract information from tables in an image. Common table recognition tasks include table detection (TD), table structure recognition (TSR) and table content recognition (TCR). TD is to locate tables in the image, TCR recognizes text content, and TSR recognizes
arxiv   +1 more source

Automatic Logical Forms improve fidelity in Table-to-Text generation [PDF]

open access: yes, Expert Systems with Applications, Volume 238, Part D, 15 March 2024, 121869, 2023
Table-to-text systems generate natural language statements from structured data like tables. While end-to-end techniques suffer from low factual correctness (fidelity), a previous study reported gains when using manual logical forms (LF) that represent the selected content and the semantics of the target text.
arxiv   +1 more source

StruBERT: Structure-aware BERT for Table Search and Matching [PDF]

open access: yes, 2022
A large amount of information is stored in data tables. Users can search for data tables using a keyword-based query. A table is composed primarily of data values that are organized in rows and columns providing implicit structural information. A table is usually accompanied by secondary information such as the caption, page title, etc., that form the ...
arxiv   +1 more source

Table of Contents SHEs: Conference Series

open access: yes, Social, Humanities, and Educational Studies (SHEs): Conference Series, 2021
semanticscholar   +1 more source

Sumário Bilingue

open access: yes, Revista de Investigações Constitucionais, 2021
Bilingual Table of Contents
doaj   +3 more sources

TabLeX: A Benchmark Dataset for Structure and Content Information Extraction from Scientific Tables [PDF]

open access: yes, 2021
Information Extraction (IE) from the tables present in scientific articles is challenging due to complicated tabular representations and complex embedded text. This paper presents TabLeX, a large-scale benchmark dataset comprising table images generated from scientific articles. TabLeX consists of two subsets, one for table structure extraction and the
arxiv   +1 more source
