Unifying Vision, Text, and Layout for Universal Document Processing [PDF]
We propose Universal Document Processing (UDOP), a foundation Document AI model which unifies text, image, and layout modalities together with varied task formats, including document understanding and generation.
Zineng Tang +8 more
semanticscholar +1 more source
Unifying Layout Generation with a Decoupled Diffusion Model [PDF]
Layout generation aims to synthesize realistic graphic scenes consisting of elements with different attributes in-cluding category, size, position, and between-element relation.
Mude Hui +5 more
semanticscholar +1 more source
LayoutDiffusion: Improving Graphic Layout Generation by Discrete Diffusion Probabilistic Models [PDF]
Creating graphic layouts is a fundamental step in graphic designs. In this work, we present a novel generative model named LayoutDiffusion for automatic layout generation.
Junyi Zhang +4 more
semanticscholar +1 more source
Pharmaceutical Co-Crystallization: Regulatory Aspects, Design, Characterization, and Applications [PDF]
Pharmaceutical co-crystals are novel class of pharmaceutical substances, which possess an apparent probability of advancement of polished physical properties offering stable and patentable solid forms.
Abdul Raheem Thayyil +3 more
doaj +1 more source
Retrieval-Augmented Layout Transformer for Content-Aware Layout Generation [PDF]
Content-aware graphic layout generation aims to automatically arrange visual elements along with a given content, such as an e-commerce product image. In this paper, we argue that the current layout generation approaches suffer from the limited training ...
Daichi Horita +4 more
semanticscholar +1 more source
Area-Universal Rectangular Layouts [PDF]
A rectangular layout is a partition of a rectangle into a finite set of interior-disjoint rectangles. Rectangular layouts appear in various applications: as rectangular cartograms in cartography, as floorplans in building architecture and VLSI design ...
Eppstein, David +3 more
core +7 more sources
LayoutLM: Pre-training of Text and Layout for Document Image Understanding [PDF]
Pre-training techniques have been verified successfully in a variety of NLP tasks in recent years. Despite the widespread use of pre-training models for NLP applications, they almost exclusively focus on text-level manipulation, while neglecting layout ...
Yiheng Xu +5 more
semanticscholar +1 more source
Peripheral visual field defect of vigabatrin in pediatric epilepsy: A review
Vigabatrin is the medication used for the treatment of infantile spasms and refractory complex partial seizures, but its usage has always been contradictory due to its effect on vision.
Umme Habeeba A. Pathan +3 more
doaj +1 more source
ERNIE-Layout: Layout Knowledge Enhanced Pre-training for Visually-rich Document Understanding [PDF]
Recent years have witnessed the rise and success of pre-training techniques in visually-rich document understanding. However, most existing methods lack the systematic mining and utilization of layout-centered knowledge, leading to sub-optimal ...
Qiming Peng +14 more
semanticscholar +1 more source
PubLayNet: Largest Dataset Ever for Document Layout Analysis [PDF]
Recognizing the layout of unstructured digital documents is an important step when parsing the documents into structured machine-readable format for downstream applications.
Xu Zhong +2 more
semanticscholar +1 more source

