
Text Mining: Use of TF-IDF to Examine the Relevance of Words to Documents

open access: yes, International Journal of Computer Applications, 2018
This paper discusses the use of TF-IDF (term frequency-inverse document frequency) to examine the relevance of keywords to documents in a corpus. The study focuses on how the algorithm can be applied to a number of documents.
Shahzad Qaiser, R. Ali
semanticscholar   +1 more source

Graph Convolution for Multimodal Information Extraction from Visually Rich Documents [PDF]

open access: yes, North American Chapter of the Association for Computational Linguistics, 2019
Visually rich documents (VRDs) are ubiquitous in daily business and life. Examples are purchase receipts, insurance policy documents, custom declaration forms and so on.
Xiaojing Liu   +3 more
semanticscholar   +1 more source

A Divide-and-Conquer Approach to the Summarization of Long Documents

open access: yes, IEEE/ACM Transactions on Audio Speech and Language Processing, 2020
We present a novel divide-and-conquer method for the neural summarization of long documents. Our method exploits the discourse structure of the document and uses sentence similarity to split the problem into an ensemble of smaller summarization problems.
Alexios Gidiotis, Grigorios Tsoumakas
semanticscholar   +1 more source

Question Answering by Reasoning Across Documents with Graph Convolutional Networks [PDF]

open access: yes, North American Chapter of the Association for Computational Linguistics, 2018
Most research in reading comprehension has focused on answering questions based on individual documents or even single paragraphs. We introduce a neural model which integrates and reasons relying on information spread within documents and across multiple ...
Nicola De Cao, Wilker Aziz, Ivan Titov
semanticscholar   +1 more source

Chargrid: Towards Understanding 2D Documents [PDF]

open access: yes, Conference on Empirical Methods in Natural Language Processing, 2018
We introduce a novel type of text representation that preserves the 2D layout of a document. This is achieved by encoding each document page as a two-dimensional grid of characters.
Anoop R. Katti   +6 more
semanticscholar   +1 more source

Government Documents Story: The Impact of Eugenics Policy on Marginalized Groups in the United States

open access: yes, Documents to the People, 2022
Introduction: A Brief Overview of Eugenics in the United States. In recent years, debates centered around the idea and phenomenon of discrimination existing or being built directly into our governmental system(s), which is commonly referred to as ...
Teresa Lausell
semanticscholar   +1 more source

A l’entorn de BiPaDI: un projecte de gestió del patrimoni bibliogràfic per al futur [PDF]

open access: yes, 2016
Presentation of the experience and assessment of the BiPaDi project of the CRAI of the Universitat de Barcelona (UB), which has made available for the first time an infrastructure dedicated exclusively to the dissemination and revaluation of one of the libraries ...
Casals, Judit   +2 more
core  

Les col·leccions digitals patrimonials espanyoles: polítiques de col·lecció i presentació de la col·lecció [PDF]

open access: yes, 2014
Objectives: To analyze the existence and content of collection policy documents and selection criteria for Spanish digital heritage collections.
Estivill Rius, Assumpció   +2 more
core  

Les polítiques de col·lecció com a eina per informar a l’usuari de les col·leccions digitals: el cas de la Memòria digital de Catalunya [PDF]

open access: yes, 2010
The paper analyzes the kinds of information provided to users of the digital collections contained in the Memòria digital de Catalunya. Criteria to evaluate this information were developed according to ALA and IFLA guidelines for collection policy ...
Estivill Rius, Assumpció
core  

PositionRank: An Unsupervised Approach to Keyphrase Extraction from Scholarly Documents

open access: yes, Annual Meeting of the Association for Computational Linguistics, 2017
The large and growing amounts of online scholarly data present both challenges and opportunities to enhance knowledge discovery. One such challenge is to automatically extract a small set of keyphrases from a document that can accurately describe the ...
C. Florescu, Cornelia Caragea
semanticscholar   +1 more source
