Results 11 to 20 of about 1,802,735 (277)
Experiments with document archive size detection [PDF]
The size of a document archive is a very important parameter for resource selection in distributed information retrieval systems. In this paper, we present a method for automatically detecting the size (ie the number of documents) of a document archive ...
Crestani, F., Gibb, F., Wu, S.
core +1 more source
Nested Hierarchical Dirichlet Processes [PDF]
We develop a nested hierarchical Dirichlet process (nHDP) for hierarchical topic modeling. The nHDP is a generalization of the nested Chinese restaurant process (nCRP) that allows each word to follow its own path to a topic node according to a document ...
Blei, David M. +3 more
core +1 more source
Query-driven document partitioning and collection selection [PDF]
— We present a novel strategy to partition a document collection onto several servers and to perform effective collection selection. The method is based on the analysis of query logs. We proposed a novel document representation called query-vectors model.
Domenico Laforenza, Fabrizio Silvestri
core +2 more sources
Feature selection for document classification based on topology
Feature selection is the method of how to select the best subset of the document occurring in data core for using it in purposes of data mining or applications.
O.G. El Barbary, A.S. Salama
doaj +1 more source
Feature Selection Based on Term Frequency Reordering of Document Level
In this paper, we propose a new feature selection algorithm based on term frequency reordering of document level. In our proposed algorithm, it uses the document frequency to weigh the unbalanced factors of the data sets and considers the effect of the ...
Hongfang Zhou +3 more
doaj +1 more source
Opsin expression predicts male nuptial color in threespine stickleback. [PDF]
Theoretical models of sexual selection suggest that male courtship signals can evolve through the build-up of genetic correlations between the male signal and female preference.
Bolnick, Daniel I +3 more
core +2 more sources
Digitization of management: automation of the selection and classification in an arbitrary verbal context [PDF]
This article deals with an approach to extracting formal meaning in an arbitrary text document. According to the authors the formal semantic attribute (“semantic pattern”) will allow to solve the problems of automatic classification of verbal context ...
Meshkov Vladimir +2 more
doaj +1 more source
Individual Expert Selection and Ranking of Scientific Articles Using Document Length
Individual expert selection and ranking is a challenging research topic that has received a lot attention in recent years because of its importance related to referencing experts in particular domains and research fund allocation and management.
Fadly Akbar Saputra +2 more
doaj +1 more source
Multi-Document Neural Reading Comprehension Based on Bi-Directional Attention Mechanism [PDF]
Machine Reading Comprehension(MRC) is a question and answer task that automatically generates or extracts corresponding answers for a given text and specific questions.This task is of great significance to evaluating the understanding of computer systems
TANG Hongxuan, WU Kaili, ZHU Mengmeng, HONG Yu
doaj +1 more source
Firefly Algorithm based Feature Selection for Arabic Text Classification
Due to the large number of documents available in the internet, emails and digital libraries, document classification is becoming a crucial task extremely required.
Souad Larabi Marie-Sainte, Nada Alalyani
doaj +1 more source

