Feature selection by integrating document frequency with genetic algorithm for Amharic news document classification [PDF]
Text classification is the process of categorizing documents based on their content into a predefined set of categories. Text classification algorithms typically represent documents as collections of words and it deals with a large number of features ...
Demeke Endalie +2 more
doaj +2 more sources
Web Archiving as a Task of National and Local History Bibliography
The foreign experience of forming the majority of national web archives (in national libraries) and many accessible local history archives of public and university libraries has been analyzed due to the development of a model and methodology for ...
N. M. Balatskaya, M. B. Martirosova
doaj +1 more source
From Integrating to Learning: Insights from Spanish L2 Multiple Documents Selection in Reading Tasks
Previous literature has focused on investigating the use of sources in the classroom and how much they contribute to building a coherent mental representation of the texts.
Maha Soliman
doaj +1 more source
Users Behavior in Selecting Cited Bibliographies-A Case Study of National Taiwan University [PDF]
This project analyzes the behavior of selecting cited bibliographies of collegeand graduate students in National Taiwan University when they are writing theirterm papers and graduate theses.
Mu-Hsuan Huang
doaj +1 more source
Ranked document selection [PDF]
zbMATH Open Web Interface contents unavailable due to conflicting licenses.
Munro, J. Ian +3 more
openaire +4 more sources
Document features selection using background knowledge and word clustering technique [PDF]
By everyday development of storage and communicational and electronic media, there are significant amount of information being collected and stored in different forms such as electronic documents and document databases makes it difficult to process them,
Hajar Farahmand +2 more
doaj +1 more source
Data-driven Feature Selection Methods for Text Classification: an Empirical Evaluation [PDF]
Dimensionality reduction is a crucial task in text classification. The most adopted strategy is feature selection using filter methods. This approach presents a difficulty in determining the best size for the final feature vector.
Rogerio C. P. Fragoso +2 more
doaj +3 more sources
Pearson Correlation-Based Feature Selection for Document Classification Using Balanced Training
Documents are stored in a digital form across several organizations. Printing this amount of data and placing it into folders instead of storing digitally is against the practical, economical, and ecological perspective.
Inzamam Mashood Nasir +6 more
doaj +1 more source
Frequent itemset-based feature selection and Rider Moth Search Algorithm for document clustering
Document clustering has recently been paid great attention in retrieval, navigation, and summarization of huge volumes of documents. With a better document clustering approach, computers can organize a document corpus automatically to a meaningful ...
Madhulika Yarlagadda +2 more
doaj +1 more source
Abstractive Multi-Document Summarization via Phrase Selection and Merging [PDF]
We propose an abstraction-based multi-document summarization framework that can construct new sentences by exploring more fine-grained syntactic units than sentences, namely, noun/verb phrases.
Bing, Lidong +5 more
core +1 more source

