Results 181 to 190 of about 76,818 (217)
Some of the next articles are maybe not open access.

Focused web crawler with revisit policy

Proceedings of the International Conference & Workshop on Emerging Trends in Technology - ICWET '11, 2011
Focused crawlers aim to search only the subset of the web related to a specific topic, and offer a potential solution to the problem. The major problem is how to retrieve the maximal set of relevant and quality pages. In this paper, We propose an architecture that concentrates more over page selection policy and page revisit policy The three-step ...
Bandu B. Meshram, S. Mali
openaire   +2 more sources

Adaptive Focused Website Segment Crawler

2016 19th International Conference on Network-Based Information Systems (NBiS), 2016
Focused web crawler has become indispensable for vertical search engines that provide a search service for specialized datasets. These vertical search engines have to collect specific web pages in the web space, whereas search engines such as Google and Bing gather web pages from all over the world.
Tanaphol Suebchua   +2 more
openaire   +2 more sources

An Algorithm OFC for the Focused Web Crawler

2007 International Conference on Machine Learning and Cybernetics, 2007
Based on reinforcement learning and fuzzy clustering theory, this paper proposes an algorithm OFC for the focused web crawler. We combine the naive Bayes classifiers with the fuzzy center-averaged clustering method to calculate the fuzzy memberships that are used to solve the valLie function mapping the hyperlinks to the future discounted rewards ...
openaire   +2 more sources

Finding seeds to bootstrap focused crawlers

World Wide Web, 2015
Focused crawlers are effective tools for applications requiring a high number of pages belonging to a specific topic. Several strategies for implementing these crawlers have been proposed in the literature, which aim to improve crawling efficiency by increasing the number of relevant pages retrieved while avoiding non-relevant pages.
Luciano Barbosa   +4 more
openaire   +2 more sources

The BINGO! focused crawler: from bookmarks to archetypes [PDF]

open access: possibleProceedings 18th International Conference on Data Engineering, 2003
The BINGO! system implements an approach to focused crawling that aims to overcome the limitations of the initial training data. To this end, BINGO! identifies, among the crawled and positively classified documents of a topic, characteristic "archetypes" and uses them for periodically re-training the classifier; this way the crawler is dynamically ...
Sergej Sizov   +3 more
openaire   +1 more source

Effect of feature selection method on the performance of focused crawlers—A case study on traditional and accelerated focused crawlers

2010 International Conference on Networking and Information Technology, 2010
This paper mainly focuses on the effect of feature selection method on the performance of Traditional Focused Crawler (TFC) and Accelerated Focused Crawler (AFC). Information retrieval methods like querying a search engine, usage of web catalog and browsing may not satisfy the information needs of all the users.
R. Krishna Chaitanya   +2 more
openaire   +2 more sources

Where to Crawl Next for Focused Crawlers

2010
Since WWW provides a large amount of data, it is useful for innovative and creative activities of human beings to retrieve interesting and useful information effectively and efficiently from WWW. In this paper, we attempt to propose a focused crawler for individual activities.
Masayoshi Aritsugi   +3 more
openaire   +2 more sources

Towards a Keyword-Focused Web Crawler

2013
This paper concerns predicting the content of textual web documents based on features extracted from web pages that link to them. It may be applied in an intelligent, keyword-focused web crawler. The experiments made on publicly available real data obtained from Open Directory Project with the use of several classification models are promising and ...
Marcin Sydow, Tomasz Kuśmierczyk
openaire   +2 more sources

A General Evaluation Framework for Adaptive Focused Crawlers

Proceedings of the 10th International Conference on Web Information Systems and Technologies, 2014
Focused crawling is increasingly seen as a solution to increase the freshness and coverage of local repository of documents related to specific topics by selectively traversing paths on the web. The adaptation is a peculiar feature that makes it possible to modify the search strategies according to the particular environment, its alterations and its ...
GASPARETTI, FABIO   +2 more
openaire   +3 more sources

FCHC: A Social Semantic Focused Crawler

2011
The World Wide Web is a huge collection of web pages where every second, new piece of information is added. Searching and retrieving relevant web resources is a protracted task and finding relevant resources w.r.t. some topic, without any explicit or implicit feedback adds more intricacy to the process.
Punam Bedi   +4 more
openaire   +2 more sources

Home - About - Disclaimer - Privacy