Results 281 to 290 of about 13,834 (319)
Some of the next articles are maybe not open access.

A Multi-Threaded Semantic Focused Crawler

Journal of Computer Science and Technology, 2012
The Web comprises of voluminous rich learning content. The volume of ever growing learning resources however leads to the problem of information overload. A large number of irrelevant search results generated from search engines based on keyword matching techniques further augment the problem.
Anjali Thukral   +4 more
openaire   +2 more sources

SAFSB: A self-adaptive focused crawler

2015 1st International Conference on Next Generation Computing Technologies (NGCT), 2015
There are about 3 billion indexed websites present in the WWW. Not all websites do not belong to a particular topic are indexed by a search engine say google.com, there are online platforms available where different users help the person asking for a (Universal Resource Locator) URL containing a topical information.
Mohd. Aamir Khan, Dilip Kumar Sharma
openaire   +2 more sources

Adaptive Focused Website Segment Crawler

2016 19th International Conference on Network-Based Information Systems (NBiS), 2016
Focused web crawler has become indispensable for vertical search engines that provide a search service for specialized datasets. These vertical search engines have to collect specific web pages in the web space, whereas search engines such as Google and Bing gather web pages from all over the world.
Tanaphol Suebchua   +2 more
openaire   +2 more sources

The BINGO! focused crawler: from bookmarks to archetypes [PDF]

open access: possibleProceedings 18th International Conference on Data Engineering, 2003
The BINGO! system implements an approach to focused crawling that aims to overcome the limitations of the initial training data. To this end, BINGO! identifies, among the crawled and positively classified documents of a topic, characteristic "archetypes" and uses them for periodically re-training the classifier; this way the crawler is dynamically ...
Sergej Sizov   +3 more
openaire   +1 more source

Finding seeds to bootstrap focused crawlers

World Wide Web, 2015
Focused crawlers are effective tools for applications requiring a high number of pages belonging to a specific topic. Several strategies for implementing these crawlers have been proposed in the literature, which aim to improve crawling efficiency by increasing the number of relevant pages retrieved while avoiding non-relevant pages.
Luciano Barbosa   +4 more
openaire   +2 more sources

An Algorithm OFC for the Focused Web Crawler

2007 International Conference on Machine Learning and Cybernetics, 2007
Based on reinforcement learning and fuzzy clustering theory, this paper proposes an algorithm OFC for the focused web crawler. We combine the naive Bayes classifiers with the fuzzy center-averaged clustering method to calculate the fuzzy memberships that are used to solve the valLie function mapping the hyperlinks to the future discounted rewards ...
openaire   +2 more sources

Focused web crawler with revisit policy

Proceedings of the International Conference & Workshop on Emerging Trends in Technology - ICWET '11, 2011
Focused crawlers aim to search only the subset of the web related to a specific topic, and offer a potential solution to the problem. The major problem is how to retrieve the maximal set of relevant and quality pages. In this paper, We propose an architecture that concentrates more over page selection policy and page revisit policy The three-step ...
Bandu B. Meshram, S. Mali
openaire   +2 more sources

Where to Crawl Next for Focused Crawlers

2010
Since WWW provides a large amount of data, it is useful for innovative and creative activities of human beings to retrieve interesting and useful information effectively and efficiently from WWW. In this paper, we attempt to propose a focused crawler for individual activities.
Masayoshi Aritsugi   +3 more
openaire   +2 more sources

Effect of feature selection method on the performance of focused crawlers—A case study on traditional and accelerated focused crawlers

2010 International Conference on Networking and Information Technology, 2010
This paper mainly focuses on the effect of feature selection method on the performance of Traditional Focused Crawler (TFC) and Accelerated Focused Crawler (AFC). Information retrieval methods like querying a search engine, usage of web catalog and browsing may not satisfy the information needs of all the users.
R. Krishna Chaitanya   +2 more
openaire   +2 more sources

Towards a Keyword-Focused Web Crawler

2013
This paper concerns predicting the content of textual web documents based on features extracted from web pages that link to them. It may be applied in an intelligent, keyword-focused web crawler. The experiments made on publicly available real data obtained from Open Directory Project with the use of several classification models are promising and ...
Marcin Sydow, Tomasz Kuśmierczyk
openaire   +2 more sources

Home - About - Disclaimer - Privacy