An Algorithm OFC for the Focused Web Crawler
2007 International Conference on Machine Learning and Cybernetics, 2007
Based on reinforcement learning and fuzzy clustering theory, this paper proposes an algorithm OFC for the focused web crawler. We combine the naive Bayes classifiers with the fuzzy center-averaged clustering method to calculate the fuzzy memberships that are used to solve the value function mapping the hyperlinks to the future discounted rewards ...
Finding seeds to bootstrap focused crawlers
World Wide Web, 2015
Focused crawlers are effective tools for applications requiring a high number of pages belonging to a specific topic. Several strategies for implementing these crawlers have been proposed in the literature, which aim to improve crawling efficiency by increasing the number of relevant pages retrieved while avoiding non-relevant pages.
Luciano Barbosa+4 more
The BINGO! focused crawler: from bookmarks to archetypes
The BINGO! system implements an approach to focused crawling that aims to overcome the limitations of the initial training data. To this end, BINGO! identifies, among the crawled and positively classified documents of a topic, characteristic "archetypes" and uses them for periodically re-training the classifier; this way the crawler is dynamically ...
Sergej Sizov+3 more
Where to Crawl Next for Focused Crawlers
2010
Since the WWW provides a large amount of data, it is useful for innovative and creative activities of human beings to retrieve interesting and useful information effectively and efficiently from the WWW. In this paper, we attempt to propose a focused crawler for individual activities.
Masayoshi Aritsugi+3 more
2010 International Conference on Networking and Information Technology, 2010
This paper mainly focuses on the effect of feature selection method on the performance of Traditional Focused Crawler (TFC) and Accelerated Focused Crawler (AFC). Information retrieval methods like querying a search engine, usage of web catalog and browsing may not satisfy the information needs of all the users.
R. Krishna Chaitanya+2 more
FCHC: A Social Semantic Focused Crawler
2011
The World Wide Web is a huge collection of web pages to which a new piece of information is added every second. Searching and retrieving relevant web resources is a protracted task, and finding relevant resources w.r.t. some topic without any explicit or implicit feedback adds more intricacy to the process.
Punam Bedi+4 more
Towards a Keyword-Focused Web Crawler
2013
This paper concerns predicting the content of textual web documents based on features extracted from web pages that link to them. It may be applied in an intelligent, keyword-focused web crawler. The experiments made on publicly available real data obtained from Open Directory Project with the use of several classification models are promising and ...
Marcin Sydow, Tomasz Kuśmierczyk
HAWK: A Focused Crawler with Content and Link Analysis
2008 IEEE International Conference on e-Business Engineering, 2008
Maintaining currency of search engine indices by exhaustive crawling is rapidly becoming impossible due to the increasing size of the web. Focused crawlers aim to search only the subset of the web related to a specific topic, and offer a potential solution to the problem. But they also have problems. The major problem is how to retrieve the maximal set of
Xin Zhang, Xiaoyun Chen
A General Evaluation Framework for Adaptive Focused Crawlers
Proceedings of the 10th International Conference on Web Information Systems and Technologies, 2014
Focused crawling is increasingly seen as a solution to increase the freshness and coverage of a local repository of documents related to specific topics by selectively traversing paths on the web. The adaptation is a peculiar feature that makes it possible to modify the search strategies according to the particular environment, its alterations and its ...
Fabio Gasparetti+2 more
CRAWLER-LD: A Multilevel Metadata Focused Crawler Framework for Linked Data
2015
The Linked Data best practices recommend publishing a new tripleset using well-known ontologies and interlinking the new tripleset with other triplesets. However, both are difficult tasks. This paper describes CRAWLER-LD, a metadata crawler that helps select ontologies and triplesets to be used, respectively, in the publication and the interlinking ...
Giseli Rabello Lopes+3 more