
Discovering Land Cover Web Map Services from the Deep Web with JavaScript Invocation Rules

open access: yes, ISPRS International Journal of Geo-Information, 2016
Automatic discovery of isolated land cover web map services (LCWMSs) can potentially help in sharing land cover data. Currently, various search engine-based and crawler-based approaches have been developed for finding services dispersed throughout the ...
Dongyang Hou, Jun Chen, Hao Wu
doaj   +1 more source

GeoWeb Crawler: An Extensible and Scalable Web Crawling Framework for Discovering Geospatial Web Resources

open access: yes, ISPRS International Journal of Geo-Information, 2016
With the advance of the World-Wide Web (WWW) technology, people can easily share content on the Web, including geospatial data and web services. Thus, the “big geospatial data management” issues start attracting attention.
Chih-Yuan Huang, Hao Chang
doaj   +1 more source

An architecture for a focused trend parallel web crawler with the application of clickstream analysis [PDF]

open access: yes, 2012
The tremendous growth of the Web poses many challenges for all-purpose single-process crawlers including the presence of some irrelevant answers among search results and the coverage and scaling issues regarding the enormous dimension of the World Wide ...
Ahmadi-Abkenari, Fatemeh, Selamat, Ali
core   +1 more source

Review of web crawlers

open access: yes, International Journal of Knowledge and Web Intelligence, 2014
The web is a repository of a large amount of data. Information available on the web is organised in the form of pages. Because the amount of information is effectively unlimited, searching for and finding appropriate information on the web is a task that requires expertise.
S. R. Sreeja, Sangita Chaudhari
openaire   +2 more sources

Institutional Repository Keyword Analysis with Web Crawler

open access: yes, Central European Journal of Educational Research, 2022
This study aims at investigating procedures of semantic and linguistic extraction of keywords from metadata of documents indexed in the Institutional Repository Unesp.
Mariângela Spotti Lopes Fujita   +2 more
doaj   +1 more source

RCrawler: An R package for parallel web crawling and scraping

open access: yes, SoftwareX, 2017
RCrawler is a contributed R package for domain-based web crawling and content scraping. As the first implementation of a parallel web crawler in the R environment, RCrawler can crawl, parse, store pages, extract contents, and produce data that can be ...
Salim Khalil, Mohamed Fakir
doaj   +1 more source

Research on bearing capacity of cross-type truss boom with variable cross-section of Crawler cranes

open access: yes, Mechanics & Industry, 2023
The web crossed truss boom is one of the commonly used truss boom structures of crawler cranes. However, the existing calculations fail to consider the limiting effect of the web members' bending resistance on the chord members, and cannot give full play
Fenglin Yao   +6 more
doaj   +1 more source

Application of ARIMA(1,1,0) Model for Predicting Time Delay of Search Engine Crawlers [PDF]

open access: yes, Informatică economică, 2013
The World Wide Web is growing at a tremendous rate in terms of the number of visitors and the number of web pages. Search engine crawlers are highly automated programs that periodically visit the web and index web pages.
Jeeva JOSE, P. Sojan LAL
doaj   +1 more source

Optimization of Distributed Crawler under Hadoop

open access: yes, MATEC Web of Conferences, 2015
The web crawler is an important link in data acquisition from the World Wide Web. Traditional methods need to be optimized to meet current needs in the face of the explosive growth of data.
Zhang Xiaochen, Xian Ming
doaj   +1 more source

Hybrid focused crawling on the Surface and the Dark Web

open access: yes, EURASIP Journal on Information Security, 2017
Focused crawlers enable the automatic discovery of Web resources about a given topic by automatically navigating through the Web link structure and selecting the hyperlinks to follow by estimating their relevance to the topic of interest.
Christos Iliou   +4 more
doaj   +1 more source
