Results 1 to 10 of about 1,734,941 (332)
An Enhanced Semantic Focused Web Crawler Based on Hybrid String Matching Algorithm [PDF]
Topic precise crawler is a special purpose web crawler, which downloads appropriate web pages analogous to a particular topic by measuring cosine similarity or semantic similarity score.
K. S. Sakunthala Prabha+2 more
openalex +2 more sources
Web Crawler and Web Crawler Algorithms: A Perspective
A web crawler is also called spider. For the intention of web indexing it automatically searches on the WWW. As the W3 is increasing day by day, globally the number of web pages grown massively. To make the search sociable for users, searching engine are mandatory. So to discover the particular data from the WWW search engines are operated. It would be
K Velkumar+2 more
openaire +4 more sources
Nowadays web pages are implemented in various kinds of languages on the Web and web crawlers are important for search engine. Language specific crawlers are crawlers that traverse and collect the relative web pages using the successive URls of web page. There are very few research areas in crawling for Myanmar Language web sites.
Su Mon Khine, Yadana Thein
openalex +4 more sources
A Semantic Focused Web Crawler Based on a Knowledge Representation Schema
The Web has become the main source of information in the digital world, expanding to heterogeneous domains and continuously growing. By means of a search engine, users can systematically search over the web for particular information based on a text ...
Julio Hernandez+2 more
exaly +2 more sources
Design, implementation and experiment of a YeSQL Web Crawler [PDF]
We describe a novel, "focusable", scalable, distributed web crawler based on GNU/Linux and PostgreSQL that we designed to be easily extendible and which we have released under a GNU public licence.
Deveaud, Romain+4 more
core +2 more sources
A Brief History of Web Crawlers [PDF]
Web crawlers visit internet applications, collect data, and learn about new web pages from visited pages. Web crawlers have a long and interesting history. Early web crawlers collected statistics about the web.
Bochmann, Gregor V.+5 more
core +3 more sources
Web crawler research methodology [PDF]
In economic and social sciences it is crucial to test theoretical models against reliable and big enough databases. The general research challenge is to build up a well-structured database that suits well to the given research question and that is cost ...
Nemeslaki, András+1 more
core +3 more sources
Quantifying an online wildlife trade using a web crawler
Legally protected plants are illegally traded through online sales platforms and orchids are a significant component of this wildlife trade. This study focused on salep, a compound product made from wild collected orchid tubers from several genera ...
S. Masters+11 more
semanticscholar +1 more source
ORCA - a Benchmark for Data Web Crawlers [PDF]
The number of RDF knowledge graphs available on the Web grows constantly. Gathering these graphs at large scale for downstream applications hence requires the use of crawlers. Although Data Web crawlers exist, and general Web crawlers could be adapted to focus on the Data Web, there is currently no benchmark to fairly evaluate their performance.
Axel-Cyrille Ngonga Ngomo+4 more
openaire +2 more sources
IHWC: intelligent hidden web crawler for harvesting data in urban domains
Due to the massive size of the hidden web, searching, retrieving and mining rich and high-quality data can be a daunting task. Moreover, with the presence of forms, data cannot be accessed easily.
Sawroop Kaur+3 more
semanticscholar +1 more source