Results 11 to 20 of about 1,734,941 (332)

Web Crawler: A Review [PDF]

open access: yesInternational Journal of Computer Applications, 2013
Information Retrieval deals with searching and retrieving information within the documents and it also searches the online databases and internet. Web crawler is defined as a program or software which traverses the Web and downloads web documents in a methodical, automated manner.
Sanjeev Kumar Singh   +2 more
openaire   +1 more source

Summary of web crawler technology research

open access: yesJournal of Physics: Conference Series, 2020
With the continuous development of network information technology, there is a large amount of unstructured data called big data on the network. Human resources to collect information laborious, so web crawler technology came into being.
Linxuan Yu   +5 more
semanticscholar   +1 more source

Aplikasi Web Crawler Untuk Web Content Pada Mobile Phone [PDF]

open access: yes, 2009
Crawling is the process behind a search engine, which served through the World Wide Web in a structured and with certain ethics. Applications that run the crawling process is called Web Crawler, also called web spider or web robot.
Basori, A. H. (Ahmad)   +2 more
core   +3 more sources

An architecture for a focused trend parallel web crawler with the application of clickstream analysis [PDF]

open access: yes, 2012
The tremendous growth of the Web poses many challenges for all-purpose single-process crawlers including the presence of some irrelevant answers among search results and the coverage and scaling issues regarding the enormous dimension of the World Wide ...
Ahmadi-Abkenari, Fatemeh, Selamat, Ali
core   +1 more source

Review of web crawlers

open access: yesInternational Journal of Knowledge and Web Intelligence, 2014
The web is a repository of large amount of data. Information available in the web is organised in the form of pages. Due to the presence of unlimited amount of information, searching and finding out appropriate information from the web is a task which needs expertise.
S. R. Sreeja, Sangita Chaudhari
openaire   +2 more sources

Design of a Parallel and Scalable Crawler for the Hidden Web

open access: yesInternational Journal of Information Retrieval Research, 2022
The WWW contains huge amount of information from different areas. This information may be present virtually in the form of web pages, media, articles (research journals / magazine), blogs etc.
Sonali Gupta, K. Bhatia
semanticscholar   +1 more source

SIMHAR - Smart Distributed Web Crawler for the Hidden Web Using SIM+Hash and Redis Server

open access: yesIEEE Access, 2020
Developing a distributed web crawler obliges major engineering challenges, all of which are eventually associated to scale. To retain corpus of search engine and a reasonable state of freshness, the crawler must be distributed over multiple computers. In
Sawroop Kaur, Ieee G. GEETHA Member
semanticscholar   +1 more source

Web Crawler: Design And Implementation For Extracting Article-Like Contents

open access: yesCybernetics and Physics, 2020
The World Wide Web is a large, wealthy, and accessible information system whose users are increasing rapidly nowadays. To retrieve information from the web as per users’ requests, search engines are built to access web pages.
Ngo Le Huy Hien   +2 more
semanticscholar   +1 more source

The data extraction using distributed crawler inside multi-agent system [PDF]

open access: yes, 2013
The paper discusses the use of web crawler technology. We created an application based on standard web crawler. Our application is determined for data extraction.
Dubec, Patrik   +4 more
core   +2 more sources

Web Crawler: A Survey

open access: yes, 2022
- In world wide web, Web Crawler is working as a software agent which helps in web indexing. Web crawler at times called as spider or internet bot explicitly operated by many search engines. In Information Retrieval to colleting an information Web crawler play an important role. For effective searching and web indexing is mostly depends on web crawlers.
openaire   +2 more sources

Home - About - Disclaimer - Privacy